task: Parsing & extraction quality — language/format coverage gaps #592

## Scope

Umbrella tracker for **graph extraction quality** — language/format coverage gaps, missing node/edge kinds, and extraction false positives. These are not crashes (see the stability/performance umbrellas) but cases where indexing *succeeds* yet the resulting graph is shallow, mistyped, or wrong.

## Sub-issues

### Language coverage / LSP depth
- [ ] #382 — Java: `@Annotation`, signatures, and AST properties missing from graph nodes
- [ ] #405 — Rust LSP (hybrid-LSP tier resolution)
- [ ] #535 — Hybrid LSP support for Julia
- [ ] #415 — Index inner declarations of factory/setup callbacks (Vue Pinia setup stores, composables, React hooks)

### Format / IaC / config extraction
- [ ] #450 — GitHub Actions semantic extraction (workflows, jobs, steps, uses/needs edges)
- [ ] #451 — Python class-field nodes, type-annotation edges, enum members
- [ ] #452 — Terraform module composition edges + block-type labels
- [ ] #454 — Helm nested `.Values` linkage, template→kind typing, image/env + hook/RBAC edges
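For #450, a minimal sketch of what "uses/needs edges" could look like, assuming the workflow YAML has already been parsed into a plain dict. The function name `workflow_edges` and the edge labels `NEEDS` / `USES` are hypothetical and are not taken from the existing extractor. It relies only on the documented workflow shape: `jobs.<id>.needs` may be a string or a list, and steps may carry a `uses` key.

```python
from __future__ import annotations


def workflow_edges(workflow: dict) -> list[tuple[str, str, str]]:
    """Return (source, label, target) triples for one parsed workflow.

    Labels "NEEDS" and "USES" are illustrative placeholders only.
    """
    edges: list[tuple[str, str, str]] = []
    for job_id, job in (workflow.get("jobs") or {}).items():
        needs = job.get("needs", [])
        # `needs` may be a single job id or a list of job ids.
        if isinstance(needs, str):
            needs = [needs]
        for dep in needs:
            edges.append((job_id, "NEEDS", dep))
        # Each step that references an action yields a job -> action edge.
        for step in job.get("steps") or []:
            if "uses" in step:
                edges.append((job_id, "USES", step["uses"]))
    return edges


example = {
    "jobs": {
        "build": {"steps": [{"uses": "actions/checkout@v4"}, {"run": "make"}]},
        "test": {"needs": "build", "steps": [{"run": "make test"}]},
        "deploy": {"needs": ["build", "test"], "steps": []},
    }
}
edges = workflow_edges(example)
```

Reusable workflows called at job level (`jobs.<id>.uses`) are not covered by this sketch and would need their own edge.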

### Correctness / false positives
- [ ] #495 — cfg-gated twin functions collapse into one node; `get_code_snippet` returns inactive branch
- [ ] #521 — Route nodes created from URL strings in config / non-source files
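For #521, one possible shape of the suppression is a file-type gate applied before Route emission. This is a sketch under assumptions: the helper `should_emit_route` and the `SOURCE_SUFFIXES` allow-list are hypothetical, and the real fix may key off the language tier or node context rather than the extension.

```python
from pathlib import PurePosixPath

# Hypothetical allow-list: file extensions treated as source code for the
# purpose of Route extraction. The actual list would come from the indexer.
SOURCE_SUFFIXES = {".py", ".js", ".ts", ".go", ".java", ".rb", ".rs", ".php"}


def should_emit_route(file_path: str) -> bool:
    """Return True only when the URL string was found in a source file."""
    return PurePosixPath(file_path).suffix.lower() in SOURCE_SUFFIXES


candidates = [
    ("app/routes.py", "/api/users"),
    ("config/settings.yaml", "https://example.com/health"),
    ("README.md", "/docs/getting-started"),
]
routes = [(path, url) for path, url in candidates if should_emit_route(path)]
```

An extension gate is coarse: it would still let through URL literals in source files that are not route declarations, so it only addresses the config / non-source half of the report.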

### Content / docs indexing (BM25)
- [ ] #490 — Index documents as well
- [ ] #518 — Section nodes don't index body text — BM25 can't search markdown content
- [ ] #519 — META.yaml / frontmatter description values not indexed for BM25
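For #518, the gap is that a Section node carries its heading but not the text under it. A minimal sketch of attaching body text per section, assuming ATX-style (`#`) headings only; the `Section` type and `split_sections` are hypothetical names, and fenced code blocks containing `#` lines are deliberately not handled here.

```python
from __future__ import annotations

import re
from dataclasses import dataclass

# ATX heading: one to six '#' characters, whitespace, then the title.
HEADING = re.compile(r"^(#{1,6})\s+(.*)$")


@dataclass
class Section:
    title: str
    level: int
    body: str  # text between this heading and the next one


def split_sections(markdown: str) -> list[Section]:
    """Split markdown into sections, keeping each section's body text."""
    sections: list[Section] = []
    title, level, lines = None, 0, []
    for line in markdown.splitlines():
        match = HEADING.match(line)
        if match:
            # Close the section that was open before this heading.
            if title is not None:
                sections.append(Section(title, level, "\n".join(lines).strip()))
            title, level, lines = match.group(2).strip(), len(match.group(1)), []
        elif title is not None:
            lines.append(line)
    if title is not None:
        sections.append(Section(title, level, "\n".join(lines).strip()))
    return sections


doc = "# Install\nRun the setup script.\n\n## Options\nUse --fast to skip checks.\n"
sections = split_sections(doc)
```

Each `body` string is what would be handed to the BM25 index alongside the title. Text that appears before the first heading is dropped in this sketch.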

### Tracked via open PR (not part of this task's open work)
- #438 / #554 — cross-file & C++ out-of-line CALLS edges resolve to Module instead of the enclosing function — PR #463
- #459 — Perl LSP-tier semantic resolution — PR #461
- #462 — ObjectScript (InterSystems IRIS) language support — PR #467 / #590
- #440 — cross-repo Maven library dependency links — PR #442
- #574 — SQL DDL Table/View nodes + FROM/JOIN lineage — PR #582
- #575 — dbt lineage/macros from raw `.sql` — PR #584
- #576 — dbt `manifest.json` ingest + DEPENDS_ON — PR #583

## Acceptance

Per item, one of the following holds:

- The missing nodes/edges are emitted (or the false positive is suppressed), backed by a reproduce-first test on a public fixture.
- The item is closed with a rationale (out of scope / by design).

## Why one task

These items share the extraction pipeline (tree-sitter queries + hybrid-LSP resolvers + node/edge emission). Triaging them together keeps language/format coverage coherent instead of scattering it across one-off queries.

