Skip to content

task: Parsing & extraction quality — language/format coverage gaps #592

Description

@DeusData

Scope

Umbrella tracker for graph extraction quality — language/format coverage gaps, missing node/edge kinds, and extraction false positives. These are not crashes (see the stability/performance umbrellas) but cases where indexing succeeds yet the resulting graph is shallow, mistyped, or wrong.

Sub-issues

Language coverage / LSP depth

Format / IaC / config extraction

Correctness / false positives

Content / docs indexing (BM25)

Tracked via open PR (not part of this task's open work)

Acceptance

Per item: the missing nodes/edges are emitted (or the false positive is suppressed), with a reproduce-first test on a public fixture, or the item is closed with rationale (out of scope / by design).

Why one task

These share the extraction pipeline (tree-sitter queries + hybrid-LSP resolvers + node/edge emission). Triaging them together keeps language/format coverage coherent instead of scattered one-off queries.

Metadata

Metadata

Assignees

No one assigned

    Labels

    parsing/qualityGraph extraction bugs, false positives, missing edgestaskUmbrella task grouping multiple related issues

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions