feat(dependency-management): Improve dependency filtering for better output #20

DaxServer · 2025-08-31T14:50:19Z

This commit introduces several improvements to the dependency management system:

Filters out unconnected dependencies to optimize output
Automatically marks local types as required
Marks imported types as required only if they are actually referenced by other types
Automatically includes transitive dependencies when their parent types are referenced

The key changes are:

Added maincodeNodeIds and requiredNodeIds sets to track main code and required nodes
Updated addLocalTypes to mark local types as both main code and required
Adjusted collectFromImports to only mark imported types as required if they are referenced
Implemented filtering logic in getNodesToPrint to return only the required nodes

These changes ensure the dependency analysis produces a more optimized output, only including the necessary types and their transitive dependencies.

…output This commit introduces several improvements to the dependency management system: - Filters out unconnected dependencies to optimize output - Automatically marks local types as required - Marks imported types as required only if they are actually referenced by other types - Automatically includes transitive dependencies when their parent types are referenced The key changes are: - Added `maincodeNodeIds` and `requiredNodeIds` sets to track main code and required nodes - Updated `addLocalTypes` to mark local types as both main code and required - Adjusted `collectFromImports` to only mark imported types as required if they are referenced - Implemented filtering logic in `getNodesToPrint` to return only the required nodes These changes ensure the dependency analysis produces a more optimized output, only including the necessary types and their transitive dependencies.

coderabbitai · 2025-08-31T14:50:26Z

📝 Walkthrough

Summary by CodeRabbit

New Features
- Maincode filtering: output now includes only types defined in the main file and their connected dependencies, excluding unused imports.
- Smarter dependency inclusion with transitive resolution and cycle handling.
Refactor
- Dependency traversal modularized for clearer import collection, local type discovery, and dependency extraction.
- Output filtered to required nodes with stable ordering.
Documentation
- Expanded dependency management, added maincode filtering guide, and updated overview.
- Clarified circular dependency detection and resolution.
Tests
- Added comprehensive suites covering filtering, non-transitive vs. transitive inclusion, ordering, and circular references.

Walkthrough

Adds docs for dependency and maincode filtering. Refactors traversal to modular utilities for local type collection, import collection, and dependency extraction. Updates traversal to return only required nodes. Introduces TypeReferenceExtractor. Adds integration and unit tests validating filtered output, ordering, and circular references.

Changes

Cohort / File(s)	Summary
Documentation updates `docs/dependency-management.md`, `docs/maincode-filtering.md`, `docs/overview.md`	Expand dependency filtering, add maincode-filtering doc, and link it from overview; clarify detection/resolution for circular deps and filtering rules.
Traversal core refactor `src/traverse/dependency-traversal.ts`	Orchestrates via new utilities: collects local types, imports, extracts dependencies; filters to required nodes in getNodesToPrint; removes in-class helpers. Public API surface retained.
New traversal utilities `src/traverse/dependency-extractor.ts`, `src/traverse/import-collector.ts`, `src/traverse/local-type-collector.ts`, `src/traverse/type-reference-extractor.ts`	Add modular components: dependency propagation over NodeGraph, recursive import collection into graphs, local type collection and marking required/maincode, and AST-based type reference extraction.
Tests: traversal/codegen behavior `tests/traverse/dependency-ordering.test.ts`, `tests/traverse/dependency-traversal.integration.test.ts`, `tests/traverse/maincode-filter.test.ts`, `tests/traverse/non-transitive-dependency.test.ts`	Add/reshape tests to assert generated code only includes maincode and connected dependencies, correct ordering, and cycle handling; switch assertions to formatted output comparisons.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  actor TestOrCLI as Caller
  participant TS as SourceFile (ts-morph)
  participant DT as DependencyTraversal
  participant LTC as LocalTypeCollector
  participant IC as ImportCollector
  participant NG as NodeGraph
  participant FG as FileGraph
  participant DEX as DependencyExtractor
  participant TRE as TypeReferenceExtractor

  TestOrCLI->>DT: startTraversal(TS)
  DT->>LTC: addLocalTypes(TS, NG, mainIds, requiredIds)
  LTC-->>NG: register local nodes (isImported=false, isMainCode=true)
  LTC-->>DT: update mainIds, requiredIds

  DT->>IC: collectFromImports(TS.imports)
  IC->>FG: ensure file nodes / prevent cycles
  IC->>NG: add imported nodes (isImported=true)
  IC->>IC: recurse into nested imports

  DT->>DEX: extractDependencies(NG, requiredIds)
  loop until fixed-point
    DEX->>NG: get node to process
    DEX->>TRE: extractTypeReferences(tsNode)
    TRE-->>DEX: referenced qualified names[]
    DEX->>NG: addDependency(referenced -> current)
    DEX-->>DT: add to requiredIds / queue
  end

  DT->>DT: getNodesToPrint(requiredIds)
  DT-->>TestOrCLI: ordered subset for codegen

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

feat: Improve dependency traversal for interface inheritance #14 — Both update dependency extraction/traversal to recognize interface inheritance and related type-reference logic.
feat(graph): rework dependency traversal with graphology DAG #13 — Shares the refactor introducing FileGraph/NodeGraph and modular traversal utilities.
feat(dependency-traversal): Implement comprehensive type dependency traversal #12 — Also restructures the dependency traversal subsystem with a new architecture.

✨ Finishing Touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch maincode-filtering

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR/Issue comments)

Type @coderabbitai help to get the list of available commands.

Other keywords and placeholders

Add @coderabbitai ignore or @coderabbit ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Status, Documentation and Community

Visit our Status Page to check the current availability of CodeRabbit.
Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 6

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

tests/traverse/dependency-traversal.integration.test.ts (1)

435-486: Use lazy/recursive TypeBox APIs for circular references
Eagerly calling Type.Optional(B) in A before B is initialized triggers a TDZ at module load; switch to a lazy pattern (e.g. Type.Recursive or $id+Type.Ref) so cross‐references defer evaluation.

🧹 Nitpick comments (25)

docs/dependency-management.md (2)
32-38: Tighten the 6-step process wording for precision

Minor clarity/grammar tweaks and make Step 4 explicit about the criterion (membership in requiredNodeIds / reachability from maincode).
-1. Collect local types from main file (automatically marked as required)
-2. Process import chains recursively (adds types to graph)
-3. Extract type dependencies (marks referenced types as required)
-4. Filter to only connected dependencies
-5. Build dependency graph
-6. Topological sort for processing order
+1. Collect local types from the main file (automatically mark as required).
+2. Process import chains recursively (add discovered types to the graph).
+3. Extract type dependencies (mark actually referenced types as required).
+4. Filter nodes to the connected set (members of requiredNodeIds or reachable from maincode).
+5. Build the dependency graph.
+6. Topologically sort the filtered nodes to get the processing order.
11-13: Echo “filtering” intent in the component bullets

Consider mentioning the “required set” here so the orchestrator’s responsibilities mirror the algorithm.
-- Filters out unconnected dependencies
+- Maintains a required set and filters out unconnected dependencies
src/traverse/local-type-collector.ts (2)
16-22: Revisit “import-only entry file” behavior to match PR objective (“imported types only if referenced”)

Current branch adds all named imports as required when there are no local types, which bloats output and contradicts the stated goal. Either drop the auto-require or guard it behind a feature flag.

Option A (strict, matches docs): do not add imports to required here.
-  // If main file has no local types but has imports, add all imported types as required
+  // If the main file has no local types, do not auto-mark imports as required.
+  // They will be pulled in only if referenced during dependency extraction.
Option B (keep seeding but make it explicit): add a boolean parameter seedImportsWhenNoLocal = false and gate the requiredNodeIds.add(...) on it.

87-103: Coverage: add classes and type-only filter (optional)

If classes are a supported construct elsewhere, include them. Also consider skipping non type-only imports in the import-only branch to avoid accidental value captures.

I can send a small patch adding class collection and honoring importDecl.isTypeOnly().
docs/maincode-filtering.md (3)
18-29: Use headings instead of bold labels for file captions (fixes MD036)
-**external.ts**
+#### external.ts
@@
-**main.ts**
+#### main.ts
76-101: Standardize section captions in the second example (fixes MD036)
-**level1.ts**
+#### level1.ts
@@
-**level2.ts**
+#### level2.ts
@@
-**main.ts**
+#### main.ts
9-13: Clarify “connected dependencies” and state the import-only edge case

Slight wording tweak and add a brief note about files with no local types so readers know what to expect.
-2. **Connected dependencies**: Types that are directly or indirectly referenced by maincode nodes
+2. **Connected dependencies**: Types directly or indirectly referenced by maincode nodes (i.e., reachable via the dependency graph)
@@
-Imported types that are not referenced anywhere in the dependency chain are automatically filtered out.
+Imported types that are not referenced anywhere in the dependency chain are automatically filtered out.
+Note: If the entry file contains no local types, decide whether to seed its named imports as required (include all) or keep strict filtering (include none). The implementation defaults should be documented alongside configuration.
tests/traverse/dependency-ordering.test.ts (4)
151-223: Great end-to-end test; add one for re-exports to prevent regression on qualified-name resolution

This suite is strong. One missing case: main.ts imports a type re-exported from index.ts which re-exports from types.ts. This guards the qualified-name bug and ensures the traversal resolves to the declaration file.

If helpful, I can draft a test snippet covering export { Foo } from './types' via an index barrel.

152-153: Avoid shadowing the top-level project in these tests

Reuse the beforeEach-initialized project for consistency and to reduce setup overhead.
-  test('should handle complex structure without including unused types', () => {
-    const project = new Project()
+  test('should handle complex structure without including unused types', () => {
322-323: Same nit: reuse the shared project
-  test('should handle circular references', () => {
-    const project = new Project()
+  test('should handle circular references', () => {
224-319: Consider an “import-only entry file” test to lock intended behavior

Add a focused test where the entry file only has import type { Foo } from './foo' and no local declarations, then assert the output (either empty or seeded) per the chosen policy. This will keep docs, PR goals, and implementation in sync.
src/traverse/dependency-extractor.ts (3)
17-22: Avoid O(n) extraction from Set on every loop iteration

Use the Set iterator instead of Array.from(...)[0] to reduce per-iteration overhead and allocations.
-  while (nodesToProcess.size > 0) {
-    const currentNodeId = Array.from(nodesToProcess)[0]
-    if (!currentNodeId) break
-
-    nodesToProcess.delete(currentNodeId)
+  while (nodesToProcess.size > 0) {
+    const it = nodesToProcess.values()
+    const currentNodeId = it.next().value as string | undefined
+    if (!currentNodeId) break
+    nodesToProcess.delete(currentNodeId)
23-31: Guard graph access and mark processed after verifying node existence

Prevents potential throws from getNode and avoids marking a node as processed when it’s not in the graph.
-    if (processedNodes.has(currentNodeId)) continue
-    processedNodes.add(currentNodeId)
-
-    const nodeData = nodeGraph.getNode(currentNodeId)
-    if (!nodeData) continue
+    if (processedNodes.has(currentNodeId)) continue
+    if (!nodeGraph.hasNode(currentNodeId)) continue
+    const nodeData = nodeGraph.getNode(currentNodeId)
+    processedNodes.add(currentNodeId)
44-55: Deduplicate references and avoid re-queuing already processed nodes

This reduces redundant edge checks and churn in nodesToProcess.
-    const typeReferences = typeReferenceExtractor.extractTypeReferences(nodeToAnalyze)
+    const uniqueRefs = new Set(typeReferenceExtractor.extractTypeReferences(nodeToAnalyze))
 
-    for (const referencedType of typeReferences) {
+    for (const referencedType of uniqueRefs) {
       if (nodeGraph.hasNode(referencedType)) {
-        // Only add to required if not already processed
-        if (!requiredNodeIds.has(referencedType)) {
-          requiredNodeIds.add(referencedType)
-          nodesToProcess.add(referencedType)
-        }
+        if (!requiredNodeIds.has(referencedType)) requiredNodeIds.add(referencedType)
+        if (!processedNodes.has(referencedType)) nodesToProcess.add(referencedType)
         nodeGraph.addDependency(referencedType, currentNodeId)
       }
     }
src/traverse/import-collector.ts (3)
23-28: Consider restricting initial collection to exported declarations

Collecting every alias/interface/enum/function (including non-exported) can balloon the graph for large modules. If you only need non-exported nodes when they’re referenced by an exported surface, prefer starting from getExportedDeclarations() and lazily discovering internals during dependency extraction.

72-85: Confirm functions should be modeled as “type nodes”

You add function declarations as nodes (type: 'function'). If runtime function bodies are irrelevant to schema generation and only their signatures contribute types, ensure downstream code treats these nodes purely for type-surface traversal and never emits function runtime. Otherwise, skip functions here.

30-41: Maintain determinism: sort discovered declarations before insertion

Sorting typeAliases, interfaces, enums, and functions by name before addTypeNode makes graph construction deterministic across Node.js versions and file-system orders, improving reproducible builds.
-      const typeAliases = moduleSourceFile.getTypeAliases()
+      const typeAliases = moduleSourceFile.getTypeAliases().sort((a,b) => a.getName().localeCompare(b.getName()))
-      const interfaces = moduleSourceFile.getInterfaces()
+      const interfaces = moduleSourceFile.getInterfaces().sort((a,b) => a.getName().localeCompare(b.getName()))
-      const enums = moduleSourceFile.getEnums()
+      const enums = moduleSourceFile.getEnums().sort((a,b) => a.getName().localeCompare(b.getName()))
-      const functions = moduleSourceFile.getFunctions()
+      const functions = moduleSourceFile.getFunctions().sort((a,b) => (a.getName() ?? '').localeCompare(b.getName() ?? ''))
Also applies to: 43-57, 59-70
tests/traverse/maincode-filter.test.ts (1)

142-231: Add coverage for generics, typeof, and inheritance cases

Since TypeReferenceExtractor handles TypeQuery (typeof), ExpressionWithTypeArguments, and generic type args, add a test where maincode extends a generic imported base and uses typeof SomeConst to ensure references are picked up.

I can add a concise test snippet if you want.

tests/traverse/dependency-traversal.integration.test.ts (1)

181-188: Mixed testing styles

Most tests validate code-gen strings; this one still asserts via DependencyTraversal. Consider consolidating on the public code-gen surface for consistency unless this is intentionally covering the low-level API.

tests/traverse/non-transitive-dependency.test.ts (1)

75-115: Alias-transitive case covered; consider adding duplicate-name module test

Add a case with two imported modules exporting Id to validate disambiguation picks the correct one. This will surface the current name-only matching limitation.
src/traverse/type-reference-extractor.ts (2)
27-42: Handle typeof import('...').X and avoid missing references from TypeQuery import forms.

TypeQuery can be “typeof import(‘...’).Foo”. Support ImportTypeNode to resolve to Foo’s declaration.

Example addition:
       if (Node.isTypeQuery(node)) {
         const exprName = node.getExprName()
-        if (
+        if (
           Node.isIdentifier(exprName) ||
           Node.isQualifiedName(exprName) ||
           Node.isPropertyAccessExpression(exprName)
         ) {
           // existing logic
+        } else if (Node.isImportTypeNode(exprName)) {
+          const qualifier = exprName.getQualifier()
+          if (qualifier) {
+            const typeName = qualifier.getText().split('.').pop()!
+            for (const qualifiedName of this.nodeGraph.nodes()) {
+              const nodeData = this.nodeGraph.getNode(qualifiedName)
+              if (nodeData.originalName === typeName) {
+                references.push(qualifiedName)
+                break
+              }
+            }
+          }
         }
       }
7-10: Deduplicate before returning to cut redundant work downstream.

This keeps requiredNodeIds churn low without touching all push sites.
-    return references
+    return Array.from(new Set(references))
Also applies to: 119-120
src/traverse/dependency-traversal.ts (3)
41-50: Ensure deterministic output when cycles exist — sort fallback by qualifiedName.

Array.from(nodes) can produce order tied to insertion, which may vary by traversal. Sort by qualifiedName for stability.
-    const allNodesInOrder = hasCycle(this.nodeGraph)
-      ? Array.from(this.nodeGraph.nodes())
-      : topologicalSort(this.nodeGraph)
+    const allNodesInOrder = hasCycle(this.nodeGraph)
+      ? Array.from(this.nodeGraph.nodes()).sort((a, b) =>
+          this.nodeGraph.getNode(a).qualifiedName.localeCompare(
+            this.nodeGraph.getNode(b).qualifiedName,
+          ),
+        )
+      : topologicalSort(this.nodeGraph)
21-33: Guard against stale state across multiple runs — reset graphs/sets at start.

If a single DependencyTraversal instance is reused, previous state leaks. Clear graphs/sets before new traversal.

Add this method and call it at Line 21:
private reset(): void {
  this.fileGraph = new FileGraph()
  this.nodeGraph = new NodeGraph()
  this.maincodeNodeIds.clear()
  this.requiredNodeIds.clear()
  this.importCollector = new ImportCollector(this.fileGraph, this.nodeGraph)
}
-  startTraversal(mainSourceFile: SourceFile): TraversedNode[] {
+  startTraversal(mainSourceFile: SourceFile): TraversedNode[] {
+    this.reset()
17-20: maincodeNodeIds is populated but unused here. Remove or put it to use.

If not used for filtering/visualization, drop it to reduce state surface; otherwise, consider exposing it for debugging or tests.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 44a13e5 and 3426984.

📒 Files selected for processing (12)

docs/dependency-management.md (3 hunks)
docs/maincode-filtering.md (1 hunks)
docs/overview.md (1 hunks)
src/traverse/dependency-extractor.ts (1 hunks)
src/traverse/dependency-traversal.ts (1 hunks)
src/traverse/import-collector.ts (1 hunks)
src/traverse/local-type-collector.ts (1 hunks)
src/traverse/type-reference-extractor.ts (1 hunks)
tests/traverse/dependency-ordering.test.ts (2 hunks)
tests/traverse/dependency-traversal.integration.test.ts (14 hunks)
tests/traverse/maincode-filter.test.ts (1 hunks)
tests/traverse/non-transitive-dependency.test.ts (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (9)

src/traverse/local-type-collector.ts (2)

src/traverse/node-graph.ts (1)

NodeGraph (8-38)

src/utils/generate-qualified-name.ts (1)

generateQualifiedNodeName (20-25)

src/traverse/import-collector.ts (3)

src/traverse/file-graph.ts (1)

FileGraph (13-25)

src/traverse/node-graph.ts (1)

NodeGraph (8-38)

src/utils/generate-qualified-name.ts (1)

generateQualifiedNodeName (20-25)

src/traverse/dependency-extractor.ts (2)

src/traverse/node-graph.ts (1)

NodeGraph (8-38)

src/traverse/type-reference-extractor.ts (1)

TypeReferenceExtractor (4-121)

tests/traverse/non-transitive-dependency.test.ts (1)

tests/utils.ts (3)

createSourceFile (12-14)

generateFormattedCode (26-37)

formatWithPrettier (16-24)

src/traverse/type-reference-extractor.ts (1)

src/traverse/node-graph.ts (1)

NodeGraph (8-38)

tests/traverse/maincode-filter.test.ts (1)

tests/utils.ts (3)

createSourceFile (12-14)

generateFormattedCode (26-37)

formatWithPrettier (16-24)

tests/traverse/dependency-ordering.test.ts (1)

tests/utils.ts (3)

createSourceFile (12-14)

generateFormattedCode (26-37)

formatWithPrettier (16-24)

tests/traverse/dependency-traversal.integration.test.ts (1)

tests/utils.ts (3)

createSourceFile (12-14)

generateFormattedCode (26-37)

formatWithPrettier (16-24)

src/traverse/dependency-traversal.ts (6)

src/traverse/file-graph.ts (1)

FileGraph (13-25)

src/traverse/node-graph.ts (1)

NodeGraph (8-38)

src/traverse/import-collector.ts (1)

ImportCollector (6-91)

src/traverse/types.ts (1)

TraversedNode (3-10)

src/traverse/local-type-collector.ts (1)

addLocalTypes (5-104)

src/traverse/dependency-extractor.ts (1)

extractDependencies (11-57)

🪛 LanguageTool

docs/maincode-filtering.md

[grammar] ~9-~9: There might be a mistake here.
Context: ...d in the main source file (not imported) 2. Connected dependencies: Types that are...

(QB_NEW_EN)

[grammar] ~115-~115: There might be a mistake here.
Context: ...incode nodes are included in the output.

(QB_NEW_EN)

docs/dependency-management.md

[grammar] ~32-~32: There might be a mistake here.
Context: ...ies ## Process 1. Collect local types from main file (automatically marked as requ...

(QB_NEW_EN)

[grammar] ~33-~33: There might be a mistake here.
Context: ...s import chains recursively (adds types to graph) 3. Extract type dependencies (ma...

(QB_NEW_EN)

[grammar] ~36-~36: There might be a mistake here.
Context: ...ilter to only connected dependencies 5. Build dependency graph 6. Topological sort fo...

(QB_NEW_EN)

[grammar] ~43-~43: There might be a mistake here.
Context: ...: Always included (defined in main file) - Imported types: Only included if actua...

(QB_NEW_EN)

[grammar] ~44-~44: There might be a mistake here.
Context: ...ed if actually referenced by other types - Transitive dependencies: Automatically...

(QB_NEW_EN)

🪛 markdownlint-cli2 (0.17.2)

docs/maincode-filtering.md

18-18: Emphasis used instead of a heading