Go to file

Jarvis Prime 0265ec7a60 feat: confluence benchmark, pattern extractor, agent KB, UX spec

- extract-patterns.js: mines layered arch, ArgoCD appsets, cloud regions,
  CIDR allocations, naming conventions, sync waves, tech stack from code
- agent-kb.js: token-efficient JSON rendering of same doc tree
- eval-confluence-ref-questions.json: 32 reference-only benchmark questions
- wiggum-v2.sh: Ralph Wiggum loop targeting confluence baseline (77.8%)
- docs/human-ux-spec.md: BMad UX designer spec for human doc structure
- Eval results: V2 at 28.7% vs confluence 77.8% baseline
- Hub/spoke ownership now correctly extracted (95% on that question)
- Naming conventions, regions, CIDRs surfaced in system-architecture.md

2026-03-10 14:20:35 +00:00

docs

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

foxtrot-docs-v3

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

output

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

specs

Phase 8: Helm chart extraction with Go template support

2026-03-09 20:03:04 +00:00

test

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

.gitignore

Phase 8: Helm chart extraction with Go template support

2026-03-09 20:03:04 +00:00

agent-kb.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

benchmark-v1-v2.js

Phase 8: Helm chart extraction with Go template support

2026-03-09 20:03:04 +00:00

contracts.js

Phase 8b: Helm contract extraction + diagram support

2026-03-09 20:05:52 +00:00

diagrams.js

Phase 8b: Helm contract extraction + diagram support

2026-03-09 20:05:52 +00:00

doc-demo.js

Phase 8: Helm chart extraction with Go template support

2026-03-09 20:03:04 +00:00

docgen.js

Phase 6: LLM doc generation + Phase 7 system-docs spec

2026-03-09 06:20:54 +00:00

eval-agent-report-v2.json

Agent eval hits 93.4% — target exceeded

2026-03-10 00:40:38 +00:00

eval-agent-report-v3-oss.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-agent-report-v3.json

Agent eval hits 93.4% — target exceeded

2026-03-10 00:40:38 +00:00

eval-agent-report-v4.json

Agent eval hits 93.4% — target exceeded

2026-03-10 00:40:38 +00:00

eval-agent-report-v5.json

Agent eval hits 93.4% — target exceeded

2026-03-10 00:40:38 +00:00

eval-agent-report-v6.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-agent-report.json

Phase 9c: Split eval into Agent (file-browsing) and Human (readability) tracks

2026-03-09 23:55:54 +00:00

eval-agent.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-confluence-baseline.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-confluence-questions.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-confluence-ref-questions.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-human-report-v2.json

Phase 9d: Human eval score improvement\n\n- Human readability score increased from 63.9% to 78.6%\n- Structural table additions and quick lookup index resolved navigation bottlenecks\n- NOT_FOUND rate dropped from 17.9% to 3.6%

2026-03-10 00:46:37 +00:00

eval-human-report-v3.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-human-report.json

Phase 9c: Split eval into Agent (file-browsing) and Human (readability) tracks

2026-03-09 23:55:54 +00:00

eval-human.js

Phase 9c: Split eval into Agent (file-browsing) and Human (readability) tracks

2026-03-09 23:55:54 +00:00

eval-questions-iter1.json

Phase 9b: structural documentation improvements\n\n- sysdoc.js: Added Summary Statistics, Top Charts, and K8s Resource Types to architecture doc\n- Addresses ratchet failures where system-wide rollups were missing from generated prose\n- Eval v2 shows minor improvement, though RAG context window still limits wide scatter-gather queries

2026-03-09 23:40:07 +00:00

eval-questions-v2.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-questions.js

Agent eval hits 93.4% — target exceeded

2026-03-10 00:40:38 +00:00

eval-questions.json

Agent eval hits 93.4% — target exceeded

2026-03-10 00:40:38 +00:00

eval-report-v2.json

2026-03-09 23:40:07 +00:00

eval-report.json

Phase 9: Doc Evaluation Harness\n\n- eval-questions.js: Generates ground-truth questions from raw source data\n- eval.js: LLM-as-judge scoring harness (answers from docs, scores against truth)\n- Generated 33 questions covering config, dependencies, resources, and interactions\n- Baseline score: 66.7% (configuration 93%, dependencies 77%, structural 31%)

2026-03-09 22:32:41 +00:00

eval-v2-baseline.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval-wiggum-v2-iter-1.json

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

eval.js

2026-03-09 22:32:41 +00:00

extract-config.js

Dev Intel Pipeline v2 — multi-language semantic graph extractor

2026-03-09 05:29:29 +00:00

extract-helm.js

Phase 8: Helm chart extraction with Go template support

2026-03-09 20:03:04 +00:00

extract-patterns.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

extract-terraform.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

extract.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

fix-sysdoc-waves.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

flow.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

graph.js

Dev Intel Pipeline v2 — multi-language semantic graph extractor

2026-03-09 05:29:29 +00:00

impact.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

namespace.js

Dev Intel Pipeline v2 — multi-language semantic graph extractor

2026-03-09 05:29:29 +00:00

package-lock.json

Phase 8: Helm chart extraction with Go template support

2026-03-09 20:03:04 +00:00

package.json

Dev Intel Pipeline v2 — multi-language semantic graph extractor

2026-03-09 05:29:29 +00:00

patch-sysdoc-helm.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

patch-sysdoc.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

pipeline-v3.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

pipeline.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

promptfoo.yaml

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

prose.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

ratchet-history.json

2026-03-09 23:40:07 +00:00

ratchet.js

2026-03-09 23:40:07 +00:00

README.md

Add README with benchmarks and V1 vs V2 comparison

2026-03-09 05:30:21 +00:00

semantic-diff.js

Dev Intel Pipeline v2 — multi-language semantic graph extractor

2026-03-09 05:29:29 +00:00

subsystem.js

Phase 7A+7C: Subsystem aggregator + Flow tracer (post-review fixes)

2026-03-09 06:51:32 +00:00

supergraph.js

Phase 7F: Supergraph Multi-Repo Merge

2026-03-09 18:19:14 +00:00

sysdoc.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

validate-ground-truth.js

Dev Intel Pipeline v2 — multi-language semantic graph extractor

2026-03-09 05:29:29 +00:00

wiggum-fix.js

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

wiggum-v2-ref.log

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

wiggum-v2.log

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

wiggum-v2.sh

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

wiggum.sh

feat: confluence benchmark, pattern extractor, agent KB, UX spec

2026-03-10 14:20:35 +00:00

README.md

Developer Intelligence Pipeline v2

Multi-language semantic graph extractor that builds a knowledge graph from source code. Produces function-level call graphs, cross-file dependency maps, and semantic diffs — all without LLM calls.

Quick Start

npm install
node pipeline.js batch /path/to/repo --output /tmp/output

What It Does

Parses source code into a directed graph of entities (modules, functions, classes, configs) and relationships (CALLS, IMPORTS, CONTAINS, IMPLEMENTS). Then diffs snapshots to detect breaking changes, compute impact scores, and identify affected callers.

Supported Languages

Language	Parser	Entities
TypeScript/JavaScript	tree-sitter	Modules, Functions, Classes, Imports
Python	tree-sitter	Modules, Functions, Classes (with `_`/`__` visibility)
Go	tree-sitter	Modules, Functions, Structs, Receiver Methods
Java	tree-sitter	Modules, Functions, Classes, Interfaces
Bash	tree-sitter	Modules, Functions, `source` imports, Commands
YAML	js-yaml	Config keys (K8s manifests, Helm, KCL)
Terraform/HCL	regex	Resources, Data, Modules, Providers

Pipeline Phases

Phase 1: Entity Extraction (`extract.js`)

node extract.js /path/to/file.ts /repo/root

Outputs JSON with entities and relationships.

Phase 2: Graph Store (`graph.js`)

node graph.js build /dir/of/jsons snapshot.json
node graph.js query snapshot.json "cli/route.ts:tryRouteCli"
node graph.js diff old.json new.json

Phase 3: Namespace Registry (`namespace.js`)

node namespace.js build snap-a.json snap-b.json --output registry.json
node namespace.js resolve graph.json registry.json
node namespace.js lookup registry.json functionName

3-tier cross-repo resolution: exact ID → normalized path → name-only.

Phase 4: Semantic Diff (`semantic-diff.js`)

node semantic-diff.js diff old.json new.json
node semantic-diff.js score old.json new.json

Categorizes changes as breaking/significant/internal/cosmetic. Impact score 0-100.

Phase 5: Pipeline (`pipeline.js`)

node pipeline.js batch /repo --output /tmp/out     # Full extraction
node pipeline.js benchmark /repo --samples 20       # Performance test
node pipeline.js run /repo --snapshot prev.json     # Incremental diff

Benchmark (OpenClaw repo)

Metric	Value
Files	4,325
Extracted	4,259 (98.5%)
Nodes	21,646
Edges	133,979
Time	67 seconds
Avg/file	15ms

V1 vs V2

Metric	V1 POC	V2 Pipeline
Parse time	~2s	552ms
Total time	15-20 min (LLM)	552ms
Entities	files + imports	457 (4 types)
CALLS edges	0	1,290
Cross-file calls	No	51 resolved
Languages	Go only	8
Semantic diff	No	Yes
Impact scoring	No	Yes
Cost	~$0 (Ollama)	$0

Tested on labstack/echo (44 Go files)

Testing

bash test/run-all.sh          # 9/9 ground truth benchmark
node test/test-graph.js       # 25/25 graph store tests

Architecture

source files → extract.js → JSON → graph.js → snapshot.json
                                                    ↓
                                          semantic-diff.js → impact report
                                                    ↓
                                          namespace.js → cross-repo links

Zero external runtime dependencies beyond tree-sitter grammars.

License

MIT

README.md

Developer Intelligence Pipeline v2

Quick Start

What It Does

Supported Languages

Pipeline Phases

Phase 1: Entity Extraction (extract.js)

Phase 2: Graph Store (graph.js)

Phase 3: Namespace Registry (namespace.js)

Phase 4: Semantic Diff (semantic-diff.js)

Phase 5: Pipeline (pipeline.js)

Benchmark (OpenClaw repo)

V1 vs V2

Testing

Architecture

License

Phase 1: Entity Extraction (`extract.js`)

Phase 2: Graph Store (`graph.js`)

Phase 3: Namespace Registry (`namespace.js`)

Phase 4: Semantic Diff (`semantic-diff.js`)

Phase 5: Pipeline (`pipeline.js`)