Files

Jarvis Prime b8403be96c feat: repo-agnostic refactor (BMad spec-test-build loop)

- NEW: repo-profiler.js — deterministic archetype detection (Infra, Frontend, Backend, etc.)
- NEW: extract-dynamic.js — generic extractor replacing hardcoded Foxtrot patterns
- NEW: eval-generator.js — dynamic ground-truth question generation from any repo graph
- NEW: specs/bmad-agnostic-refactor-spec.md — full BMad spec with acceptance criteria
- REFACTORED: prose.js — two-pass LLM synthesis with rich context (shared secrets, ports, service refs)
- REFACTORED: sysdoc.js — wired repo-profiler + extract-dynamic, --legacy escape hatch
- REFACTORED: wiggum-v2.sh — uses eval-generator before benchmarks
- FIXED: graph.js — _edgeSet rebuilt on loadSnapshot() (edge dedup was broken)
- FIXED: graph.js — recursive sortKeys() for deep equality in diffing
- FIXED: prose.js — robust JSON array extraction from LLM output
- FIXED: ratchet.js — syntax validation (node --check) before saving LLM mutations
- FIXED: extract-dynamic.js — centralized state services regex, added console.warn for silent failures
- TESTS: test-eval-generator, test-repo-profiler, test-synthesis-quality + mock fixtures

Eval: 81.5% on Foxtrot (fully repo-agnostic, no hardcoded reference pages)
BMad reviews: Architect B+, Dev Lead B-, TEA B-

2026-03-11 14:40:31 +00:00

2.0 KiB

Raw Permalink Blame History

Implementation Plan: Repo-Agnostic Synthesis

1. Data Collection & Formatting

We already have the data in sysdoc.js:

deepData: Output of extract-deep.js (addons, tfConfigs, scriptParams, helmValues, stateServices).
helmGraph: The Helm dependencies and charts.
patterns: System patterns, layers, sync waves.
subs: The extracted subsystems.
Action: In sysdoc.js, format this raw data into a large stringified JSON or Markdown list to serve as context for the synthesis LLM.

2. LLM Synthesis Module (`prose.js`)

Create a new exported function: synthesizeReferencePages(extractedContext, outDir, llmOpts). This function will make parallel or sequential LLM calls to generate specific reference topics based on the extracted context.

Prompts:

network-architecture.md: Focus on CIDR allocations, VPCs, network routing, NAT, bastions found in the tfConfigs and helmValues.
operations.md: Focus on CI/CD pipelines, Jenkins jobs, branch mappings, timeout parameters, and deployment flows found in scriptParams and repo patterns.
configuration.md: Focus on config merge orders, region code logic, identifiers, naming conventions, and default values found in helmValues and tfConfigs.
dependencies.md: Focus on vertical layer dependencies, Helm chart consumption (e.g., runtime consuming common), and PR cross-repo dependencies.
index.md: An LLM call that takes summaries of the 4 generated pages and produces a keyword-rich routing table.

3. Pipeline Update (`sysdoc.js`)

At the end of the generateDocs function, just before writing the final files or after writing the basic ones, check if opts.prose is true. If so, call await proseMod.synthesizeReferencePages(extractedContext, referenceDir, llmOpts).

4. Cleanup

rm generate-reference-pages.js
Edit wiggum-v2.sh to remove the call to generate-reference-pages.js.

5. Execution

Run wiggum-v2.sh to generate the docs dynamically, then trigger the agent evaluation. The score should remain high without us cheating.

2.0 KiB Raw Permalink Blame History