Max Mayfield max
  • Joined on 2026-02-10
max pushed to master at max/dev-intel-v2 2026-03-11 07:40:41 -07:00
b8403be96c feat: repo-agnostic refactor (BMad spec-test-build loop)
max pushed to master at max/dev-intel-v2 2026-03-10 12:01:22 -07:00
15fb1a753b Add deep extractors, reference pages, keyword index; eval 53.3%
max pushed to master at max/dev-intel-v2 2026-03-10 07:20:48 -07:00
0265ec7a60 feat: confluence benchmark, pattern extractor, agent KB, UX spec
max pushed to master at max/dev-intel-v2 2026-03-09 17:46:38 -07:00
049609a358 Phase 9d: Human eval score improvement
  - Human readability score increased from 63.9% to 78.6%
  - Structural table additions and quick lookup index resolved navigation bottlenecks
  - NOT_FOUND rate dropped from 17.9% to 3.6%
max pushed to master at max/dev-intel-v2 2026-03-09 17:40:39 -07:00
ca11b4459a Agent eval hits 93.4% — target exceeded
max pushed to master at max/dev-intel-v2 2026-03-09 16:55:57 -07:00
304f0a9e9f Phase 9c: Split eval into Agent (file-browsing) and Human (readability) tracks
max pushed to master at max/dev-intel-v2 2026-03-09 16:40:09 -07:00
0cc4abcb0f Phase 9b: structural documentation improvements
  - sysdoc.js: Added Summary Statistics, Top Charts, and K8s Resource Types to architecture doc
  - Addresses ratchet failures where system-wide rollups were missing from generated prose
  - Eval v2 shows minor improvement, though RAG context window still limits wide scatter-gather queries
max pushed to master at max/dev-intel-v2 2026-03-09 15:32:42 -07:00
b99341e8bc Phase 9: Doc Evaluation Harness
  - eval-questions.js: Generates ground-truth questions from raw source data
  - eval.js: LLM-as-judge scoring harness (answers from docs, scores against truth)
  - Generated 33 questions covering config, dependencies, resources, and interactions
  - Baseline score: 66.7% (configuration 93%, dependencies 77%, structural 31%)
max pushed to master at max/dev-intel-v2 2026-03-09 13:15:51 -07:00
d9fa087e22 Phase 6+7: LLM prose generation pass over Foxtrot docs
  - Ran Claude Haiku to generate prose for architecture, subsystems, flows, and 124 Helm contracts
  - Fixed describeContract prompt in prose.js to correctly identify and describe Helm contract types without hallucinating
  - 80 files generated with rich architectural summaries
max pushed to master at max/dev-intel-v2 2026-03-09 13:05:54 -07:00
4f7c77b3b1 Phase 8b: Helm contract extraction + diagram support
max pushed to master at max/dev-intel-v2 2026-03-09 13:03:15 -07:00
f49a6c2dd9 Phase 8: Helm chart extraction with Go template support
max pushed to master at max/dev-intel-v2 2026-03-09 11:44:20 -07:00
d19cee36d7 Phase 6+7D: Sonnet prose generation integration
max pushed to master at max/dev-intel-v2 2026-03-09 11:35:15 -07:00
1869fcb5b2 7B: Add parse error tracking (BMad review fix)
max pushed to master at max/dev-intel-v2 2026-03-09 11:19:15 -07:00
ca02fe131b Phase 7F: Supergraph Multi-Repo Merge
max pushed to master at max/dev-intel-v2 2026-03-09 07:42:17 -07:00
d9fd7e3284 Phase 7B, 7E, 7D: Contracts, Diagrams, Sysdoc
max pushed to master at max/dev-intel-v2 2026-03-08 23:51:34 -07:00
4c212740a2 Phase 7A+7C: Subsystem aggregator + Flow tracer (post-review fixes)
max pushed to master at max/dev-intel-v2 2026-03-08 23:20:57 -07:00
4221ab4d76 Phase 6: LLM doc generation + Phase 7 system-docs spec
max pushed to master at max/dev-intel-v2 2026-03-08 22:30:22 -07:00
7d5b6cbc32 Add README with benchmarks and V1 vs V2 comparison
max pushed to master at max/dev-intel-v2 2026-03-08 22:29:53 -07:00
efb12d003b Dev Intel Pipeline v2 — multi-language semantic graph extractor
max created branch master in max/dev-intel-v2 2026-03-08 22:29:53 -07:00