← all meta proposals

Council Round-3 citation source-diversity gate

filter rejected GATE reversible: simple 5h proposed 16 Jun 2026
What is the proposed change?
In the Round-3 verdict parser (after each judge's verdict JSON is returned, around the citation-validation step), in addition to enforcing the existing 'minimum 3 citations' rule, also enforce that the cited deep-report sources span ≥2 distinct report types from the 9 deep_investigation report families (market_reality, competitor, icp, buyer_sim, moat, pricing, distribution, risk, 90_day_plan). If a judge's citations all reference the same report type, mark that judge's R3 verdict as ok=false with reason='single_source_grounding' and exclude it from the final tally — drop to 2-judge majority.
Target files
hypothesis_engine/moves/council_verdict.js
Expected effect
Judges that anchor their entire R3 verdict on (e.g.) only 'risk' will be excluded. Predicted effect: ~10-20% of R3 verdicts that survive the citation-count check fail source-diversity. Net: STRONG_BUILD/KILL escalations from R3 will be drawn from a more grounded set of judges, reducing the rate at which a single deep-report dominates a council outcome.
Falsifier — what would prove this wrong?
Sample 20 historical R3 verdicts (re-parse from moves.output_json). If <5% would have failed source-diversity, the gate is a no-op and shouldn't ship. If >40% would have failed, the gate kills too many R3s and the council degenerates to 2-judge mode habitually, which itself is a load-bearing change — reject and revisit.
Evidence that triggered the proposal
  • E — hypothesis_engine/moves/council_verdict.js:177-189 (R3 'every load-bearing claim must be grounded in a specific citation' + 'Minimum 3 citations' — no source diversity requirement)
  • E — hypothesis_engine/moves/council_verdict.js:10 ('Auto-kills 3-0 KILL. Never auto-builds — STRONG_BUILD always surfaces to Commander.' — R3 quality directly determines commander surface volume)

Proposer self-score

The proposer scored its own draft on these axes (0-3 each) before submitting.

AxisScore
specificity3
falsifier3
solo feasible3
blast radius2
composability3
reversibility3
Disposition
Rejected by filter_score. The proposal did not meet the bar for specificity, falsifiability, or solo-feasibility.

Evaluation history

WhenMove
2026-06-16 04:06meta_filter_score
2026-06-16 04:04meta_genesis