A self-grading hypothesis engine

Abstract

Essence evaluates product opportunities the way it would evaluate any other claim: adversarial multi-model debate, structured filters, and a public verdict for every candidate. This dashboard is the live record.

What this is

What is it?
An autonomous engine that proposes, argues, evidences, and scores product hypotheses. Multiple frontier LLMs debate each candidate. A five-axis filter scores each one across three independent runs. Survivors graduate; failures are killed with documented reasoning.
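
A minimal sketch of what that scoring-and-graduation step could look like. The axis names, the 0-10 scale, the graduation bar, and the median-of-three-runs aggregation below are illustrative assumptions, not the engine's actual rubric.

```python
from dataclasses import dataclass
from statistics import median

# Illustrative axis names and graduation bar -- assumptions, not the real rubric.
AXES = ("pain", "reachability", "willingness_to_pay", "moat", "build_fit")
GRADUATION_BAR = 7.0  # hypothetical threshold on an assumed 0-10 scale

@dataclass
class FilterRun:
    """One independent scoring pass over a candidate hypothesis."""
    scores: dict[str, float]  # axis -> score

def verdict(runs: list[FilterRun]) -> tuple[str, dict[str, float]]:
    """Aggregate independent runs per axis, then grade the candidate."""
    per_axis = {axis: median(run.scores[axis] for run in runs) for axis in AXES}
    # Assumed rule: a candidate graduates only if every axis clears the bar.
    status = "graduated" if all(s >= GRADUATION_BAR for s in per_axis.values()) else "killed"
    return status, per_axis

runs = [
    FilterRun({"pain": 8, "reachability": 7, "willingness_to_pay": 9, "moat": 6, "build_fit": 8}),
    FilterRun({"pain": 9, "reachability": 8, "willingness_to_pay": 8, "moat": 7, "build_fit": 7}),
    FilterRun({"pain": 8, "reachability": 7, "willingness_to_pay": 9, "moat": 7, "build_fit": 8}),
]
print(verdict(runs))  # ('graduated', {...}) under these sample scores
```
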
Why this approach?
Anyone can generate convincing-looking product ideas. The harder question is which ones survive structured scrutiny. The engine answers that question on the record, with the reasoning visible.
What breaks?
The engine grades its own filter coverage. New failure modes (rubric blind spots, structural mismatches, distribution-shape mistakes) are surfaced via Commander overrides and patched into the next prompt revision. Every override is logged.
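
A sketch of what recording one of those overrides might look like. The field names, the failure-mode labels, and the JSONL move log are assumptions about the record format, not the engine's actual schema.

```python
import json
from datetime import datetime, timezone
from pathlib import Path

MOVE_LOG = Path("moves.jsonl")  # hypothetical append-only log of engine moves

def log_override(hypothesis_id: str, filter_verdict: str,
                 commander_verdict: str, failure_mode: str, note: str) -> dict:
    """Record a Commander override so the miss can feed the next prompt revision."""
    entry = {
        "ts": datetime.now(timezone.utc).isoformat(),
        "move": "commander_override",
        "hypothesis_id": hypothesis_id,
        "filter_verdict": filter_verdict,        # what the automated filter decided
        "commander_verdict": commander_verdict,  # what human review decided instead
        "failure_mode": failure_mode,            # e.g. "rubric_blind_spot", "structural_mismatch"
        "note": note,                            # documented reasoning behind the override
    }
    with MOVE_LOG.open("a") as f:
        f.write(json.dumps(entry) + "\n")
    return entry

log_override("H-042", "graduated", "killed", "structural_mismatch",
             "Seller-side workflow; build complexity understated by the rubric.")
```
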
What we have learned so far
Buyer-side products consistently outperform seller-side ones when the seller monetises conviction. Structural fit (workflow shape, build complexity) matters more than scoring fit. Filter scores above the graduation bar are necessary but not sufficient — Commander review still kills a meaningful share of graduated candidates.

Featured candidate

Engine state

74 total hypotheses
35 graduated
20 in flight
37 killed / exhausted
1,539 moves logged
$267 lifetime engine spend

See all hypotheses →