← all hypotheses

Search Firm Forecast-Accuracy Scorecard

exhausted [TRIANGULATED] signals: 1 independent
What is this?
A scorecard for heads of talent at UK/US founder-led SaaS (50-200 staff) using 3+ external search firms. Per candidate handed over, the buyer ticks 3-5 structured forecast claims the firm implicitly or explicitly made: 'will accept offer at quoted band', 'clean reference + background check', 'will start on agreed date', 'will pass statutory probation'. Each claim resolves against discrete native events already in the ATS (Greenhouse/Lever/Ashby) and HRIS — offer-acceptance timestamp, day-1 attendance, background-check outcome, UK 3-month probation decision. No forensic on-job audits, no retention proxies. AE's adversarial debate tags failures into the autopsy taxonomy (Cosmetic Confidence on band, Temporal Blindness on start dates, Premise-Conclusion Severing on references) and the constraint-lifecycle (probation/promoted/killed) drives renewal decisions. Buyer remains head of talent; resolution stays 2-12 weeks; ground truth stays binary and native.
Why did we consider it?
AE's scoring + autopsy stack drops cleanly onto search-firm hand-offs, where binary ATS-native events give heads of talent the fast, defensible accountability loop they currently lack — a solo-UK-founder-shaped wedge into a £100-300K ARR business.
What breaks?
  • Severe data starvation: 50-200 headcount SaaS yields too few search-firm placements to power the AE's taxonomy.
  • Confounded attribution: Search firms will reject probation/start-date metrics as client-side failures, mirroring the RAND scorecard critique.
  • Go-to-market friction: ATS/HRIS integrations require enterprise security reviews, hostile to a solo evening/weekend founder.
What did we learn?
Killed: evidence_search_exhausted.

Evidence

Signal D — Demand proxy

{"found":true,"summary":"A LinkedIn article proposes a 'New Recruiter Scorecard' that explicitly includes 'hiring forecast accuracy' as a measurement axis, showing the recruitment industry is beginning to frame recruiter evaluation in prediction-accuracy terms. A separate LinkedIn post critiques single-metric vendor scorecards in procurement, illustrating the analogous pattern of scoring supplier predictions in adjacent domains.","sources":["https://www.linkedin.com/pulse/new-recruiter-scorecard-measuring-success-ai-enabled-era-anderson-xj6we","https://www.linkedin.com/posts/ferris-cheng-6a84a…

Evaluation history

WhenStagePhase
2026-05-09 21:42evidence_searchevidence_hunt
2026-05-09 21:36evidence_searchevidence_hunt
2026-05-09 21:30evidence_searchevidence_hunt
2026-05-09 21:25evidence_searchargument
2026-05-09 21:06evidence_searchargument
2026-05-09 20:48evidence_searchargument
2026-05-09 20:19evidence_searchargument
2026-05-09 10:00evidence_searchargument
2026-05-09 09:54audience_simulationargument
2026-05-09 09:48red_team_killargument
2026-05-09 09:42steelmanargument
2026-05-09 09:39genesisargument