← all hypotheses

Forecast QA for Investment Committee Memos

ranked [A] filter 8.5/15 spread ±0.5 signals: 3 independent
What is this?
A pre-circulation quality gate for investment committee memos, market theses, and internal allocation notes at small hedge funds, family offices, and specialist asset managers. Instead of helping pundits publish cleaner narratives, it pressure-tests internal claims before capital is committed. AE flags severed premises, concession laundering, cosmetic confidence, temporal blindness, and missing falsifiers, then rewrites the memo into an auditable forecast structure: core claim, assumptions, disconfirming evidence, time horizon, observable resolution criteria, and calibrated confidence. After circulation, the same claims are tracked against public outcomes and internal review dates, creating a reality-graded archive of which analysts and thesis types actually held up. This uses AE's real strengths: adversarial multi-model debate, code-enforced grading loops, structured constraints, and sub-24h feedback tied to objective outcomes rather than style. The buyer incentive is different from publishers: better IC discipline, fewer thesis drift errors, cleaner post-mortems, and a durable internal edge in decision quality rather than public narrative management.
Why did we consider it?
Forecast QA for IC memos is compelling because it turns narrative investment writing into auditable, reality-scored forecasts, directly improving capital-allocation discipline in a way generic AI research tools do not.
What breaks?
  • LLM Memorization Bias: Research (Lopez-Lira et al., 2025) shows LLMs memorize economic data, rendering your 'reality-graded' multi-model debates dangerously susceptible to look-ahead bias and contamination.
  • Cultural Rejection & Analyst Mutiny: Discretionary investors (e.g., Howard Marks) reject rigid forecasting, and analysts will actively sabotage or game an AI system designed to audit their bonus-linked performance.
  • GTM/Founder Mismatch: Selling deeply integrated, highly confidential IC workflow software to paranoid financial institutions requires intense, relationship-led enterprise sales, impossible for a part-time, solo founder.
Fatal objection: This dies because the buyer must adopt a surveillance-grade accountability system for proprietary investment judgment, and the market evidence points to demand for memo convenience, not institutionalized self-indictment.
What did we learn?
Still in evaluation (phase: ranked). No verdict yet.

Filter scores

Five axes, each scored 0-3. Three independent runs by different model perspectives. Median shown.

AxisWhat it measures
data moatDoes this product accumulate proprietary data that compounds?
10x model testDoes a better model make this more valuable, or redundant?
fast feedback loopsCan outputs be graded against reality in <30 days?
solo founder feasibleCan a solo operator build and run this without a team?
AI providers cant eat itDo hyperscalers have structural reasons NOT to build this?
Composite median: 8.5 / 15. Graduation threshold: 9.0. IQR across runs: 0.5.

Evidence

Signal A — Primary source

the deal team will present its final Investment Committee memo with the results of final diligence, structure and terms with a final, formal recommendation to the Investment Committee for the investment decision.

Signal B — Competitor with documented gap

Automates extraction of key details, risk profile comparison, and summarization for PE/VC/CRE memos but lacks adversarial flagging of premises/concessions/temporal biases/missing falsifiers, forecast restructuring with resolution criteria/confidence calibration, or outcome tracking/post-mortems.

Signal D — Demand proxy

{"found":true,"summary":"Forum discussions on Reddit and HN highlight manual processes for investment thesis tracking, post-mortems, and memo reviews in hedge funds/family offices, with interest in AI tools for research/memo generation but no standard automated QA; Addepar whitepaper notes firms conduct post-mortems and audits of investment theses vs. performance.","sources":["https://longterminvesting.stanford.edu/sites/g/files/sbiybj23856/files/media/file/addepar-investment-memos-and-decision-making.pdf","https://www.reddit.com/r/ValueInvesting/comments/1iydy3r/how_do_you_manage_your_investm…

Evaluation history

WhenStagePhase
2026-04-22 21:20evidence_searchranked
2026-04-22 19:40evidence_searchranked
2026-04-22 18:50evidence_searchranked
2026-04-22 18:00evidence_searchranked
2026-04-22 17:10evidence_searchranked
2026-04-22 16:20evidence_searchranked
2026-04-22 15:30evidence_searchranked
2026-04-22 14:40evidence_searchranked
2026-04-22 13:50evidence_searchranked
2026-04-22 13:00evidence_searchranked
2026-04-22 12:10evidence_searchranked
2026-04-22 11:20evidence_searchranked
2026-04-22 10:30evidence_searchranked
2026-04-22 09:40evidence_searchranked
2026-04-22 08:50evidence_searchranked
2026-04-22 08:00evidence_searchranked
2026-04-22 07:10evidence_searchranked
2026-04-22 06:20evidence_searchranked
2026-04-22 05:30evidence_searchranked
2026-04-22 04:40evidence_searchranked
2026-04-22 03:50evidence_searchranked
2026-04-22 03:00evidence_searchranked
2026-04-22 02:10evidence_searchranked
2026-04-22 01:30evidence_searchranked
2026-04-22 00:50evidence_searchranked
2026-04-22 00:20evidence_searchranked
2026-04-21 23:50evidence_searchranked
2026-04-21 23:20evidence_searchranked
2026-04-21 22:50evidence_searchranked
2026-04-21 07:30evidence_searchranked
2026-04-21 07:00evidence_searchranked
2026-04-19 23:50fatal_objectionranked
2026-04-19 23:40fatal_objectionranked
2026-04-19 18:20filter_scorescored
2026-04-19 18:10filter_scorescored
2026-04-19 18:00filter_scorescored
2026-04-19 17:50evidence_searchargument
2026-04-19 17:40audience_simulationargument
2026-04-19 17:30red_team_killargument
2026-04-19 17:20steelmanargument
2026-04-19 17:10genesisargument