Probably Secures $9M from a16z to Build Hallucination-prevention Infrastructure for High-stakes AI Applications

Probably, an AI reliability startup, has raised $9 million in seed funding from Andreessen Horowitz to tackle one of the most persistent problems in large language models: hallucinations and factual errors that escape detection before reaching end users.

Founded by Peter Elias, the company is targeting the kind of 99.99% accuracy common in deterministic software systems but rarely achieved with AI. Its first product is a data science tool that generates answers from complex datasets, with each answer accompanied by a citation and a full audit trail. The core innovation is what Elias describes as a validator harness: the LLM’s initial outputs are checked against a deterministic system that rejects any result inconsistent with the underlying dataset. The model has been trained in conjunction with that validator to optimise for speed and accuracy simultaneously.
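Probably has not published implementation details, so the following is only a minimal sketch of the general idea of a validator harness, not the company's actual system. It assumes a toy in-memory dataset, a hypothetical `Claim` record standing in for a model's answer plus its citation, and a fixed list of candidate answers in place of a real LLM. The validator re-derives each cited figure deterministically and rejects any candidate that disagrees with the data, logging every decision as an audit trail.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional, Tuple

# Toy dataset. All rows, column names and values are invented for illustration.
DATASET: List[Dict[str, object]] = [
    {"region": "north", "revenue": 120.0},
    {"region": "south", "revenue": 80.0},
    {"region": "north", "revenue": 50.0},
]


@dataclass(frozen=True)
class Claim:
    """Hypothetical model output: an asserted number plus the citation
    (column and row filter) needed to re-derive it from the dataset."""
    column: str
    region: str
    value: float


def recompute(claim: Claim, rows: List[Dict[str, object]]) -> float:
    """Deterministically re-derive the cited figure from the raw rows."""
    return sum(float(r[claim.column]) for r in rows if r["region"] == claim.region)


def validate(claim: Claim, rows: List[Dict[str, object]], tol: float = 1e-9) -> bool:
    """Accept a claim only if it matches the recomputed value within tol."""
    return abs(recompute(claim, rows) - claim.value) <= tol


def answer_with_harness(
    candidates: Iterable[Claim], rows: List[Dict[str, object]]
) -> Tuple[Optional[Claim], List[str]]:
    """Return the first candidate that passes validation, plus an audit log.

    `candidates` stands in for successive model outputs; a real system
    would call a model here instead of iterating over a fixed list.
    """
    audit: List[str] = []
    for claim in candidates:
        ok = validate(claim, rows)
        audit.append(f"{claim} -> {'accepted' if ok else 'rejected'}")
        if ok:
            return claim, audit
    return None, audit  # nothing consistent with the data: refuse to answer


if __name__ == "__main__":
    # The first candidate is a made-up "hallucinated" total; the second is correct.
    attempts = [
        Claim(column="revenue", region="north", value=200.0),
        Claim(column="revenue", region="north", value=170.0),
    ]
    accepted, log = answer_with_harness(attempts, DATASET)
    print(accepted)
    for line in log:
        print(line)
```

In this sketch the harness returns nothing at all when no candidate survives validation, so an unverifiable answer never reaches the user. How a production system would train the model jointly with the validator, as the article describes, is outside what this example attempts to show.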

The approach has a notable commercial advantage. Because the harness reduces ambiguity so precisely, the system can run on AI models significantly smaller than their frontier equivalents, specifically models four capability classes below the leading offerings. That allows deployment on local hardware rather than data centre infrastructure and dramatically cuts token costs.

Elias argued that the major AI laboratories have little incentive to solve hallucinations at this level, since their revenue scales with the number of corrections and retries a model requires. Probably’s architecture inverts that logic, and Elias said the same engine could extend beyond data science into accounting, medical services, and any other precision-sensitive domain.
