Sveriges mest populära poddar
The BugBash Podcast

Hypothesis vs. Hallucinations: Property Testing AI-Generated Code

1 tim 19 min10 december 2025

Large Language Models can generate code in a flash, but that code is notoriously unreliable. Traditional unit tests often can’t put enough guardrails in place to ensure correctness… even if they’re written by the LLM itself.

This is where property-based testing (PBT) becomes essential.

Today, we're joined by David R. MacIver, creator of the PBT library Hypothesis, and now an Antithesis employee! We discuss how to build robust feedback loops that are needed to make AI-generated code trustworthy.

We'll cover why standard AI coding benchmarks are flawed, how Hypothesis makes PBT approachable, and the challenge of getting developers to think in "invariants." David also shares his perspective on the future of AI in software engineering.

If you want to build a reliability backstop for your code, vibed or otherwise, stick around.

The BugBash Podcast med Antithesis finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.