Sveriges mest populära poddar

Software Engineering Radio - the podcast for professional software developers

SE Radio 703: Sahaj Garg on Low Latency AI

55 min•14 januari 2026

In this episode, Sahaj Garg, CTO of wispr.ai, joins SE Radio host Robert Blumen to talk about the challenges of building low-latency AI applications. They discuss latency's effect on consumer behavior as well as interactive applications. The conversation explores how to measure latency and how scale impacts it. Then Sahaj and Robert shift to themes around AI, including whether "AI" means LLMs or something broader, as they look at latency requirements and challenges around subtypes of AI applications. The final part of the episode explores techniques for managing latency in AI: speed vs accuracy trade-offs; speed vs cost; latency vs cost; choosing the right model; reducing quantization; distillation; and guessing + validating.

Brought to you by IEEE Computer Society and IEEE Software magazine.

Fler avsnitt av Software Engineering Radio - the podcast for professional software developers

SE Radio 724: Jure Leskovec on Relational Graph and Foundational Models

10 juni•1 tim 2 min

SE Radio 723: Dave Airlie on Linux Kernel Maintenance

3 juni•1 tim 9 min

SE Radio 722: Dwayne McDaniel on the Engineering Challenges of Secrets Management

27 maj•52 min

SE Radio 721: Rob Moffat on Risk-First Software Development

20 maj•53 min

SE Radio 720: Martin Dilger on Understanding Eventsourcing

13 maj•56 min

SE Radio 719: Birol Yildiz on Building an Agentic AI SRE

6 maj•54 min

SE Radio 718: Will Sentance on JS Modernization

29 apr.•59 min

SE Radio 717: Eric Tschetter on Decoupling Observability

23 apr.•1 tim

SE Radio 716: Martin Kleppmann Local-First Software

15 apr.•55 min

SE Radio 715: Sahaj Garg on Designing for Ambiguity in Human Input

8 apr.•48 min

Software Engineering Radio - the podcast for professional software developers med [email protected] (SE-Radio Team) finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.