We carry more compute in our hands than sits in the entire cloud, yet almost all the money flows to data centers. For five years we've built on-device first at Detail and Subwave, shipping local rendering, captions, and enhancement while keeping video on the device. Through NP-Hard we backed Mirai, which is pushing LLM inference to 1,000 tokens per second, and at Detail we use Argmax's on-device speech models. The building blocks exist, but there's still a wide gap between what developers want to build and what's actually available to them.
Every developer I spoke to at WWDC had the same story: a wish list of AI features they'd build if inference cost weren't a factor, and expensive cloud tokens they'd happily swap for a local model running on the device already in their user's pocket. The demand is real. The SDKs aren't there yet.
That's why we're launching Desert Ant Labs, a European on-device AI lab focused on shipping dozens of small, opinionated audio and visual models that drop into any product with a few lines of code. No inference cost, nothing leaving the device, running on the 6 billion devices people already own. When cost drops to zero, you stop trading capability against budget on every feature decision. The first models already power Detail and Subwave, and a dozen more will be available to third-party developers before the end of the year.
Published on Subwave
https://subwave.app/@paul/post/little-brains-in-every-product
