Sveriges mest populära poddar
Latent Space: The AI Engineer Podcast

How AI is eating Finance — with Mike Conover of Brightwave

55 min11 juni 2024

In April 2023 we released an episode named “Mapping the future of *truly* open source models” to talk about Dolly, the first open, commercial LLM.

Mike was leading the OSS models team at Databricks at the time. Today, Mike is back on the podcast to give us the “one year later” update on the evolution of large language models and how he’s been using them to build Brightwave, an an AI research assistant for investment professionals.

Today they are announcing a $6M seed round (led by Alessio and Decibel!), and sharing some of the learnings from serving customers with >$120B of assets under management in production in the last 4 months since launch.

Losing faith in long context windows

In our recent “Llama3 1M context window” episode we talked about the amazing progress we have done in context window size, but it’s good to remember that Dolly’s original context size was 1,024 tokens, and this was only 14 months ago.

But while understanding length has increased, models are still not able to generate very long answers. His empirical intuition (which matches ours while building smol-podcaster) is that most commercial LLMs, as well as Llama, tend to generate responses

Fler avsnitt av Latent Space: The AI Engineer Podcast

Visa alla avsnitt av Latent Space: The AI Engineer Podcast

Latent Space: The AI Engineer Podcast med Latent.Space finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.