Sveriges mest populära poddar

Data Brew by Databricks

Utbildning Teknologi

SWE-bench & SWE-agent | Data Brew | Episode 44

36 min•17 april 2025

In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, discuss SWE-bench and SWE-agent, two groundbreaking tools for evaluating and enhancing AI in software engineering.

Highlights include:
- SWE-bench: A benchmark for assessing AI models on real-world coding tasks.
- Addressing data leakage concerns in GitHub-sourced benchmarks.
- SWE-agent: An AI-driven system for navigating and solving coding challenges.
- Overcoming agent limitations, such as getting stuck in loops.
- The future of AI-powered code reviews and automation in software engineering.

Fler avsnitt av Data Brew by Databricks

Reinforcement Fine-Tuning and the Future of Specialized AI Models

5 aug. 2025•40 min

Benchmarking Domain Intelligence | Data Brew | Episode 45

24 apr. 2025•32 min

Enterprise AI: Research to Product | Data Brew | Episode 43

10 apr. 2025•38 min

Multimodal AI | Data Brew | Episode 42

7 apr. 2025•42 min

Age of Agents | Data Brew | Episode 41

27 mars 2025•41 min

Reward Models | Data Brew | Episode 40

20 mars 2025•40 min

Retrieval, rerankers, and RAG tips and tricks | Data Brew | Episode 39

20 feb. 2025•45 min

The Power of Synthetic Data | Data Brew | Episode 38

4 feb. 2025•42 min

Secret to Production AI: Tools & Infrastructure | Data Brew | Episode 37

22 jan. 2025•37 min

Mixture of Memory Experts (MoME) | Data Brew | Episode 36

10 jan. 2025•41 min

Data Brew by Databricks med Databricks finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.