Sveriges mest populära poddar

Generally Intelligent

Giancarlo Kerg, Mila: Approaching deep learning from mathematical foundations

1 tim 9 min•27 mars 2021

Giancarlo Kerg (Google Scholar) is a PhD student at Mila, supervised by Yoshua Bengio and Guillaume Lajoie. He is working on out-of-distribution generalization and modularity in memory-augmented neural networks.

Highlights from our conversation:

🧮 Pure math foundations as an approach to progress and structural understanding in deep learning research

🧠 How a formal proof on the way self-attention mitigates gradient vanishing when capturing long-term dependencies in RNNs led to a relevancy screening mechanism resembling human memory consolidation

🎯 Out-of-distribution generalization through modularity and inductive biases

Fler avsnitt av Generally Intelligent

There will be a scientific theory of deep learning

24 apr.•1 tim 33 min

Malleable software and human agency with Geoffrey Litt

14 nov. 2025•1 tim 32 min

From lawless spaces to true liberty: rethinking AI's role in society

13 aug. 2025•1 tim 39 min

Rylan Schaeffer, Stanford: Investigating emergent abilities and challenging dominant research ideas

18 sep. 2024•1 tim 3 min

Ari Morcos, DatologyAI: Leveraging data to democratize model training

11 juli 2024•1 tim 34 min

Percy Liang, Stanford: The paradigm shift and societal effects of foundation models

9 maj 2024•1 tim 2 min

Seth Lazar, Australian National University: Legitimate power, moral nuance, and the political philosophy of AI

12 mars 2024•1 tim 56 min

Tri Dao, Stanford: FlashAttention and sparsity, quantization, and efficient inference

9 aug. 2023•1 tim 20 min

Jamie Simon, UC Berkeley: Theoretical principles for how neural networks learn and generalize

22 juni 2023•1 tim 2 min

Bill Thompson, UC Berkeley: How cultural evolution shapes knowledge acquisition

29 mars 2023•1 tim 15 min

Generally Intelligent med Kanjun Qiu finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.