The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Latest Episodes
Controlling Fusion Reactor Instability with Deep Reinforcement Learning with Aza Jalalvand - #682
Today we're joined by Azarakhsh (Aza) Jalalvand, a research scholar at Princeton University, to discuss his work using deep reinforcement learning to control plasma instabilities in nuclear fusion reactors. Aza explains his team developed a model to detec
GraphRAG: Knowledge Graphs for AI Applications with Kirk Marple - #681
Today we're joined by Kirk Marple, CEO and founder of Graphlit, to explore the emerging paradigm of "GraphRAG," or Graph Retrieval Augmented Generation. In our conversation, Kirk digs into the GraphRAG architecture and how Graphlit uses it to offer a mult
Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680
Today we're joined by Alex Havrilla, a PhD student at Georgia Tech, to discuss "Teaching Large Language Models to Reason with Reinforcement Learning." Alex discusses the role of creativity and exploration in problem solving and explores the opportunities
Localizing and Editing Knowledge in LLMs with Peter Hase - #679
Today we're joined by Peter Hase, a fifth-year PhD student at the University of North Carolina NLP lab. We discuss "scalable oversight", and the importance of developing a deeper understanding of how large neural networks make decisions. We learn how matr
Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678
Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his paper: "Coercing LLMs to Do and Reveal (Almost) Anything". Jonas explains how neural networks can be exploited, highlighting the risk of deploying LLM agen
V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677
Today were joined by Mido Assran, a research scientist at Metas Fundamental AI Research (FAIR). In this conversation, we discuss V-JEPA, a new model being billed as the next step in Yann LeCun's vision for true artificial reasoning. V-JEPA, the video
Video as a Universal Interface for AI Reasoning with Sherry Yang - #676
Today were joined by Sherry Yang, senior research scientist at Google DeepMind and a PhD student at UC Berkeley. In this interview, we discuss her new paper, "Video as the New Language for Real-World Decision Making, which explores how generative video
Assessing the Risks of Open AI Models with Sayash Kapoor - #675
Today were joined by Sayash Kapoor, a Ph.D. student in the Department of Computer Science at Princeton University. Sayash walks us through his paper: "On the Societal Impact of Open Foundation Models. We dig into the controversy around AI safety, the ri
OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674
Today were joined by Akshita Bhagia, a senior research engineer at the Allen Institute for AI. Akshita joins us to discuss OLMo, a new open source language model with 7 billion and 1 billion variants, but with a key difference compared to similar models
Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673
Today were joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive science and machine learning. Our conversation centers on Bens recent paper, Why think step by step? Reas