The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)


Latest Episodes

Controlling Fusion Reactor Instability with Deep Reinforcement Learning with Aza Jalalvand - #682
April 29, 2024

Today we're joined by Azarakhsh (Aza) Jalalvand, a research scholar at Princeton University, to discuss his work using deep reinforcement learning to control plasma instabilities in nuclear fusion reactors. Aza explains his team developed a model to detec

GraphRAG: Knowledge Graphs for AI Applications with Kirk Marple - #681
April 22, 2024

Today we're joined by Kirk Marple, CEO and founder of Graphlit, to explore the emerging paradigm of "GraphRAG," or Graph Retrieval Augmented Generation. In our conversation, Kirk digs into the GraphRAG architecture and how Graphlit uses it to offer a mult

Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680
April 16, 2024

Today we're joined by Alex Havrilla, a PhD student at Georgia Tech, to discuss "Teaching Large Language Models to Reason with Reinforcement Learning." Alex discusses the role of creativity and exploration in problem solving and explores the opportunities

Localizing and Editing Knowledge in LLMs with Peter Hase - #679
April 08, 2024

Today we're joined by Peter Hase, a fifth-year PhD student at the University of North Carolina NLP lab. We discuss "scalable oversight", and the importance of developing a deeper understanding of how large neural networks make decisions. We learn how matr

Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678
April 01, 2024

Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his paper: "Coercing LLMs to Do and Reveal (Almost) Anything". Jonas explains how neural networks can be exploited, highlighting the risk of deploying LLM agen

V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677
March 25, 2024

Today were joined by Mido Assran, a research scientist at Metas Fundamental AI Research (FAIR). In this conversation, we discuss V-JEPA, a new model being billed as the next step in Yann LeCun's vision for true artificial reasoning. V-JEPA, the video

Video as a Universal Interface for AI Reasoning with Sherry Yang - #676
March 18, 2024

Today were joined by Sherry Yang, senior research scientist at Google DeepMind and a PhD student at UC Berkeley. In this interview, we discuss her new paper, "Video as the New Language for Real-World Decision Making, which explores how generative video

Assessing the Risks of Open AI Models with Sayash Kapoor - #675
March 11, 2024

Today were joined by Sayash Kapoor, a Ph.D. student in the Department of Computer Science at Princeton University. Sayash walks us through his paper: "On the Societal Impact of Open Foundation Models. We dig into the controversy around AI safety, the ri

OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674
March 04, 2024

Today were joined by Akshita Bhagia, a senior research engineer at the Allen Institute for AI. Akshita joins us to discuss OLMo, a new open source language model with 7 billion and 1 billion variants, but with a key difference compared to similar models

Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673
February 26, 2024

Today were joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive science and machine learning. Our conversation centers on Bens recent paper, Why think step by step? Reas