AI Testing and Evaluation: Reflections

Microsoft Research Podcast

AI Testing and Evaluation: Reflections

July 21, 2025

In the series finale, Amanda Craig Deckard returns to examine what Microsoft has learned about testing as a governance tool. She also explores the roles of rigor, standardization, and interpretability in testing and what’s next for Microsoft’s AI governance work.

Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-reflections/

Download Episode

Redmond, WA

An ongoing series of conversations bringing you right up to the cutting edge of Microsoft Research.

Microsoft Research Podcast

AI Testing and Evaluation: Reflections

Services