Data Science at Home

Data Science at Home


Latest Episodes

[RB] Replicating GPT-2, the most dangerous NLP model (with Aaron Gokaslan)
October 18, 2019

Join the discussion on our Discord server   In this episode, I am with Aaron Gokaslan, computer vision researcher, AI Resident at Facebook AI Research. Aaron is the author of OpenGPT-2, a parallel NLP model to the most discussed version that OpenAI d...

What is wrong with reinforcement learning?
October 15, 2019

Join the discussion on our Discord server   After reinforcement learning agents doing great at playing Atari video games, Alpha Go, doing financial trading, dealing with language modeling, let me tell you the real story here.In this episode I want to shi.

Have you met Shannon? Conversation with Jimmy Soni and Rob Goodman about one of the greatest minds in history
October 10, 2019

Join the discussion on our Discord server   In this episode I have an amazing conversation with Jimmy Soni and Rob Goodman, authors of “A mind at play”, a book entirely dedicated to the life and achievements of Claude Shannon. Claude Shannon does not n...

Attacking machine learning for fun and profit (conversation with the authors of SecML)
September 30, 2019

Join the discussion on our Discord server As ML plays a more and more relevant role in many domains of everyday life, it’s quite obvious to see more and more attacks to ML systems. In this episode we talk about the most popular attacks against machine le.

Replicating GPT-2, the most dangerous NLP model (with Aaron Gokaslan)
September 23, 2019

Join the discussion on our Discord server In this episode, I am with Aaron Gokaslan, computer vision researcher, AI Resident at Facebook AI Research. Aaron is the author of OpenGPT-2, a parallel NLP model to the most discussed version that OpenAI dec...

Training neural networks faster without GPU [RB]
September 17, 2019

Join the discussion on our Discord server Training neural networks faster usually involves the usage of powerful GPUs. In this episode I explain an interesting method from a group of researchers from Google Brain, who can train neural networks faster...

How to generate very large images with GANs
September 06, 2019

Join the discussion on our Discord server In this episode I explain how a research group from the University of Lubeck dominated the curse of dimensionality for the generation of large medical images with GANs. The problem is not as trivial as it seems. .

[RB] Complex video analysis made easy with Videoflow
August 29, 2019

In this episode I am with Jadiel de Armas, senior software engineer at Disney and author of Videflow, a Python framework that facilitates the quick development of complex video analysis applications and other series-processing based applications in a mul.

[RB] Validate neural networks without data with Dr. Charles Martin
August 27, 2019

In this episode, I am with Dr. Charles Martin from Calculation Consulting a machine learning and data science consulting company based in San Francisco. We speak about the nuts and bolts of deep neural networks and some impressive findings about the way .

Episode 74: How to cluster tabular data with Markov Clustering
August 21, 2019

In this episode I explain how a community detection algorithm known as Markov clustering can be constructed by combining simple concepts like random walks, graphs, similarity matrix. Moreover, I highlight how one can build a similarity graph and then run.