Data Engineering Podcast

Data Engineering Podcast


Latest Episodes

Automate Your Pipeline Creation For Streaming Data Transformations With SQLake
January 08, 2023

Managing end-to-end data flows becomes complex and unwieldy as the scale of data and its variety of applications in an organization grows. Part of this complexity is due to the transformation and orch

Increase Your Odds Of Success For Analytics And AI Through More Effective Knowledge Management With AlignAI
December 29, 2022

Making effective use of data requires proper context around the information that is being used. As the size and complexity of your organization increases the difficulty of ensuring that everyone has t

Using Product Driven Development To Improve The Productivity And Effectiveness Of Your Data Teams
December 28, 2022

With all of the messaging about treating data as a product it is becoming difficult to know what that even means. Vishal Singh is the head of products at Starburst which means that he has to spend all

An Exploration Of Tobias' Experience In Building A Data Lakehouse From Scratch
December 25, 2022

Five years of hosting the Data Engineering Podcast has provided Tobias Macey with a wealth of insight into the work of building and operating data systems at a variety of scales and for myriad purpose

Simple And Scalable Encryption Of Data In Use For Analytics And Machine Learning With Opaque Systems
December 25, 2022

Encryption and security are critical elements in data analytics and machine learning applications. We have well developed protocols and practices around data that is at rest and in motion, but securit

Making Sense Of The Technical And Organizational Considerations Of Data Contracts
December 18, 2022

One of the reasons that data work is so challenging is because no single person or team owns the entire process. This introduces friction in the process of collecting, processing, and using data. In o

Revisit The Fundamental Principles Of Working With Data To Avoid Getting Caught In The Hype Cycle
December 18, 2022

The data ecosystem has seen a constant flurry of activity for the past several years, and it shows no signs of slowing down. With all of the products, techniques, and buzzwords being discussed it can

Convert Your Unstructured Data To Embedding Vectors For More Efficient Machine Learning With Towhee
December 11, 2022

An interview with Frank Liu about how the open source Towhee library simplifies the work of building pipelines to generate vector embeddings of your data for building machine learning projects.

Run Your Applications Worldwide Without Worrying About The Database With Planetscale
December 11, 2022

An interview with Nick van Wiggeren about the Planetscale serverless MySQL service built on top of the open source Vitess project and the impact on developer productivity that it offers when you don't

Business Intelligence In The Palm Of Your Hand With Zing Data
December 04, 2022

An interview with Sabin Thomas about how Zing Data is lets you bring business intelligence with you when you're on the go with first-class support for mobile devices