Data Engineering Podcast
Latest Episodes
Automate Your Pipeline Creation For Streaming Data Transformations With SQLake
Managing end-to-end data flows becomes complex and unwieldy as the scale of data and its variety of applications in an organization grows. Part of this complexity is due to the transformation and orch
Increase Your Odds Of Success For Analytics And AI Through More Effective Knowledge Management With AlignAI
Making effective use of data requires proper context around the information that is being used. As the size and complexity of your organization increases the difficulty of ensuring that everyone has t
Using Product Driven Development To Improve The Productivity And Effectiveness Of Your Data Teams
With all of the messaging about treating data as a product it is becoming difficult to know what that even means. Vishal Singh is the head of products at Starburst which means that he has to spend all
An Exploration Of Tobias' Experience In Building A Data Lakehouse From Scratch
Five years of hosting the Data Engineering Podcast has provided Tobias Macey with a wealth of insight into the work of building and operating data systems at a variety of scales and for myriad purpose
Simple And Scalable Encryption Of Data In Use For Analytics And Machine Learning With Opaque Systems
Encryption and security are critical elements in data analytics and machine learning applications. We have well developed protocols and practices around data that is at rest and in motion, but securit
Making Sense Of The Technical And Organizational Considerations Of Data Contracts
One of the reasons that data work is so challenging is because no single person or team owns the entire process. This introduces friction in the process of collecting, processing, and using data. In o
Revisit The Fundamental Principles Of Working With Data To Avoid Getting Caught In The Hype Cycle
The data ecosystem has seen a constant flurry of activity for the past several years, and it shows no signs of slowing down. With all of the products, techniques, and buzzwords being discussed it can
Convert Your Unstructured Data To Embedding Vectors For More Efficient Machine Learning With Towhee
An interview with Frank Liu about how the open source Towhee library simplifies the work of building pipelines to generate vector embeddings of your data for building machine learning projects.
Run Your Applications Worldwide Without Worrying About The Database With Planetscale
An interview with Nick van Wiggeren about the Planetscale serverless MySQL service built on top of the open source Vitess project and the impact on developer productivity that it offers when you don't
Business Intelligence In The Palm Of Your Hand With Zing Data
An interview with Sabin Thomas about how Zing Data is lets you bring business intelligence with you when you're on the go with first-class support for mobile devices