Data Engineering Podcast

Data Engineering Podcast


Latest Episodes

Data Migration Strategies For Large Scale Systems
May 26, 2024

Summary Any software system that survives long enough will require some form of migration or evolution. When that system is responsible for the data layer the process becomes more challenging. Sriram Panyam has been involved in several projects that requ

Release Management For Data Platform Services And Logic
May 12, 2024

Building a data platform is a substrantial engineering endeavor. Once it is running, the next challenge is figuring out how to address release management for all of the different component parts. The

Barking Up The Wrong GPTree: Building Better AI With A Cognitive Approach
May 05, 2024

Artificial intelligence has dominated the headlines for several months due to the successes of large language models. This has prompted numerous debates about the possibility of, and timeline for, art

Build Your Second Brain One Piece At A Time
April 28, 2024

Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and un

Making Email Better With AI At Shortwave
April 21, 2024

Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee started work on Shortwave he was focused on making email more productive. When AI started gaining adoption he

Designing A Non-Relational Database Engine
April 14, 2024

Databases come in a variety of formats for different use cases. The default association with the term "database" is relational engines, but non-relational engines are also used quite widely. In this e

Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer
April 07, 2024

Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the org

Adding Anomaly Detection And Observability To Your dbt Projects Is Elementary
March 31, 2024

Working with data is a complicated process, with numerous chances for something to go wrong. Identifying and accounting for those errors is a critical piece of building trust in the organization that

Ship Smarter Not Harder With Declarative And Collaborative Data Orchestration On Dagster+
March 24, 2024

A core differentiator of Dagster in the ecosystem of data orchestration is their focus on software defined assets as a means of building declarative workflows. With their launch of Dagster+ as the red

Reconciling The Data In Your Databases With Datafold
March 17, 2024

A significant portion of data workflows involve storing and processing information in database engines. Validating that the information is stored and processed correctly can be complex and time-consum