Data Engineering Podcast
Latest Episodes
Data Migration Strategies For Large Scale Systems
Any software system that survives long enough will require some form of migration or evolution. When that system is responsible for the data layer the process becomes more challenging. Sriram Panyam has been involved in several projects that requ
Release Management For Data Platform Services And Logic
Building a data platform is a substantial engineering endeavor. Once it is running, the next challenge is figuring out how to address release management for all of the different component parts. The
Barking Up The Wrong GPTree: Building Better AI With A Cognitive Approach
Artificial intelligence has dominated the headlines for several months due to the successes of large language models. This has prompted numerous debates about the possibility of, and timeline for, art
Build Your Second Brain One Piece At A Time
Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and un
Making Email Better With AI At Shortwave
Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee started work on Shortwave he was focused on making email more productive. When AI started gaining adoption he
Designing A Non-Relational Database Engine
Databases come in a variety of formats for different use cases. The default association with the term "database" is relational engines, but non-relational engines are also used quite widely. In this e
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer
Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the org
Adding Anomaly Detection And Observability To Your dbt Projects Is Elementary
Working with data is a complicated process, with numerous chances for something to go wrong. Identifying and accounting for those errors is a critical piece of building trust in the organization that
Ship Smarter Not Harder With Declarative And Collaborative Data Orchestration On Dagster+
A core differentiator of Dagster in the ecosystem of data orchestration is its focus on software-defined assets as a means of building declarative workflows. With their launch of Dagster+ as the red
Reconciling The Data In Your Databases With Datafold
A significant portion of data workflows involve storing and processing information in database engines. Validating that the information is stored and processed correctly can be complex and time-consum