Data Engineering Podcast
Kanal Detayları
Data Engineering Podcast
This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
Son Bölümler
485 bölüm
Context Engineering as a Discipline: Building Governed AI Analytics
Summary
In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick Schrock, CTO and founder of Dagster Labs, to discu...

The Data Model That Captures Your Business: Metric Trees Explained
Summary
In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace, talks about metric trees - a new approach t...

From GPUs-as-a-Service to Workloads-as-a-Service: Flex AI’s Path to High-Utilization AI Infra
Summary
In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews Brijesh Tripathi, CEO of Flex AI, about revolution...

From RAG to Relational: How Agentic Patterns Are Reshaping Data Architecture
Summary
In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at AWS, talks about how agentic workflows are...

Duck Lake: Simplifying the Lakehouse Ecosystem
Summary
In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the creators of DuckDB, share their work on Duck La...

Aligning Business and Data: The Essential Role of Data Modeling
Summary
In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL DBM, talks about the socio-technical aspects o...

From Academia to Industry: Bridging Data Engineering Challenges
Summary
In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of Amsterdam, talks about his research on know...

High Performance And Low Overhead Graphs With KuzuDB
Summary
In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB, talks about their embeddable graph database. Pr...
Bridging Data and Decision-Making: AI's Role in Modern Analytics
Summary
In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity talk about their development of Orion, an au...
From Bits to Tables: The Evolution of S3 Storage
Summary
In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative functionalities of S3 Tables and Vectors and th...
Revolutionizing Python Notebooks with Marimo
Summary
In this episode of the Data Engineering Podcast Akshay Agrawal from Marimo discusses the innovative new Python notebook environment, whi...
Warehouse Native Incremental Data Processing With Dynamic Tables And Delayed View Semantics
Summary
In this episode of the Data Engineering Podcast Dan Sotolongo from Snowflake talks about the complexities of incremental data processing...
Streamlining Data Pipelines with MCP Servers and Vector Engines
Summary
In this episode of the Data Engineering Podcast Kacper Łukawski from Qdrant about integrating MCP servers with vector databases to proce...
Foundational Data Engineering At Two Sigma
Summary
In this episode of the Data Engineering Podcast Effie Baram, a leader in foundational data engineering at Two Sigma, talks about the com...
Enabling Agents In The Enterprise With A Platform Approach
Summary
In this episode of the Data Engineering Podcast Arun Joseph talks about developing and implementing agent platforms to empower businesse...
Dagster's New Era: Modularizing Data Transformation in the Age of AI
Summary
In this episode of the Data Engineering Podcast we welcome back Nick Schrock, CTO and founder of Dagster Labs, to discuss the evolving l...
AI and the Lakehouse: How Starburst is Pioneering New Workflows
Summary
In this episode of the Data Engineering Podcast Alex Albu, tech lead for AI initiatives at Starburst, talks about integrating AI workloa...
Amazon S3: The Backbone of Modern Data Systems
Summary
In this episode of the Data Engineering Podcast Mai-Lan Tomsen Bukovec, Vice President of Technology at AWS, talks about the evolution o...
Scaling Data Operations With Platform Engineering
Summary
In this episode of the Data Engineering Podcast Chakravarthy Kotaru talks about scaling data operations through standardized platform of...
From Data Discovery to AI: The Evolution of Semantic Layers
Summary
In this episode of the Data Engineering Podcast, host Tobias Macy welcomes back Shinji Kim to discuss the evolving role of semantic laye...
Balancing Off-the-Shelf and Custom Solutions in Data Engineering
Summary
In this episode of the Data Engineering Podcast Tulika Bhatt, a senior software engineer at Netflix, talks about her experiences with la...
StarRocks: Bridging Lakehouse and OLAP for High-Performance Analytics
Summary
In this episode of the Data Engineering Podcast Sida Shen, product manager at CelerData, talks about StarRocks, a high-performance analy...
Exploring NATS: A Multi-Paradigm Connectivity Layer for Distributed Applications
Summary
In this episode of the Data Engineering Podcast Derek Collison, creator of NATS and CEO of Synadia, talks about the evolution and capabi...
Advanced Lakehouse Management With The LakeKeeper Iceberg REST Catalog
Summary
In this episode of the Data Engineering Podcast Viktor Kessler, co-founder of Vakmo, talks about the architectural patterns in the lake...
Simplifying Data Pipelines with Durable Execution
Summary
In this episode of the Data Engineering Podcast Jeremy Edberg, CEO of DBOS, about durable execution and its impact on designing and impl...
Overcoming Redis Limitations: The Dragonfly DB Approach
Summary
In this episode of the Data Engineering Podcast Roman Gershman, CTO and founder of Dragonfly DB, explores the development and impact of...
Bringing AI Into The Inner Loop of Data Engineering With Ascend
Summary
In this episode of the Data Engineering Podcast Sean Knapp, CEO of Ascend.io, explores the intersection of AI and data engineering. He d...
Astronomer's Role in the Airflow Ecosystem: A Deep Dive with Pete DeJoy
Summary
In this episode of the Data Engineering Podcast Pete DeJoy, co-founder and product lead at Astronomer, talks about building and managing...
Accelerated Computing in Modern Data Centers With Datapelago
Summary
In this episode of the Data Engineering Podcast Rajan Goyal, CEO and co-founder of Datapelago, talks about improving efficiencies in dat...
The Future of Data Engineering: AI, LLMs, and Automation
Summary
In this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of DataFold, talks about the intersection of AI and...
Evolving Responsibilities in AI Data Management
Summary
In this episode of the Data Engineering Podcast Bartosz Mikulski talks about preparing data for AI applications. Bartosz shares his jour...
CSVs Will Never Die And OneSchema Is Counting On It
Summary
In this episode of the Data Engineering Podcast Andrew Luo, CEO of OneSchema, talks about handling CSV data in business operations. Andr...
Breaking Down Data Silos: AI and ML in Master Data Management
Summary
In this episode of the Data Engineering Podcast Dan Bruckner, co-founder and CTO of Tamr, talks about the application of machine learnin...
Building a Data Vision Board: A Guide to Strategic Planning
Summary
In this episode of the Data Engineering Podcast Lior Barak shares his insights on developing a three-year strategic vision for data mana...
How Orchestration Impacts Data Platform Architecture
Summary
The core task of data engineering is managing the flows of data through an organization. In order to ensure those flows are executing on...
An Exploration Of The Impediments To Reusable Data Pipelines
Summary
In this episode of the Data Engineering Podcast the inimitable Max Beauchemin talks about reusability in data pipelines. The conversatio...
The Art of Database Selection and Evolution
Summary
In this episode of the Data Engineering Podcast Sam Kleinman talks about the pivotal role of databases in software engineering. Sam shar...
Bridging Code and UI in Data Orchestration with Kestra
Summary
In this episode of the Data Engineering Podcast, Anna Geller talks about the integration of code and UI-driven interfaces for data orche...
Streaming Data Into The Lakehouse With Iceberg And Trino At Going
In this episode, I had the pleasure of speaking with Ken Pickering, VP of Engineering at Going, about the intricacies of streaming data into a Trino a...
An Opinionated Look At End-to-end Code Only Analytical Workflows With Bruin
Summary
The challenges of integrating all of the tools in the modern data stack has led to a new generation of tools that focus on a fully integ...