Data Engineering Podcast

By Tobias Macey

Weekly deep dives on data management with the engineers and entrepreneurs who are shaping the industry

  1. 1.
    Make Database Performance Optimization A Playful Experience With Ottertune
    58:28
  2. 2.
    Bring Order To The Chaos Of Your Unstructured Data Assets With Unstruk
    40:47
  3. 3.
    Accelerating ML Training And Delivery With In-Database Machine Learning
    1:05:32
  4. 4.
    Taking A Tour Of The Google Cloud Platform For Data And Analytics
    53:16
  5. 5.
    Make Sure Your Records Are Reliable With The BookKeeper Distributed Storage Layer
    42:01
  6. 6.
    Build Your Analytics With A Collaborative And Expressive SQL IDE Using Querybook
    52:35
  7. 7.
    Making Data Pipelines Self-Serve For Everyone With Shipyard
    51:22
  8. 8.
    Paving The Road For Fast Analytics On Distributed Clouds With The Yellowbrick Data Warehouse
    52:40
  1. 9.
    Easily Build Advanced Similarity Search With The Pinecone Vector Database
    46:47
  2. 10.
    A Holistic Approach To Data Governance Through Self Reflection At Collibra
    55:52
  3. 11.
    Unlocking The Power of Data Lineage In Your Platform with OpenLineage
    57:38
  4. 12.
    Building Your Data Warehouse On Top Of PostgreSQL
    1:15:06
  5. 13.
    Making Analytical APIs Fast With Tinybird
    54:23
  6. 14.
    Making Spark Cloud Native At Data Mechanics
    40:15
  7. 15.
    The Grand Vision And Present Reality of DataOps
    57:08
  8. 16.
    Self Service Data Exploration And Dashboarding With Superset
    47:24
  9. 17.
    Moving Machine Learning Into The Data Pipeline at Cherre
    48:04
  10. 18.
    Exploring The Expanding Landscape Of Data Professions with Josh Benamram of Databand
    1:08:36
  11. 19.
    Put Your Whole Data Team On The Same Page With Atlan
    57:36
  12. 20.
    Data Quality Management For The Whole Team With Soda Data
    58:00
  13. 21.
    Real World Change Data Capture At Datacoral
    49:58
  14. 22.
    Managing The DoorDash Data Platform
    46:04
  15. 23.
    Leave Your Data Where It Is And Automate Feature Extraction With Molecula
    51:39
  16. 24.
    Bridging The Gap Between Machine Learning And Operations At Iguazio
    1:06:27
  17. 25.
    Self Service Open Source Data Integration With AirByte
    52:15
  18. 26.
    Building The Foundations For Data Driven Businesses at 5xData
    52:15
  19. 27.
    How Shopify Is Building Their Production Data Warehouse Using DBT
    46:30
  20. 28.
    System Observability For The Cloud Native Era With Chronosphere
    1:04:50
  21. 29.
    Making It Easier To Stick B2B Data Integration Pipelines Together With Hotglue
    34:05
  22. 30.
    Using Your Data Warehouse As The Source Of Truth For Customer Data With Hightouch
    59:33
  23. 31.
    Enabling Version Controlled Data Collaboration With TerminusDB
    57:48
  24. 32.
    Bringing Feature Stores and MLOps to the Enterprise at Tecton
    47:40
  25. 33.
    Off The Shelf Data Governance With Satori
    34:24
  26. 34.
    Low Friction Data Governance With Immuta
    53:33
  27. 35.
    Building A Self Service Data Platform For Alternative Data Analytics At YipitData
    1:04:47
  28. 36.
    Proven Patterns For Building Successful Data Teams
    1:12:30
  29. 37.
    Streaming Data Integration Without The Code at Equalum
    44:50
  30. 38.
    Keeping A Bigeye On The Data Quality Market
    49:25
  31. 39.
    Self Service Data Management From Ingest To Insights With Isima
    44:02
  32. 40.
    Building A Cost Effective Data Catalog With Tree Schema
    51:52
  33. 41.
    Add Version Control To Your Data Lake With LakeFS
    50:15
  34. 42.
    Cloud Native Data Security As Code With Cyral
    48:32
  35. 43.
    Better Data Quality Through Observability With Monte Carlo
    55:52
  36. 44.
    Rapid Delivery Of Business Intelligence Using Power BI
    1:02:42
  37. 45.
    Self Service Real Time Data Integration Without The Headaches With Meroxa
    1:00:55
  38. 46.
    Speed Up And Simplify Your Streaming Data Workloads With Red Panda
    59:46
  39. 47.
    Cutting Through The Noise And Focusing On The Fundamentals Of Data Engineering With The Data Janitor
    47:40
  40. 48.
    Distributed In Memory Processing And Streaming With Hazelcast
    44:07
  41. 49.
    Simplify Your Data Architecture With The Presto Distributed SQL Engine
    53:59
  42. 50.
    Building A Better Data Warehouse For The Cloud At Firebolt
    1:05:50

Listen to Data Engineering Podcast now.

Listen to Data Engineering Podcast in full in the Spotify app