This browser doesn't support Spotify Web Player. Switch browsers or download Spotify for your desktop.

Data Engineering Podcast

By Tobias Macey

Weekly deep dives on data management with the engineers and entrepreneurs who are shaping the industry

  1. 1.
    The Alluxio Distributed Storage System - Episode 7002/19/2019
    59:44
  2. 2.
    Building Machine Learning Projects In The Enterprise - Episode 6902/11/2019
    48:18
  3. 3.
    Cleaning And Curating Open Data For Archaeology - Episode 6802/04/2019
    1:00:55
  4. 4.
    Managing Database Access Control For Teams With strongDM - Episode 6701/29/2019
    42:17
  5. 5.
    Building Enterprise Big Data Systems At LEGO - Episode 6601/21/2019
    48:03
  6. 6.
    TimescaleDB: The Timeseries Database Built For SQL And Scale - Episode 6501/14/2019
    41:25
  7. 7.
    Performing Fast Data Analytics Using Apache Kudu - Episode 6401/07/2019
    50:46
  8. 8.
    Simplifying Continuous Data Processing Using Stream Native Storage In Pravega with Tom Kaitchuck - Episode 6312/31/2018
    44:42
  1. 9.
    Continuously Query Your Time-Series Data Using PipelineDB with Derek Nelson and Usman Masood - Episode 6212/24/2018
    1:03:51
  2. 10.
    Advice On Scaling Your Data Pipeline Alongside Your Business with Christian Heinzmann - Episode 6112/17/2018
    39:22
  3. 11.
    Putting Apache Spark Into Action with Jean Georges Perrin - Episode 6012/10/2018
    50:31
  4. 12.
    Apache Zookeeper As A Building Block For Distributed Systems with Patrick Hunt - Episode 5912/03/2018
    54:25
  5. 13.
    Set Up Your Own Data-as-a-Service Platform On Dremio with Tomer Shiran - Episode 5811/26/2018
    39:18
  6. 14.
    Stateful, Distributed Stream Processing on Flink with Fabian Hueske - Episode 5711/19/2018
    48:01
  7. 15.
    How Upsolver Is Building A Data Lake Platform In The Cloud with Yoni Iny - Episode 5611/11/2018
    51:50
  8. 16.
    Self Service Business Intelligence And Data Sharing Using Looker with Daniel Mintz - Episode 5511/05/2018
    58:04
  9. 17.
    Using Notebooks As The Unifying Layer For Data Roles At Netflix with Matthew Seal - Episode 5410/29/2018
    40:54
  10. 18.
    Of Checklists, Ethics, and Data with Emily Miller and Peter Bull (Cross Post from Podcast.__init__) - Episode 5310/22/2018
    45:32
  11. 19.
    Improving The Performance Of Cloud-Native Big Data At Netflix Using The Iceberg Table Format with Ryan Blue - Episode 5210/15/2018
    53:45
  12. 20.
    Combining Transactional And Analytical Workloads On MemSQL with Nikita Shamgunov - Episode 5110/09/2018
    56:54
  13. 21.
    Building A Knowledge Graph From Public Data At Enigma With Chris Groskopf - Episode 5010/01/2018
    52:52
  14. 22.
    A Primer On Enterprise Data Curation with Todd Walter - Episode 4909/24/2018
    49:35
  15. 23.
    Take Control Of Your Web Analytics Using Snowplow With Alexander Dean - Episode 4809/17/2018
    47:48
  16. 24.
    Keep Your Data And Query It Too Using Chaos Search with Thomas Hazel and Pete Cheslock - Episode 4709/10/2018
    48:08
  17. 25.
    An Agile Approach To Master Data Management with Mark Marinelli - Episode 4609/03/2018
    47:16
  18. 26.
    Protecting Your Data In Use At Enveil with Ellison Anne Williams - Episode 4508/27/2018
    24:41
  19. 27.
    Graph Databases In Production At Scale Using DGraph with Manish Jain - Episode 4408/20/2018
    42:39
  20. 28.
    Putting Airflow Into Production With James Meickle - Episode 4308/13/2018
    48:05
  21. 29.
    Taking A Tour Of PostgreSQL with Jonathan Katz - Episode 4208/06/2018
    56:21
  22. 30.
    Mobile Data Collection And Analysis Using Ona And Canopy With Peter Lubell-Doughtie - Episode 4107/30/2018
    29:14
  23. 31.
    Ceph: A Reliable And Scalable Distributed Filesystem with Sage Weil - Episode 4007/16/2018
    48:30
  24. 32.
    Building Data Flows In Apache NiFi With Kevin Doran and Andy LoPresto - Episode 3907/08/2018
    1:04:15
  25. 33.
    Leveraging Human Intelligence For Better AI At Alegion With Cheryl Martin - Episode 3807/02/2018
    46:13
  26. 34.
    Package Management And Distribution For Your Data Using Quilt with Kevin Moore - Episode 3706/25/2018
    41:43
  27. 35.
    User Analytics In Depth At Heap with Dan Robinson - Episode 3606/17/2018
    45:27
  28. 36.
    CockroachDB In Depth with Peter Mattis - Episode 3506/11/2018
    43:41
  29. 37.
    ArangoDB: Fast, Scalable, and Multi-Model Data Storage with Jan Steeman and Jan Stücke - Episode 3406/04/2018
    40:05
  30. 38.
    The Alooma Data Pipeline With CTO Yair Weinberger - Episode 3305/28/2018
    47:49
  31. 39.
    PrestoDB and Starburst Data with Kamil Bajda-Pawlikowski - Episode 3205/21/2018
    42:07
  32. 40.
    Brief Conversations From The Open Data Science Conference: Part 2 - Episode 3105/14/2018
    26:05
  33. 41.
    Brief Conversations From The Open Data Science Conference: Part 1 - Episode 3005/07/2018
    32:38
  34. 42.
    Metabase Self Service Business Intelligence with Sameer Al-Sakran - Episode 2904/30/2018
    44:46
  35. 43.
    Octopai: Metadata Management for Better Business Intelligence with Amnon Drori - Episode 2804/23/2018
    39:52
  36. 44.
    Data Engineering Weekly with Joe Crobak - Episode 2704/15/2018
    43:31
  37. 45.
    Defining DataOps with Chris Bergh - Episode 2604/08/2018
    54:30
  38. 46.
    ThreatStack: Data Driven Cloud Security with Pete Cheslock and Patrick Cable - Episode 2504/01/2018
    51:52
  39. 47.
    MarketStore: Managing Timeseries Financial Data with Hitoshi Harada and Christopher Ryan - Episode 2403/25/2018
    33:27
  40. 48.
    Stretching The Elastic Stack with Philipp Krenn - Episode 2303/19/2018
    51:02
  41. 49.
    Database Refactoring Patterns with Pramod Sadalage - Episode 2203/12/2018
    49:05
  42. 50.
    The Future Data Economy with Roger Chen - Episode 2103/05/2018
    42:47
  43. 51.
    Honeycomb Data Infrastructure with Sam Stokes - Episode 2002/26/2018
    41:33
  44. 52.
    Data Teams with Will McGinnis - Episode 1902/19/2018
    28:38
  45. 53.
    TimescaleDB: Fast And Scalable Timeseries with Ajay Kulkarni and Mike Freedman - Episode 1802/11/2018
    1:02:40
  46. 54.
    Pulsar: Fast And Scalable Messaging with Rajan Dhabalia and Matteo Merli - Episode 1702/04/2018
    53:46
  47. 55.
    Dat: Distributed Versioned Data Sharing with Danielle Robinson and Joe Hand - Episode 1601/29/2018
    1:02:57
  48. 56.
    Snorkel: Extracting Value From Dark Data with Alex Ratner - Episode 1501/22/2018
    37:12
  49. 57.
    CRDTs and Distributed Consensus with Christopher Meiklejohn - Episode 1401/15/2018
    45:42
  50. 58.
    Citus Data: Distributed PostGreSQL for Big Data with Ozgun Erdogan and Craig Kerstiens - Episode 1301/08/2018
    46:44
  51. 59.
    Wallaroo with Sean T. Allen - Episode 1212/25/2017
    59:12
  52. 60.
    SiriDB: Scalable Open Source Timeseries Database with Jeroen van der Heijden - Episode 1112/18/2017
    33:52
  53. 61.
    Confluent Schema Registry with Ewen Cheslack-Postava - Episode 1012/10/2017
    49:21
  54. 62.
    data.world with Bryon Jacob - Episode 912/03/2017
    46:24
  55. 63.
    Data Serialization Formats with Doug Cutting and Julien Le Dem - Episode 811/22/2017
    51:42
  56. 64.
    Buzzfeed Data Infrastructure with Walter Menendez - Episode 711/14/2017
    43:40
  57. 65.
    Astronomer with Ry Walker - Episode 608/06/2017
    42:49
  58. 66.
    Rebuilding Yelp's Data Pipeline with Justin Cunningham - Episode 506/18/2017
    42:27
  59. 67.
    ScyllaDB with Eyal Gutkind - Episode 403/18/2017
    35:06
  60. 68.
    Defining Data Engineering with Maxime Beauchemin - Episode 303/05/2017
    45:20
  61. 69.
    Dask with Matthew Rocklin - Episode 201/22/2017
    46:00
  62. 70.
    Pachyderm with Daniel Whitenack - Episode 101/14/2017
    44:42
  63. 71.
    Introducing The Show - Episode 001/08/2017
    4:23

Listen to Data Engineering Podcast now.

Listen to Data Engineering Podcast in full in the Spotify app