Want to create an interactive transcript for this episode?
Podcast: DataTalks.Club
Episode: Build Your Own Data Pipeline - Andreas Kretz
Description: We talked about:
Andreasβs background
Why data engineering is becoming more popular
Who to hire first β a data engineer or a data scientist?
How can I, as a data scientist, learn to build pipelines?
Donβt use too many tools
What is a data pipeline and why do we need it?
What is ingestion?
Can just one person build a data pipeline?
Approaches to building data pipelines for data scientists
Processing frameworks
Common setup for data pipelines β car price prediction
Productionizing the model with the help of a data pipeline
Scheduling
Orchestration
Start simple
Learning DevOps to implemen...