Free Data Engineering Learning Path (6 Weeks)
Courses to help building a strong data engineering foundation.
This learning path combines free courses from curated platforms and Google Cloud Skills Boost to help you build strong data engineering foundations.
Week 1: Data Engineering Fundamentals
π― Goal: Learn core concepts, SQL, and programming basics.
- SQLBolt β Interactive SQL tutorials
- Introduction to Python β Constructor Academy
- Git Fundamentals β Microsoft Learn
- Practice Git with GitHub Docs
- Data Fundamentals β Microsoft DP-900
Week 2: Cloud Computing & Databases
π― Goal: Get comfortable with cloud environments and services.
Google Cloud Skills Boost:
- Google Cloud Fundamentals: Core Infrastructure
- Cloud Storage Fundamentals
- Relational Database Essentials
Week 3: Data Ingestion & Transformation
π― Goal: Learn how to ingest data, clean it, and prepare it for analysis.
Google Cloud Skills Boost:
Week 4: Data Warehousing, Modeling, and Analytics
π― Goal: Learn how to design scalable models and query large datasets.
- Data Modelling Expert Session - details to be identified
Google Cloud Skills Boost:
- Exploring and Preparing your Data with BigQuery
- Data Modeling in BigQuery
- Building Data Lakes with BigQuery and Cloud Storage
Week 5: Orchestration, Visualization, and Capstone
π― Goal: Build workflows and share results.
Google Cloud Skills Boost
Capstone Project (Optional)
Ingest β Transform with BigQuery β Visualize with Streamlit β Orchestrate with Cloud Composer/Airflow
- Create a Dashboard in Streamlit to visualize the data
- Step 1: Ingest data from a public APIs or datasets from here
- Step 2: Identify a business case to showcase a trend or pattern based on the chosen dataset
- Step 3: Transform the data using Pandas/Pyspark
- Step 4: Store the data in BigQuery database
- Step 5: Create a dashboard using Streamlit to visualize the data
- Step 6: Create a repository on GitHub and leverage poetry build tool
- Additional Step: Orchestrate this project in CLoud Composer or Airflow
Repository and Practicalities
- Fork this data-engineering-learnings to your own GitHub account
- Go checkout the repo for more details.
Discord
Join the DataTribe Collective Discord server to connect with fellow learners, ask questions, and share your progress. Join here.
Happy learning! π
Acknowledgments
For more details on content usage, dependencies, and non-commercial information, please refer to the disclaimer page.