Baron Schwinger
About me: I am a data analyst actively transitioning into data science and engineering roles. While I am still in the process of completing foundational courses, such as Andrew Ng's Machine Learning Specialization, I am taking steps to broaden my skill set to include data engineering to address real-world challenges at my current job. By taking this course, I aim to bridge the gap between data engineering and machine learning systems. My goal is to develop efficient pipelines that not only enhance analytics workflows but also support advanced AI applications like recommendation systems.
Total Score: 73
Homework submissions
Homework 1: Docker, SQL and Terraform
Score: 11 = 7 (questions) + 0 (FAQ) + 4 (learning in public)
Homework URL: View submission
Learning in public links: Show
- https://medium.com/@cyanlake/why-do-we-hear-conda-in-data-science-but-docker-in-data-engineering-046ee34deb6f
- https://medium.com/@cyanlake/lets-get-docker-involved-in-our-data-pipeline-process-207f860fbc49
- https://medium.com/@cyanlake/what-is-dockerfile-220ff5263bc9
- https://medium.com/@cyanlake/learning-terraform-a-beginners-guide-0b1b8305df14
Homework 2: Workflow Orchestration
Score: 11 = 6 (questions) + 0 (FAQ) + 5 (learning in public)
Homework URL: View submission
Learning in public links: Show
- https://medium.com/@cyanlake/post-1-why-workflow-orchestration-meet-kestra-22e5cd7776ce
- https://medium.com/@cyanlake/de-week2-2-why-workflow-orchestration-meet-kestra-5ae0d1bff66f
- https://medium.com/@cyanlake/de-week2-3-deep-dive-into-a-comprehensive-docker-compose-for-kestra-and-postgresql-b07ef4786d1b
- https://medium.com/@cyanlake/de-week2-4-deploying-kestra-with-docker-compose-running-your-first-workflow-60ea3cf45f0a
- https://medium.com/@cyanlake/de-week2-5-essential-data-ingestion-with-csv-and-postgres-25ab298fb22a
Workshop 1: Ingestion with dlt
Score: 8 = 4 (questions) + 0 (FAQ) + 4 (learning in public)
Homework URL: View submission
Learning in public links: Show
- https://medium.com/@cyanlake/de-workshop-1-what-is-data-load-tool-dlt-ec5a34ba4e68
- https://medium.com/@cyanlake/de-workshop-2data-normalization-for-analysts-why-your-csv-files-arent-enough-and-how-tools-7337c4c801ad
- https://medium.com/@cyanlake/de-workshop-3-1-stop-re-fetching-data-in-tableau-build-a-self-updating-7-day-dataset-with-dlt-a3a5b3b648d2
- https://medium.com/@cyanlake/de-workshop-3-2-stop-re-fetching-data-in-tableau-build-a-self-updating-7-day-dataset-with-dlt-cd87bb786e75
Homework 3: Data Warehousing
Score: 13 = 8 (questions) + 0 (FAQ) + 5 (learning in public)
Homework URL: View submission
Learning in public links: Show
- https://medium.com/@cyanlake/de-module3-1-oltp-vs-olap-core-concepts-for-data-engineers-5b589ff4139c
- https://medium.com/@cyanlake/de-module3-2-what-is-data-warehouse-0c4fca0c2c32
- https://medium.com/@cyanlake/de-module3-3-data-lake-vs-lakehouse-why-and-how-1cc6046f64e1
- https://medium.com/@cyanlake/de-module3-3-2-5186801050f5
- https://medium.com/@cyanlake/de-module3-4-bigquery-for-analysts-06bbcc419cd0
Homework 4: Analytics Engineering
Score: 10 = 6 (questions) + 0 (FAQ) + 4 (learning in public)
Homework URL: View submission
Learning in public links: Show
- https://medium.com/@cyanlake/de-module4-1-comparing-dbt-models-and-sql-ctes-similarities-differences-and-best-practices-f30540e22e39
- https://medium.com/@cyanlake/de-module4-2-introduction-to-dbt-and-the-analytics-engineering-mindset-3e08732b25b5
- https://medium.com/@cyanlake/de-module4-3-a-comprehensive-guide-to-dbt-materializations-5f933ff238a9
- https://medium.com/@cyanlake/de-module4-3-1-handling-changing-order-states-with-incremental-models-773c4ef06b7b
Homework 5: Batch
Score: 13 = 6 (questions) + 0 (FAQ) + 7 (learning in public)
Homework URL: View submission
Learning in public links: Show
- https://medium.com/@cyanlake/de-module5-1-unleashing-the-power-of-apache-spark-for-data-analysts-and-ml-engineers-9f15c2126d68
- https://medium.com/@cyanlake/de-module5-1harnessing-big-data-with-apache-spark-insights-from-netflix-and-industry-trends-d4dd0fdefcb8
- https://medium.com/@cyanlake/de-module5-3-navigating-the-pyspark-installation-maze-with-conda-075e357b8284
- https://medium.com/@cyanlake/de-module5-4-explain-sparks-job-stage-and-task-concepts-using-a-restaurant-analogy-efc19662bf48
- https://medium.com/@cyanlake/de-module5-how-pyspark-performs-groupby-operations-029d16f26d80
- https://medium.com/@cyanlake/de-module5-6-how-pyspark-performs-join-operations-60d5a736979d
- https://medium.com/@cyanlake/de-module5-7-understanding-shuffle-operations-in-pyspark-1beed511989e
Homework 6: Streaming
Score: 7 = 6 (questions) + 0 (FAQ) + 1 (learning in public)
Homework URL: View submission
Learning in public links: Show