Udacity Data Engineering Nanodegree Projects
The course followed a fictitious music streaming company called Sparkify as they scale up their analytics platform.
Projects 1 & 2: Set up on prem Postgres/Cassandra databases.
Project 3: Set up a Redshift cloud database.
Project 4: Migrate from Redshift into a data lake. Use Spark (via EMR) to transform S3 csv into easily queryable dimensional tables.
Project 5: Set up ETL pipelines to Redshift using Airflow.