This repository contains my notes, lab work, and projects from the IBM Data Engineering Professional Certificate on Coursera.
Note: Courses 1–7 are presented as high-level overviews summarizing key concepts, while detailed documentation and code begin from Course 8 – ETL and Data Pipelines with Shell, Airflow, and Kafka onward.
| # | Course Title | Status |
|---|---|---|
| 1 | Introduction to Data Engineering | ✅ Complete |
| 2 | Python for Data Science, AI & Development | ✅ Complete |
| 3 | Python Project for Data Engineering | ✅ Complete |
| 4 | Introduction to Relational Databases (RDBMS) | ✅ Complete |
| 5 | Databases and SQL for Data Science with Python | ✅ Complete |
| 6 | Hands-on Introduction to Linux Commands and Shell Scripting | ✅ Complete |
| 7 | Relational Database Administration (DBA) | ✅ Complete |
| 8 | ETL and Data Pipelines with Shell, Airflow and Kafka | ✅ Complete |
| 9 | Getting Started with Data Warehousing and BI Analytics | 🚧 Underway |
| 10 | BI Dashboards with IBM Cognos Analytics and Google Looker | 📅 Upcoming |
| 11 | Introduction to NoSQL Databases | 📅 Upcoming |
| 12 | Introduction to Big Data with Spark and Hadoop | 📅 Upcoming |
| 13 | Machine Learning with Apache Spark | 📅 Upcoming |
| 14 | Data Engineering Capstone Project | 📅 Upcoming |
| 15 | Generative AI: Elevate your Data Engineering Career | 📅 Upcoming |
| 16 | Data Engineering Career Guide and Interview Preparation | 📅 Upcoming |
.
├── 01_intro_to_data_engineering/
├── 02_python_for_ds_ai_dev/
├── 03_python_project_for_de/
├── 04_intro_to_rdbms/
├── 05_sql_for_ds_with_python/
├── 06_linux_shell_scripting/
├── 07_rdbms_dba/
├── 08_etl_airflow_kafka/
│ └── src/ # Python source files for ETL workflows
├── 09_dw_and_bi_analytics/
├── 10_bi_dashboards_cognos_looker/
├── 11_intro_to_nosql/
├── 12_big_data_spark_hadoop/
├── 13_ml_with_apache_spark/
├── 14_capstone/
├── 15_genai_for_de/
├── 16_career_guide/
├── utils/
├── data/
├── requirements.txt
└── README.md