airscholar/README.md

Hey there 👋, I'm Yusuf!

LinkedIn Medium Stackoverflow Dev.to

👨🏻‍🎓 Academic experience:

📝 I regularly write articles:

  • On Medium about programming, data science and AI
  • On HackerNoon about programming, data science and AI
  • On Dev.to about programming, data science and AI

📺 Latest YouTube Videos

  • Apache Iceberg Explained in 10 Minutes – Everything You Need to Know!
  • End to End Modern Distributed Data Lakehouse using Apache Iceberg, Trino, Airflow, DBT and Minio
  • Building End to End #salesforecasting Machine Learning #pipeline from scratch
  • Building Realtime End to End Sales Forecasting AI from Scratch
  • 10 Step Roadmap to Become an Expert Data Engineer with Projects
  • DevOps in Data Engineering: End-to-End Automation with CI/CD, Terraform & AWS - PART 1
  • Build Realtime Fraud Detection AI from Scratch - End to End Machine Learning Project - Part 1
  • #MachineLearning, #DevOps and #CICD in #DataEngineering - End to End Data Engineering Project
  • Realtime Logs Processing with Apache Airflow, Kafka and Elasticsearch - PART 1

📚 Latest Medium Stories

Pinned

  1. e2e-data-engineering Public

    An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…

    Python · 287 stars · 130 forks

  2. RedditDataEngineering Public

    This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and serv…

    Python · 164 stars · 81 forks

  3. changecapture-e2e Public

    This project shows how to capture changes from a PostgreSQL database and stream them into Kafka.

    Python · 38 stars · 20 forks

  4. RealtimeStreamingEngineering Public

    This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenAI LLM, Kafka and Elasticsearch. It covers each stage from da…

    Python · 43 stars · 29 forks

  5. FootballDataEngineering Public

    An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Fa…

    Python · 27 stars · 23 forks

  6. ApacheFlink-SalesAnalytics Public

    This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, s…

    Java · 11 stars · 9 forks