
jwittbold/spring_capital


Ingest and transform stock data on Azure Blob Storage using PySpark

Analytical ETL

Step 1

• Moving average (screenshot: step_1_moving_avg)
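As a rough illustration of the moving-average step, here is a minimal plain-Python sketch of a trailing, per-symbol average of trade prices. It is a stand-in for the logic only, not the repository's PySpark code: the field names (`symbol`, `event_tm`, `trade_pr`, `mov_avg_pr`) and the 30-minute window are assumptions. In PySpark the same shape is commonly expressed with a window partitioned by symbol, ordered by event time, and bounded with `rangeBetween`.

```python
from collections import defaultdict, deque
from datetime import datetime, timedelta


def trailing_moving_avg(trades, window=timedelta(minutes=30)):
    """Add 'mov_avg_pr' to each trade: the mean trade price for the same
    symbol over the trailing window that ends at that trade (inclusive).

    All field names and the default window length are hypothetical.
    """
    recent = defaultdict(deque)  # symbol -> (event_tm, price) pairs still in the window
    result = []
    for trade in sorted(trades, key=lambda t: (t["symbol"], t["event_tm"])):
        in_window = recent[trade["symbol"]]
        in_window.append((trade["event_tm"], trade["trade_pr"]))
        # Evict trades that fall before the start of the window.
        while in_window[0][0] < trade["event_tm"] - window:
            in_window.popleft()
        avg = sum(price for _, price in in_window) / len(in_window)
        result.append({**trade, "mov_avg_pr": avg})
    return result
```

The function returns rows sorted by symbol and then by event time, so callers that need the original order should re-sort.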

Step 2

• Previous day closing prices (screenshot: step_2_earlier_day_closing_pr)

Step 3

• Union Quote and Trade records (screenshot: step_3_union_quote_trade)

Step 4

• Last trade moving average (screenshot: step_4_last_trade_moving_avg)

Step 5

• Filter for Quote records (screenshot: step_5_filter_quote_records)

Step 6

• Broadcast join (screenshot: step_6_final_broadcast_join)
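Steps 3 through 6 can be read as one flow: union quotes and trades, carry the latest trade values forward onto each quote, keep only the quotes, then attach the previous day's close from a small lookup. The sketch below shows that flow in plain Python under stated assumptions. It is not the repository's PySpark code, and every field name (`rec_type`, `trade_pr`, `mov_avg_pr`, `close_pr`, and so on) is hypothetical. The dictionary lookup plays the role that a broadcast join plays in Spark, where the small table is shipped to every executor via `pyspark.sql.functions.broadcast`.

```python
def enrich_quotes(quotes, trades, prior_close):
    """Return quote records enriched with the latest preceding trade and
    the previous day's closing price.

    quotes, trades: lists of dicts with 'symbol' and a sortable 'event_tm'.
    prior_close:    dict of symbol -> previous day's close (small lookup).
    All field names here are assumptions made for illustration.
    """
    # Step 3: union both record types into one stream, tagged by type.
    unioned = [{**t, "rec_type": "T"} for t in trades]
    unioned += [{**q, "rec_type": "Q"} for q in quotes]
    unioned.sort(key=lambda r: (r["symbol"], r["event_tm"]))

    last_trade = {}  # symbol -> most recent trade record seen so far
    enriched = []
    for rec in unioned:
        if rec["rec_type"] == "T":
            last_trade[rec["symbol"]] = rec
            continue  # Step 5: only quote records are emitted
        prev = last_trade.get(rec["symbol"], {})
        enriched.append({
            **rec,
            # Step 4: carry the last trade's price and moving average forward.
            "last_trade_pr": prev.get("trade_pr"),
            "last_mov_avg_pr": prev.get("mov_avg_pr"),
            # Step 6: lookup join against the small prior-day table.
            "close_pr": prior_close.get(rec["symbol"]),
        })
    return enriched
```

Quotes that arrive before any trade for their symbol get `None` for the carried-forward fields, which mirrors the nulls a left join or an unfilled window would produce.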

Track Job Status

Extract

• Successful extract.py job run (screenshot: extract_success)

End of day load

• Successful EOD_load.py job run (screenshots: eod_quote, eod_trade_success)

Analytics

• Successful analytical_ETL.py job run (screenshot: analytics_success)

Job status in PostgreSQL table

• Successfully updated the job status table after each job run (screenshot: job_tracker_postgres_table)
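To show the general shape of a job status table, here is a minimal sketch that uses Python's built-in `sqlite3` as a stand-in for PostgreSQL so it runs without a database server. The table name, column names, and job ID format are assumptions, not the repository's schema. Against PostgreSQL, the same upsert would typically be written as `INSERT ... ON CONFLICT (job_id) DO UPDATE` through a driver such as psycopg2.

```python
import sqlite3
from datetime import datetime, timezone


def open_tracker(path=":memory:"):
    """Open a connection and create the (hypothetical) job_tracker table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS job_tracker ("
        "job_id TEXT PRIMARY KEY, "
        "status TEXT NOT NULL, "
        "updated_time TEXT NOT NULL)"
    )
    return conn


def record_job_status(conn, job_id, status):
    """Insert the job's status, or overwrite it if the job_id already exists."""
    conn.execute(
        "INSERT OR REPLACE INTO job_tracker (job_id, status, updated_time) "
        "VALUES (?, ?, ?)",
        (job_id, status, datetime.now(timezone.utc).isoformat()),
    )
    conn.commit()


def get_job_status(conn, job_id):
    """Return the stored status for job_id, or None if it was never recorded."""
    row = conn.execute(
        "SELECT status FROM job_tracker WHERE job_id = ?", (job_id,)
    ).fetchone()
    return row[0] if row else None
```

Keying the table on the job ID keeps one row per job run, so a rerun of the same job overwrites its earlier status instead of adding a duplicate row.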

About

ETL pipeline for stock data using Spark on Azure
