Skip to content

Status BI python DAGs for Airflow

Notifications You must be signed in to change notification settings

status-im/airflow-dags

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Description

This repo contains implementations of Airflow workflows and tasks called respectively DAGs and Operators.

  • DAGs - Direct Acyclic Graphs - Python scripts defining workflows in a way that reflects their relationships.
  • Operators - Python functions which define the individual tasks that are executed as part of a DAG run.

To learn how to write DAGs and Operators read about core concepts and follow the official tutorial.

DAG

This repository contains:

  • website_sync: DAG to launch the Airbyte jobs for the status-website charts.
  • spiff_sync: DAG to synchronize Spiff workflows data
  • dbt: DAG to run all the dbt models,
  • gh_sync: DAG to synchronize data from repository (logos, waku, codex)
  • status_analytics: DAG to extract and clean data from the google play store and Mixpanel
  • shopify: DAG to extract and clean data from Shopify
  • budget: DAG to extract and clean data from an excel sheet where there are budget transactions and forecast data fron Finance team.

The DBT models run in some DAG are stored in dbt-models.

Continuous Integration

Changes pushed to master are automatically fetched to our Airflow instance by the airflow-webhook service.

Branches

This repos has 3 working branches:

Infrastructure

All Airflow infrastructure is managed in the infra-bi repository.