Loop monitors several restaurants in the US and needs to track whether each store is online or not. All restaurants are supposed to be online during their business hours. Due to unknown reasons, a store might go inactive for a few hours. Restaurant owners want a report of how often this happened in the past.
We will have 3 sources of data:
- We poll every store roughly every hour and record whether the store was active or not in a CSV. The CSV has 3 columns (`store_id, timestamp_utc, status`), where status is `active` or `inactive`.
  - All timestamps are in UTC
  - Data can be found in CSV format here
- We have the business hours of all the stores - schema of this data is `store_id, dayOfWeek (0=Monday, 6=Sunday), start_time_local, end_time_local`
  - These times are in the local time zone
  - If data is missing for a store, assume it is open 24*7
  - Data can be found in CSV format here
- Timezone for the stores - schema is `store_id, timezone_str`
  - If data is missing for a store, assume it is America/Chicago
  - This is used so that data sources 1 and 2 can be compared against each other (see the sketch below)
  - Data can be found in CSV format here
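Because polls are in UTC while business hours are local, a conversion step is needed before the two sources can be compared. A minimal sketch using the standard library, with the fallback timezone mentioned above (the sample timestamp is illustrative):

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo  # Python 3.9+

# Convert a UTC poll timestamp into a store's local time so it can be
# compared against start_time_local / end_time_local for that dayOfWeek.
poll_utc = datetime(2023, 1, 25, 14, 5, tzinfo=timezone.utc)
store_tz = ZoneInfo("America/Chicago")  # fallback when timezone data is missing
poll_local = poll_utc.astimezone(store_tz)
print(poll_local.weekday(), poll_local.time())  # weekday(): 0=Monday, matching dayOfWeek
```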
We want to output a report to the user that has the following schema:
`store_id, uptime_last_hour (in minutes), uptime_last_day (in hours), uptime_last_week (in hours), downtime_last_hour (in minutes), downtime_last_day (in hours), downtime_last_week (in hours)`
- Uptime and downtime should only include observations within business hours.
- You need to extrapolate uptime and downtime from the periodic polls we have ingested to the entire business-hours interval.
- e.g., business hours for a store are 9 AM to 12 PM on Monday
  - we only have 2 observations for this store on a particular date (Monday) in our data, at 10:14 AM and 11:15 AM
  - we need to fill the entire business-hours interval with uptime and downtime from these 2 observations based on some sane interpolation logic (see the sketch after the note below)
Note: The data we have given is a static data set, so you can hard code the current timestamp to be the max timestamp among all the observations in the first CSV.
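A minimal sketch of one sane interpolation, under the assumption that each poll represents the store's status from the midpoint with the previous poll to the midpoint with the next, clipped to the window (timestamps already converted to a common timezone; names are illustrative):

```python
from datetime import datetime, timedelta

def uptime_downtime(window_start: datetime, window_end: datetime,
                    observations: list[tuple[datetime, str]]) -> tuple[timedelta, timedelta]:
    """Split a business-hours window into uptime and downtime.

    Each poll is assumed to represent the store's status from the midpoint
    with the previous poll to the midpoint with the next, clipped to the
    window. This is one sane interpolation choice, not the only one.
    """
    up = down = timedelta()
    obs = sorted((ts, s) for ts, s in observations if window_start <= ts <= window_end)
    if not obs:
        return up, down  # no polls in the window; caller picks a default
    for i, (ts, status) in enumerate(obs):
        left = window_start if i == 0 else obs[i - 1][0] + (ts - obs[i - 1][0]) / 2
        right = window_end if i == len(obs) - 1 else ts + (obs[i + 1][0] - ts) / 2
        if status == "active":
            up += right - left
        else:
            down += right - left
    return up, down

# The example above: polls at 10:14 (active) and 11:15 (inactive), 9 AM-12 PM window.
day = datetime(2023, 1, 23)  # a Monday
up, down = uptime_downtime(
    day.replace(hour=9), day.replace(hour=12),
    [(day.replace(hour=10, minute=14), "active"),
     (day.replace(hour=11, minute=15), "inactive")],
)
print(up, down)  # 1:44:30 uptime, 1:15:30 downtime
```

Here the midpoint between the two polls (10:44:30) is the switchover point, so the active poll covers 9:00-10:44:30 and the inactive poll covers the rest of the window.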
- Tech: Python (Starlette framework) and a PostgreSQL database. Report generation is handled as a background task using Celery and RabbitMQ.
- /trigger_report endpoint that will trigger report generation from the data provided (stored in DB)
- No input
- Output - report_id (random string)
- report_id will be used for polling the status of report completion
- /get_report endpoint that will return the status of the report or the CSV
- Input - report_id
- Output
- if report generation is not complete, return “Running” as the output
- if report generation is complete, return “Complete” along with the CSV file with the schema described above.
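A minimal sketch of both endpoints, assuming Starlette routing, a Celery task module named `tasks` exposing a `generate_report` task, and a file-based completion check (all of these names and the `reports/` layout are assumptions, not the definitive implementation):

```python
import os
import uuid

from starlette.applications import Starlette
from starlette.responses import FileResponse, JSONResponse, PlainTextResponse
from starlette.routing import Route

from tasks import generate_report  # hypothetical Celery task module


def report_is_complete(report_id: str) -> bool:
    # Placeholder for a DB status lookup; here we just check whether the
    # worker has written the output file yet (an assumption about layout).
    return os.path.exists(f"reports/{report_id}.csv")


async def trigger_report(request):
    report_id = uuid.uuid4().hex          # random string used for polling
    generate_report.delay(report_id)      # enqueue report generation
    return JSONResponse({"report_id": report_id})


async def get_report(request):
    report_id = request.query_params["report_id"]
    if not report_is_complete(report_id):
        return PlainTextResponse("Running")
    # Complete: return the generated CSV along with the status.
    return FileResponse(f"reports/{report_id}.csv", media_type="text/csv",
                        headers={"X-Report-Status": "Complete"})


app = Starlette(routes=[
    Route("/trigger_report", trigger_report, methods=["POST"]),
    Route("/get_report", get_report),
])
```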
- Clone the repo
- Install Poetry using `pip3 install poetry`
- Install the modules used in the project using `poetry update`
- Install the project using `poetry install`
- Set up the database and tables, and load the initial data from the CSVs, using `python3 {full_file_path_models.py}`
- Start the server using `uvicorn server:app`
- Start Celery in another terminal using `celery -A tasks.celery worker --concurrency 4 --loglevel=info` (add `-P solo` if on Windows)
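For reference, a minimal sketch of what the `tasks` module invoked by the worker command above might contain; the broker URL and task body are assumptions for a local RabbitMQ setup:

```python
# tasks.py -- minimal sketch matching `celery -A tasks.celery worker`;
# the broker URL below is the default for a local RabbitMQ (assumption).
from celery import Celery

celery = Celery("tasks", broker="amqp://guest:guest@localhost:5672//")


@celery.task
def generate_report(report_id: str) -> None:
    # Placeholder: compute per-store uptime/downtime from the polls stored
    # in PostgreSQL and write the CSV keyed by report_id for /get_report.
    ...
```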