Skip to content

Latency logging#244

Open
mccabete wants to merge 15 commits intoconus-dpsfrom
latency_logging
Open

Latency logging#244
mccabete wants to merge 15 commits intoconus-dpsfrom
latency_logging

Conversation

@mccabete
Copy link
Contributor

@mccabete mccabete commented Mar 9, 2026

It's been really hard to stay on top of our data latency issues. This is in part because the data the users see can be delayed for a variety of reasons, but we only have alerts for run failures. This is a tack on to the existing alerting system that will let us know if the data the users are seeing is going stale faster than we want it to.

mccabete added 15 commits March 6, 2026 14:35
Adding a logging & alert system when data latency is greater than 16 hours (12 hours expected).
action.yaml, cribbed from alerting .yaml
Quickstart on job
query the api and check if most recent data is more than a certain number of hours stale compared to now. If it is, send an alert with a statement about the time difference.
12 hour latency expectation
getting rid of MAAP specific stuff
piping results of latency_logging.py to $GITHUB_OUTPUT for display in workflow
trying to capture output from stdout
adding action id and output
Adding alert parameter
add alert boolean
first terrible draft of workflow
Runs hourly and will alert if the data latency is longer than the # of hours as defined in latency_logging.py
@mccabete
Copy link
Contributor Author

mccabete commented Mar 9, 2026

This is partially in response to #244 -- but notably this will only "log" by issuing lots of alerts when we have data older than 12 hours.

@mccabete
Copy link
Contributor Author

mccabete commented Mar 9, 2026

@zebbecker could you glance at this and tell me if you think it will break our existing alerts system?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants