-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[queue time] Add Job Queue Time hook #6417
Conversation
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
id: aws_creds | ||
uses: aws-actions/configure-aws-credentials@v3 | ||
with: | ||
role-to-assume: arn:aws:iam::308535385114:role/gha_workflow_update_queue_times |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FYI I used the same aws role as the update-queue-times.yml.
@jeanschmidt
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't do that, each role for each UC.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sonds good
I don't like that we're using test-infra ci for those kind of jobs. I believe we should stop this bad engineering standard. Maybe this should be a AWS lambda? |
@@ -0,0 +1,26 @@ | |||
name: Update job queue times dataset |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think it is a good standard to use GHA CI for chron jobs, I believe we should be using lambdas or other things for this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
just curious, is there reason why the old one exists?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We need to think better ways of doing this.
Description
Add git work to insert new queue time data into s3 every 15 minutes,
the new data includes:
{ queue_s, repo, workflow_name , job_name, machine_type, time}
Pending PR for permission:
https://github.com/pytorch-labs/pytorch-gha-infra/pull/627
Pending PR for db schema:
#6418
Design Doc:
https://docs.google.com/document/d/1OiPv-ku_NvMgvnaMIbtnIEHTixJY28CBaKeMKJtglEk/edit?tab=t.0#heading=h.87bg49h503n7
Details
working result in s3
https://us-east-1.console.aws.amazon.com/s3/object/ossci-raw-job-status?region=us-east-1&bucketType=general&prefix=job_queue_times_historical/pytorch/pytorch/1742001203.txt