Postmortem: Taskcluster Decision Task PR broke master commit checks (multiple times) #20523
Closed
2 tasks
Labels
postmortem
This issue is a postmortem for some outage
Owner: @stephenmcgruer
Postmortem Created: 2019-11-28 21:45 EST
Status: In Review
Issue: No specific issue for landing the Taskcluster Decision Graph change exists. See here.
Impact: Medium. Approximately 14 (4 for the first landing, 10 for the second) commits to master did not run Taskcluster successfully. This means that experimental Chrome and Firefox results for those commits are not available on wpt.fyi (example). The change also broke the epoch branches responsible for producing stable runs, and we lose a few days of stable results.
Root Cause: A planned change from a single status Taskcluster yaml file to a 'decision task' approach had a number of bugs and unintended side effects, which were only discovered as it rolled out.
Timeline
Lessons Learnt
Things that went well
Things that went poorly
Where we got lucky
Action Items
The text was updated successfully, but these errors were encountered: