This repository has been archived by the owner on Sep 30, 2024. It is now read-only.
Problem
Devs often don’t know what the pipeline is doing or why their build has failed.
We (devx) are often asked to help diagnose the actual problem, because we have in-depth knowledge of the pipeline and how it works.
The pipeline needs to be less of a black box and surface useful information that lets devs investigate further on their own.
Boundaries
Scope
Our primary concern is the pipeline and its communication channels. As it stands, the pipeline has three communication channels:
Buildkite UI
Slack notifications
sg ci status
Each channel can be enhanced to provide more contextual information and easy on-ramps to other tools that help with failure diagnosis.
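To make "more contextual information" concrete, here is a minimal sketch of a failure summary that any of the three channels could render. It is an illustration only: `FailedStep`, `summarize_failure`, and the `example.com` log URL are hypothetical names invented for this sketch, not part of Buildkite, Slack, or `sg`.

```python
from dataclasses import dataclass


@dataclass
class FailedStep:
    """One failed pipeline step (hypothetical shape, not a Buildkite API type)."""
    name: str
    exit_code: int
    log_url: str  # on-ramp to the full log for further investigation


def summarize_failure(build_number: int, branch: str, steps: list[FailedStep]) -> str:
    """Render a short plain-text summary naming each failed step and its log."""
    if not steps:
        return f"Build #{build_number} on {branch} passed."
    lines = [f"Build #{build_number} on {branch} failed in {len(steps)} step(s):"]
    for step in steps:
        lines.append(f"- {step.name} (exit code {step.exit_code}): {step.log_url}")
    return "\n".join(lines)


if __name__ == "__main__":
    failed = [FailedStep("go-test", 1, "https://example.com/logs/1")]
    print(summarize_failure(42, "main", failed))
```

The summary names the failing step and links straight to its log, so the reader does not have to work out where to look first.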
Improving pipeline stability is also part of improving pipeline communication: a false positive or a bug communicates the wrong thing and confuses the end user. Addressing bugs is therefore in scope for this bet.
Out of scope
Nice to haves = eye candy
Speed improvements
Test out Buildkite tracing / OpenTelemetry
Better test output, which can help make a broken pipeline more actionable
Definition of Done
Payout:
Tracked issues
@unassigned
Completed
@burmudar: 3.00d
Completed: 3.00d
Legend
Brainstorm doc