You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If a test crashes, e.g. because of OOM, then the temporary files it creates do not get cleaned up properly. This recently caused the storage on our local CI runner to fill up (all 10 TB) and causes all subsequent tests to fail because they couldn't write anything to disk.
The issue came up from flux_test.py But other tests likely do the same.
Tests need to clean up after themselves, even if they crash.
Tests need to store their temp artifacts in a better place, currently they are being stored at the top level of the directory, making it difficult to find anything.
The text was updated successfully, but these errors were encountered:
Alex-Vasile
added
bug
Something isn't working
infra
General category for infrastructure-related requests for common triaging and prioritization
labels
Feb 18, 2025
We shouldn't assume that tests will be able to clean up after themselves. Runners can go offline in the middle of a job for various reasons, especially if a test fails or crashes.
The usual tricks are:
Store temporary files in a designated temporary directory
If a test crashes, e.g. because of OOM, then the temporary files it creates do not get cleaned up properly. This recently caused the storage on our local CI runner to fill up (all 10 TB) and causes all subsequent tests to fail because they couldn't write anything to disk.
The issue came up from flux_test.py But other tests likely do the same.
The text was updated successfully, but these errors were encountered: