Improvements for MG collections #11161

Open
wants to merge 1 commit into
base: master
from

Conversation

petr-balogh
Member

@petr-balogh petr-balogh commented Jan 17, 2025

Fixes: #10526
Fixes: #11159

Several improvements to must-gather (MG) log collection, such as no longer
re-running MG over and over when it keeps failing or timing out.

Collect OCP logs for Ecosystem tests, such as upgrade tests decorated with
purple squad.

Do not collect logs again at the end of the execution on success if they were
already collected at least once during the execution because of a failed test.

@petr-balogh petr-balogh requested a review from a team as a code owner January 17, 2025 16:57
@pull-request-size pull-request-size bot added the size/L PR that changes 100-499 lines label Jan 17, 2025
@petr-balogh petr-balogh force-pushed the improvmentes_for_mg_collections branch 3 times, most recently from c368dd8 to eabf343 Compare January 17, 2025 17:39
@petr-balogh
Member Author

Trying to verify here:
https://url.corp.redhat.com/1f1ea06

@petr-balogh petr-balogh force-pushed the improvmentes_for_mg_collections branch from eabf343 to 752cfd4 Compare January 17, 2025 20:18
@petr-balogh
Member Author

New verification triggered here:
https://url.corp.redhat.com/108c27f

@petr-balogh petr-balogh force-pushed the improvmentes_for_mg_collections branch 2 times, most recently from 0befd70 to bfe7cf7 Compare January 20, 2025 16:56
@petr-balogh
Member Author

Verification job:
https://url.corp.redhat.com/95eab59

@petr-balogh petr-balogh force-pushed the improvmentes_for_mg_collections branch 2 times, most recently from 2b07848 to a272f62 Compare January 21, 2025 14:35
dahorak
dahorak previously approved these changes Jan 21, 2025
Contributor

@dahorak dahorak left a comment

LGTM

@petr-balogh
Member Author

Verification job:
https://url.corp.redhat.com/d37eb8b

Fixes: red-hat-storage#10526
Fixes: red-hat-storage#11159

Several improvements to must-gather (MG) log collection, such as no longer
re-running MG over and over when it keeps failing or timing out.

Collect OCP logs for Ecosystem tests, such as upgrade tests decorated with
purple squad.

Do not collect logs again at the end of the execution on success if they were
already collected at least once during the execution because of a failed test.

Signed-off-by: Petr Balogh <[email protected]>

openshift-ci bot commented Jan 21, 2025

New changes are detected. LGTM label has been removed.


openshift-ci bot commented Jan 21, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: petr-balogh

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment


"mcg",
"purple_squad",
}
# For every failure in MG we are trying to extend next attempt by 20 minutes
Contributor

This would be up to a 2-hour wait (20 min + 40 min + 60 min) if the default max_mg_fail_attempts is used.
Isn't that too much time?
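
To make the arithmetic concrete, here is a minimal sketch of how the extra wait could add up. It assumes the extension grows linearly in 20-minute steps with each failed attempt, and that the default max_mg_fail_attempts is 3, which is only inferred from the three terms in the sum above. The function name is hypothetical and not taken from this PR.

```python
def total_extension_minutes(fail_attempts: int, step_minutes: int = 20) -> int:
    """Sum the per-attempt extensions: step, 2*step, ..., fail_attempts*step."""
    return sum(step_minutes * attempt for attempt in range(1, fail_attempts + 1))


# Assumed default of 3 failed attempts (inferred from the comment, not confirmed).
print(total_extension_minutes(3))  # 20 + 40 + 60 = 120 minutes
```

If the growth were linear like this, lowering either the step or the maximum number of attempts would shrink the worst case quickly, since the total grows quadratically with the attempt count.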

max_mg_fail_attempts = config.REPORTING.get("max_mg_fail_attempts")
if skip_after_max_fail:
with mg_lock:
if mg_fail_count > max_mg_fail_attempts:
Contributor

Are we deleting the MG directory structure in case of a timeout failure?
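
For context on the guard this question is attached to, the quoted check could be sketched roughly as below. Only the names mg_lock, mg_fail_count, max_mg_fail_attempts and skip_after_max_fail come from the quoted diff; the class and its methods are hypothetical, and the sketch does not cover directory cleanup, so it does not answer the question itself.

```python
import threading


class MGFailureTracker:
    """Thread-safe counter of failed must-gather attempts (hypothetical helper)."""

    def __init__(self, max_mg_fail_attempts: int) -> None:
        self.max_mg_fail_attempts = max_mg_fail_attempts
        self.mg_fail_count = 0
        self.mg_lock = threading.Lock()

    def record_failure(self) -> None:
        # Increment under the lock so concurrent collectors do not lose updates.
        with self.mg_lock:
            self.mg_fail_count += 1

    def should_skip(self, skip_after_max_fail: bool = True) -> bool:
        # Mirrors the quoted condition: skip only once the count exceeds the limit.
        if not skip_after_max_fail:
            return False
        with self.mg_lock:
            return self.mg_fail_count > self.max_mg_fail_attempts
```

With a limit of 3, the tracker would start skipping only after the fourth recorded failure, because the quoted comparison is strict (`>`), not `>=`.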

Development

Successfully merging this pull request may close these issues.

MG makes a lot of noise written to info log level
OCP must gather is not collected
3 participants