[CELEBORN-1743] Resolve the metrics data interruption and the job failure caused by locked resources #3005

Closed

zaynt4606
Contributor

…lure caused by locked resources

Remove the ConcurrentLinkedQueue and the lock in AbstractSource, which might cause metrics data interruption and job failure.

Current problem: see JIRA CELEBORN-1743. The lock introduced in [CELEBORN-1453] might block the thread.
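As a rough illustration of the idea (not the actual patch), the sketch below shows one way a metrics buffer can be filled by worker threads and drained by a reporter without a shared explicit lock. `TimerMetricsBuffer`, `record`, and `drain` are hypothetical names chosen for this example and are not part of Celeborn's `AbstractSource`; the real change may use a different data structure.

```scala
import java.util.concurrent.ConcurrentHashMap
import scala.collection.mutable.ArrayBuffer

// Hypothetical stand-in for a metrics source; NOT Celeborn's AbstractSource.
class TimerMetricsBuffer {
  // Concurrent set backed by ConcurrentHashMap: writers and the reader
  // do not contend on one explicit lock.
  // Assumption: identical lines may be deduplicated, since this is a set.
  private val pending = ConcurrentHashMap.newKeySet[String]()

  // Called from worker threads; does not wait for the reporter.
  def record(line: String): Unit = {
    pending.add(line)
    ()
  }

  // Called by the reporter thread. The iterator is weakly consistent, so
  // entries added during the drain are either returned now or left for
  // the next drain; an entry is returned only if this call removed it.
  def drain(): Seq[String] = {
    val out = ArrayBuffer.empty[String]
    val it = pending.iterator()
    while (it.hasNext) {
      val line = it.next()
      if (pending.remove(line)) out += line
    }
    out.toSeq
  }

  def size: Int = pending.size()
}
```

With this shape, a slow or stalled reporter cannot hold a lock that task threads need in `record`, which is the kind of blocking the description attributes to the lock from CELEBORN-1453.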

No user-facing change.

Manual test
Same result as with CELEBORN-1453.
image

Closes #2956 from zaynt4606/clb1743.

Authored-by: zhengtao [email protected]

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

…lure caused by locked resources

Remove the ConcurrentLinkedQueue and the lock in AbstractSource, which might cause metrics data interruption and job failure.

Current problem: see [JIRA CELEBORN-1743](https://issues.apache.org/jira/browse/CELEBORN-1743). The lock introduced in [[CELEBORN-1453]](apache#2548) might block the thread.

No user-facing change.

Manual test
Same result as with CELEBORN-1453.
![image](https://github.com/user-attachments/assets/3e3a4c53-1cf6-48f6-8c37-67d875d675af)

Closes apache#2956 from zaynt4606/clb1743.

Authored-by: zhengtao <[email protected]>
Signed-off-by: Shuang <[email protected]>
@zaynt4606
Contributor Author

cherry-pick to branch-0.5


codecov bot commented Dec 18, 2024

Codecov Report

Attention: Patch coverage is 0% with 51 lines in your changes missing coverage. Please review.

Project coverage is 41.16%. Comparing base (5c6a2c8) to head (a45c089).
Report is 26 commits behind head on branch-0.5.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...eleborn/common/metrics/source/AbstractSource.scala | 0.00% | 51 Missing ⚠️ |
Additional details and impacted files
@@              Coverage Diff               @@
##           branch-0.5    #3005      +/-   ##
==============================================
- Coverage       41.19%   41.16%   -0.02%     
==============================================
  Files             226      226              
  Lines           14488    14503      +15     
  Branches         1308     1312       +4     
==============================================
+ Hits             5967     5969       +2     
- Misses           8179     8193      +14     
+ Partials          342      341       -1     


Contributor

@FMX FMX left a comment

LGTM. Merged into branch-0.5 (v0.5.3).

FMX pushed a commit that referenced this pull request Dec 18, 2024
…lure caused by locked resources

Remove the ConcurrentLinkedQueue and the lock in AbstractSource, which might cause metrics data interruption and job failure.

Current problem: see [JIRA CELEBORN-1743](https://issues.apache.org/jira/browse/CELEBORN-1743). The lock introduced in [[CELEBORN-1453]](#2548) might block the thread.

No user-facing change.

Manual test
Same result as with CELEBORN-1453.
![image](https://github.com/user-attachments/assets/3e3a4c53-1cf6-48f6-8c37-67d875d675af)

Closes #2956 from zaynt4606/clb1743.

Authored-by: zhengtao <shuaizhentao.szt@alibaba-inc.com>

### What changes were proposed in this pull request?

### Why are the changes needed?

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

Closes #3005 from zaynt4606/branch-0.5-dev.

Authored-by: zhengtao <[email protected]>
Signed-off-by: mingji <[email protected]>
@FMX FMX closed this Dec 18, 2024