-
Notifications
You must be signed in to change notification settings - Fork 160
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix Edge::getWebViewWrapper deadlock threat #1466 #1966
Fix Edge::getWebViewWrapper deadlock threat #1466 #1966
Conversation
Ran the failing tests locally - it runs fine with the proposed changes. |
0657aba
to
5c1e180
Compare
5c1e180
to
3302644
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you please sketch a flow that led to a deadlock before and is prevented by this change? I have to admit that though we quickly talked about it, I cannot recap the scenario.
There are also some mistaked in the commit message (in particular waitForLastPEndingLast
is no used qualifier`). Could you please fix them?
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.581 D2279630 #2149048 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" [TestRunner @ 2025-03-14T22:36:05.5804878] almost reached timeout of 40M
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.588 D2279637 #000007 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" vi.test.runtime.TimeoutTimer$ThreadDump: [TestRunner] thread dump for thread: main
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.588 D2279637 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at java.base@21.0.3/jdk.internal.misc.Unsafe.park(Native Method)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.589 D2279638 #000001 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at java.base@21.0.3/java.util.concurrent.locks.LockSupport.park(LockSupport.java:221)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.589 D2279638 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at java.base@21.0.3/java.util.concurrent.CompletableFuture$Signaller.block(CompletableFuture.java:1864)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.589 D2279638 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at java.base@21.0.3/java.util.concurrent.ForkJoinPool.unmanagedBlock(ForkJoinPool.java:3780)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.589 D2279638 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at java.base@21.0.3/java.util.concurrent.ForkJoinPool.managedBlock(ForkJoinPool.java:3725)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.589 D2279638 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at java.base@21.0.3/java.util.concurrent.CompletableFuture.waitingGet(CompletableFuture.java:1898)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.590 D2279639 #000001 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at java.base@21.0.3/java.util.concurrent.CompletableFuture.join(CompletableFuture.java:2117)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.590 D2279639 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.swt.browser.Edge$WebViewProvider.getWebViewWrapper(Edge.java:415)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.590 D2279639 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.swt.browser.Edge$WebViewProvider.getWebView(Edge.java:427)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.590 D2279639 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.swt.browser.Edge.getUrl(Edge.java:954)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.590 D2279639 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.swt.browser.Browser.getUrl(Browser.java:771)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.591 D2279640 #000001 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.ui.internal.intro.impl.presentations.BrowserIntroPartImplementation.saveCurrentPage(BrowserIntroPartImplementation.java:645)
[2025-03-14T21:36:08.996Z] [java] [2025/03/14 22:36:05.591 D2279640 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.ui.internal.intro.impl.model.AbstractIntroPartImplementation.saveState(AbstractIntroPartImplementation.java:302)
[2025-03-14T21:36:08.997Z] [java] [2025/03/14 22:36:05.591 D2279640 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.ui.internal.intro.impl.model.IntroPartPresentation.saveState(IntroPartPresentation.java:427)
[2025-03-14T21:36:08.997Z] [java] [2025/03/14 22:36:05.591 D2279640 #000000 Heap:10240 2856 2005 Meta:733] "TestRunner timer,5,main" at org.eclipse.ui.intro.config.CustomizableIntroPart.saveState(CustomizableIntroPart.java:405) This was the stacktrace of the issue and as you can see, it gets stuck at private WebViewWrapper getWebViewWrapper(boolean waitForPendingWebviewTasksToFinish) {
if(waitForPendingWebviewTasksToFinish) {
processOSMessagesUntil(lastWebViewTask::isDone, exception -> {
lastWebViewTask.completeExceptionally(exception);
throw exception;
}, browser.getDisplay());
}
return webViewWrapperFuture.join();
} at the line return webViewWrapperFuture.join(); while we know the future chain is something like processOSMessagesUntil(lastWebViewTask::isDone, exception -> {
lastWebViewTask.completeExceptionally(exception);
throw exception;
}, browser.getDisplay()); We see it can be completed exceptionally on timeout irrespectve of if webViewWrapperFuture has completed. In this case, we are prone to a deadlock or atleast with my investigation that was the only reasoning I could do for this stacktrace. Also if waitForPendingWebviewTasksToFinish is set to false then it directly tries to access the webViewWrapperFuture which can lead to a deadlock as well. Conclusion: We should definitely wait for webViewWrapperFuture to complete before performing anything. |
This commit makes sure while calling Edge::getWebViewWrapper with waitForPendingWebviewTasksToFinish set to true that the method also checks for the completion of the webViewWrapperFuture before calling a join() on it, which can some time lead to deadlock as we allow forced execptional completion of lastWebViewTask on timeout. contributes to eclipse-platform#1466
3302644
to
5eb3ba2
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, I see no risk with merging this PR.
Follow-up work
We should find a way to make it known that calling webViewWrapperFuture.join()
is a bad idea and that one should use getWebViewWrapper()
instead. But that's for a follow-up PR since it's an internal class and not part of the API (only us, SWT developers, are affected by this).
@HeikoKlare has this been addressed by #1966 (comment) ? |
I just say your answer from yesterday in Slack. Merging. |
This PR makes sure while calling
Edge::getWebViewWrapper
withwaitForPendingWebviewTasksToFinish
set to true that the method also checks for the completion of thewebViewWrapperFuture
before calling ajoin()
on it, which can some time lead to deadlock as we allow forced exceptional completion oflastWebViewTask
on timeout.contributes to #1466