Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle exec error #722

Merged
merged 2 commits into from
Mar 13, 2025
Merged

Handle exec error #722

merged 2 commits into from
Mar 13, 2025

Conversation

headius
Copy link
Contributor

@headius headius commented Mar 13, 2025

Apparently this exec errors when the subprocess fails rather than returning an error code. This causes the whole setup process to terminate. This patch catches the error and uses that to indicate failure to launch.

Hopefully will address the issue in ruby/jruby-dev-builder#10 (comment)

@eregon
Copy link
Member

eregon commented Mar 13, 2025

The docs are at https://github.com/actions/toolkit/tree/main/packages/exec
Could you try https://github.com/actions/toolkit/blob/d9347d4ab99fd507c0b9104b2cf79fb44fcc827d/packages/exec/src/interfaces.ts#L27-L28 ?

The try/catch works, but might hide other issues which we shouldn't necessarily ignore (e.g. passing bad arguments to that function).

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

@eregon I'll try. I don't know the syntax for that but I'll figure it out.

Apparently this exec errors when the subprocess fails rather than
returning an error code. This causes the whole setup process to
terminate. This patch catches the error and uses that to indicate
failure to launch.
@headius headius force-pushed the handle_jruby_exec_error branch from 3224696 to 82eb255 Compare March 13, 2025 18:02
@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

@eregon In theory the latest attempt should do it. TruffleRuby failure seems very much unrelated.

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

Sorry about the churn here but without a better way to test this we're forced to iterate in the wild. Does GHA have any test instance or harness that can be used off the grid?

@eregon
Copy link
Member

eregon commented Mar 13, 2025

There is https://github.com/nektos/act but I haven't tried it.
One idea is we should have tried passing an invalid flag instead of ruby --version in the original PR to simulate the JRuby 10 case of process exiting with non-zero.

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

@eregon I pushed a throw-away commit to test with a bad flag.

@eregon
Copy link
Member

eregon commented Mar 13, 2025

It seems to hang on Windows, see https://github.com/ruby/setup-ruby/actions/runs/13841525436/job/38730124830?pr=722 and https://github.com/ruby/setup-ruby/actions/runs/13841525436/job/38730135071?pr=722
It uses D:\jruby-head\bin\ruby.bat there, the .bat (EDIT:) I think that shouldn't be used, there is an executable from jruby-launcher IIRC.

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

Is there something else the JRuby install on Windows does to make it runnable? I literally used the same code you had for the final version print-out and it did not hang there.

@eregon
Copy link
Member

eregon commented Mar 13, 2025

Yeah, so it seems a JRuby 10 + Windows bug.
JRuby 9 worked fine, and also used the .bat: https://github.com/ruby/setup-ruby/actions/runs/13820766951/job/38665461620#step:3:38

@eregon
Copy link
Member

eregon commented Mar 13, 2025

I think at this point we won't find a quick fix, it sounds like a difference between JRuby 9 and 10.
To avoid breaking existing users of jruby-head, I suggest to revert ruby/jruby-dev-builder#10 and trigger a build there. I let you do it, you should already have the permissions for that.

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

@eregon The bat file is not the preferred way to launch on Windows and may introduce other problems. Why is it running with the bat file?

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

We are looking into the --version hang on Windows.

@eregon
Copy link
Member

eregon commented Mar 13, 2025

I don't know, that may be an issue of exec.exec, or the builds of JRuby on Windows.
Example build of jruby-head and relevant line about .bat: https://github.com/ruby/jruby-dev-builder/actions/runs/13839648355/job/38723593911#step:10:6

Example build of a jruby release build: https://github.com/ruby/ruby-builder/actions/runs/13047231536/job/36399785028#step:9:7

I misremembered about jruby-launcher, that's only done for releases and on linux/macOS, e.g.
https://github.com/ruby/ruby-builder/actions/runs/13047231536/job/36399786116#step:19:24

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

JRuby's dist (tar.gz and zip)includes a jruby.exe in bin out of the box.

We cannot reproduce the hang on a local Windows system.

@eregon
Copy link
Member

eregon commented Mar 13, 2025

See jruby/jruby#6042 on the reasoning for adding jruby.bat

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

I will try something here as a test. No way to verify these things locally and it does not hang with any combination of 10 build and ruby.bat for us.

@eregon
Copy link
Member

eregon commented Mar 13, 2025

Feel free to try changing https://github.com/ruby/jruby-dev-builder/blob/a4cdaa60a48c4bfb237f6071f950752ec59ec6e0/.github/workflows/build.yml#L129-L130 since anyway jruby-head on Windows is broken.
Maybe the .bat is no longer needed for some reason?
Possibly related to the 64-bit vs 32-bit thing mentioned in jruby/jruby#6042

I'll log off now, it's quite late here.
If you want to unbreak jruby-head for existing setup-ruby users, the easiest way is #722 (comment)

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

I will be forced to undo the 9.4 to 10 change if you will be unable to merge anything further here, but I will also be unable to fix anything here.

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

Switching to the exe did not help either so it's not that. We are trying to find a way to reproduce the issue on Windows. I will keep trying things here but will revert the jruby-head change before giving up for the day.

@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

I have fixed the Windows hangs by bypassing the launcher altogether. We have no explanation for it hanging, as both the tarball and a local HEAD build on Windows 10 both work fine, with either a good or bad JDK.

Bypassing the launcher for this JRuby-specific logic is also probably better; it's fewer processes to launch and avoids variability in the launcher behavior.

Native launcher appears to hang on GHA when the JDK version is too
old for JRuby 10. This bypasses it and should be a bit lighter. It
will also avoid any variability in the launcher used, since Unixes
and Windows already use different executables.
@headius headius force-pushed the handle_jruby_exec_error branch from a2bc55b to 31ccf06 Compare March 13, 2025 19:24
@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

All tests are green with none hanging at 31ccf06.

@eregon
Copy link
Member

eregon commented Mar 13, 2025

OK, quite weird, but this solution seems alright, merging.

@eregon eregon merged commit 6c79f72 into ruby:master Mar 13, 2025
227 checks passed
@headius headius deleted the handle_jruby_exec_error branch March 13, 2025 19:35
@headius
Copy link
Contributor Author

headius commented Mar 13, 2025

Is the fix released then? I just started a new jruby-dev-builder release but cancelled it when I saw you merged.

@eregon
Copy link
Member

eregon commented Mar 13, 2025

Yes, it's released, the passing CI green here means everything is fine.

await measure("Modifying JAVA_HOME for JRuby", async () => {
console.log("attempting to run with existing JAVA_HOME")

let ret = await exec.exec('ruby', ['--version'])
let ret = await exec.exec('java', ['-jar', path.join(rubyPrefix, 'lib/jruby.jar'), '--version'], {ignoreReturnCode: true})
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be $JAVA_HOME/bin/java (or process.env, you get the idea) as java in PATH might resolve to something different and in fact it does I saw /usr/bin/java in logs. Could you make a PR to address that?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

2 participants