Fix for jobs stuck in configuring state in case of the frontend execution mode #186

Closed
wants to merge 1 commit into from

Conversation

nsimakov
Contributor

Hello,

I had a problem running Slurm in front-end mode (configured with --enable-front-end) for modeling purposes: all submitted jobs stay in the configuring (CF) state:

             JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON) 
                20     debug sleep.jo nikolays CF       1:18      1 n1

It turns out that the test_job_nodes_ready function in job_mgr.c checks the readiness of the job's first node for batch job execution. In front-end mode, however, executing the batch job is the task of the front-end node. So I changed the function to check the front-end node instead, which fixed the problem.
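A minimal sketch of the idea, not the actual patch: the types and function names below (`sketch_node_t`, `sketch_job_t`, `launch_node`, `sketch_job_ready`) are hypothetical stand-ins and not Slurm's real data structures or API, and front-end mode is modeled as a runtime flag purely for illustration. It assumes the first compute node is never marked ready in this setup, which is why a check against it never succeeds.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified stand-ins for illustration only;
 * these are NOT Slurm's real data structures. */
typedef struct {
    const char *name;
    bool ready;                       /* true once the node can run work */
} sketch_node_t;

typedef struct {
    const sketch_node_t *first_node;  /* first allocated compute node */
    const sketch_node_t *front_end;   /* front-end node, NULL if none assigned */
} sketch_job_t;

/* Pick the node whose readiness gates the batch launch:
 * the front-end node in front-end mode, the first node otherwise. */
static const sketch_node_t *launch_node(const sketch_job_t *job,
                                        bool front_end_mode)
{
    assert(job != NULL);
    return front_end_mode ? job->front_end : job->first_node;
}

/* The job may leave the configuring state only when that node is ready. */
static bool sketch_job_ready(const sketch_job_t *job, bool front_end_mode)
{
    const sketch_node_t *node = launch_node(job, front_end_mode);
    return node != NULL && node->ready;
}
```

With a job whose first node is not ready but whose front-end node is, `sketch_job_ready` returns false when it checks the first node and true when it checks the front-end node, which mirrors the behavior described above.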

Regards,
Nikolay

Used configuration:
slurm.conf

instead of the first node in the test_job_nodes_ready function.
Otherwise submitted jobs get stuck in the configuring state.
@wickberg wickberg closed this Jun 16, 2020