Skip to content

[Slurm] Make slurmctld and slurmdbd services wait for remote file systems to be mounted before being started. #2710

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

Conversation

gmarciani
Copy link
Contributor

Description of changes

Make slurmctld and slurmdbd services wait for remote file systems to be mounted before being started.

Tests

  • Before this change, slurmctld fails to restart on head node reboot.
  • After this change, slurmctld succeeds the restart on head node reboot.
  • The same change was already applied to slurmd
  • We can safely assume that the same applies to slurmdbd

References

  • Link to impacted open issues.
  • Link to related PRs in other packages (i.e. cookbook, node).
  • Link to documentation useful to understand the changes.

Checklist

  • Make sure you are pointing to the right branch.
  • If you're creating a patch for a branch other than develop add the branch name as prefix in the PR title (e.g. [release-3.6]).
  • Check all commits' messages are clear, describing what and why vs how.
  • Make sure to have added unit tests or integration tests to cover the new/modified code.
  • Check if documentation is impacted by this change.

Please review the guidelines for contributing and Pull Request Instructions.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

…tems to be mounted before being started.

Signed-off-by: Giacomo Marciani <[email protected]>
@gmarciani gmarciani force-pushed the wip/mgiacomo/310/fix-slurm-service-start branch from f61b30c to 3e0b0a7 Compare April 16, 2024 17:10
@gmarciani gmarciani enabled auto-merge (rebase) April 17, 2024 08:15
@gmarciani gmarciani merged commit abc0039 into aws:develop Apr 17, 2024
29 checks passed
@gmarciani gmarciani deleted the wip/mgiacomo/310/fix-slurm-service-start branch April 17, 2024 08:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants