Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Resuming the job on slurm after it gets cancelled. #297

Open
uahmed93 opened this issue Oct 11, 2024 · 0 comments
Open

Resuming the job on slurm after it gets cancelled. #297

uahmed93 opened this issue Oct 11, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@uahmed93
Copy link

Initial discussion happened with @VibhuJawa

Is your feature request related to a problem? Please describe.
While running a workflow on slurm with large files, if it needs to be cancelled or stopped due to 4 hr slurm job restriction, we will have partial results. Can we have robust resume feature which can handle even if files are partially processed.

Describe the solution you'd like
Merging output file parts (0.part, 1.part, ...) into 1 file and compare with input file for checking whether complete processing has happened or not, if not then while job resumes the remaining file part will also be counted in set which will go for processing.

@uahmed93 uahmed93 added the enhancement New feature or request label Oct 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant