Replies: 1 comment
-
We'd have to try, I don't know if all MPI ranks get the signal for example. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I would like SU2 to save the current solution when SIGTERM is sent to the job.
So here is my current understanding of how this is done by others:
Some (CFD) software packages will save the solution at the current iteration when you kill the job. I think what all these packages do is that they specifically capture SIGTERM. As far as I know, packages like slurm and LSF send 3 signals, SIGCONT or SIGINT, SIGTERM and SIGKILL, in this order. By capturing SIGTERM, you do not save files if you just hit on the command line when you start SU2 there (sending SIGINT) and with cluster jobs you can still force the job to not write solutions files by sending SIGKILL immediately.
So I think it's a matter of catching the signal and then exit the main loop as if it finished all the iterations. Maybe somebody with more experience in this matter can comment on this approach...
Beta Was this translation helpful? Give feedback.
All reactions