-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
install/configure SLURM+MPI for deploying on Azure #60
Comments
how often would a user want to use SLURM on a cloud-based cluster. They won't be able to use their Savio SLURM script anyway as the QoS and That said, it would be nice if an MPI submission would just work on a We may want to discuss how to prioritize use of AWS, Google, and Azure for On Tue, Jul 28, 2015 at 6:14 PM, Aaron Culich [email protected]
|
Since Azure already provides a pre-configured Ubuntu instance running SLURM, my request to them is to simply add MPI to their image. This addresses the use case of migrating from Savio (or similarly configure HPC cluster) directly to a cloud environment without changing the code or submit scripts-- since changing that or retraining people can take some time. In the long run there is probably a better strategy than using SLURM, but I'm focused right now on having a very easy migration path for people. An example is our reading group topic this Thursday using Savio, Azure, and AWS. As far as prioritizing which platforms, I am agnostic to that... really just responding to the demand for each as I hear it. |
After a brief superficial look, this seems awesome - seems like it would really make it easier for people to migrate Azure <-> Savio. I agree that reducing code changes in a research environment is really the key imo. EC2 <-> Savio would then be ideal. |
For some parallel and AWS functionality, I wrote some add-on scripts, so an On Tue, Jul 28, 2015 at 7:32 PM, Chris Kennedy [email protected]
|
Hadn't noticed until now that you had updated the documentation on how to use BCE to include the scripts for the parallel computation tools. I'll give that a shot. Azure now supports RDMA, though currently only with their SUSE Linux image, but will add support for others in the future:
|
Enabling this in a Savio-compatible way would allow mobility of code from a local HPC cluster to Azure cloud-based HPC cluster-- without an end-user having to change their code or submit scripts.
See azure-quickstart-templates#422 for further discussion.
The text was updated successfully, but these errors were encountered: