Automatic/standardized scale function computation #464

Open
SamTov opened this issue Jan 14, 2022 · 2 comments · May be fixed by #476
Labels: discussion (A general discussion point), enhancement (New feature or request), p2-medium (Should be relatively soon)

Comments

SamTov commented Jan 14, 2022

What feature would you like to see added?
Partially in conjunction with #383, though as a separate consequence: we need a standardized method of assessing the accuracy of the scaling functions.

My suggestion would be to set up a remote worker with ~4 GB of memory, i.e. the memory of a standard GPU, and check that the scaling functions allow for correct operation on data that is batched both configuration-wise and atom-wise.
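
A minimal sketch of how such a memory-capped worker could be emulated in a test, assuming a TensorFlow-based setup; the helper name and the 4096 MB cap are illustrative, while the tf.config calls are standard TensorFlow 2.x API:

```python
# Illustrative sketch only: emulate a ~4 GB GPU so the scale functions are
# forced to batch. The helper name is hypothetical.
import tensorflow as tf


def limit_gpu_memory(limit_mb: int = 4096) -> None:
    """Cap the memory of the first visible GPU (must run before GPU init)."""
    gpus = tf.config.list_physical_devices("GPU")
    if gpus:
        tf.config.set_logical_device_configuration(
            gpus[0],
            [tf.config.LogicalDeviceConfiguration(memory_limit=limit_mb)],
        )


limit_gpu_memory(4096)
# ...then run a calculator on data that is batched both configuration-wise
# and atom-wise, and assert it finishes without running out of memory.
```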

SamTov added the enhancement, discussion, and p2-medium labels on Jan 14, 2022
PythonFZ commented Jan 14, 2022

I think in this context we should also add a way to manually override the batching.
For example, one person might want to run an MDSuite job in the background, not care about speed, and do something else in the meantime.
On the other side, there is someone who wants the results ASAP, and MDSuite's batching is too aggressive and needlessly slows down the computation.

I think this could be handled by an argument in mdsuite.config, which by default would be config.batch_scaling = 1.

This is more in the context of testing the scaling functions than the actual scaling though.
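
A rough sketch of what such a knob could look like; config.batch_scaling does not exist yet, so the names and the place where the factor is applied are assumptions:

```python
# Hypothetical sketch: a user-facing scaling factor applied on top of the
# batch size that the scale functions estimate. All names are illustrative.
import dataclasses


@dataclasses.dataclass
class Config:
    batch_scaling: float = 1.0  # 1.0 = default behaviour, <1.0 = more conservative


config = Config()


def scaled_batch_size(estimated_batch_size: int) -> int:
    """Apply the user-controlled factor to the estimated batch size."""
    return max(1, int(estimated_batch_size * config.batch_scaling))
```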

I would go for 8 GB though.

SamTov commented Jan 14, 2022

The initial idea was that you set the memory consumption, which is currently hardcoded to be very conservative because the scale functions are not accurate enough. I think having a dev option for manual batching is a good idea, but for users I would leave it to setting memory consumption, e.g. use 50% of memory, or 98% if you want maximum performance.
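
A sketch of the memory-fraction idea; the function name and the per-configuration cost estimate are assumptions, not existing MDSuite code:

```python
# Illustrative only: translate a user-chosen memory fraction into a batch size.
def batch_size_from_memory_fraction(
    available_bytes: int,
    bytes_per_configuration: int,
    memory_fraction: float = 0.5,  # 0.5 = conservative, 0.98 = maximum performance
) -> int:
    """Fit as many configurations per batch as the memory budget allows."""
    budget = available_bytes * memory_fraction
    return max(1, int(budget // bytes_per_configuration))
```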

The reason I suggest 4 GB is that it makes the scaling very obvious. The smaller the better, in fact: if it is safe enough to run on 1 GB, then it is safe to run on 100 GB. By setting it to 4 GB we pretty much ensure that it runs on all standard GPUs and that the scaling is good, hence I see no reason to move to 8 GB.

Having it set through the config would be great. I was having problems today getting TF to stop using the GPU, so we should probably make the config something that is easier to work with on the user end, where we can set things like turning off the GPU, how much memory to use, logging, and so on.
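
A sketch of what a friendlier config interface could wrap; the function and its arguments are hypothetical, but the TensorFlow calls themselves are real:

```python
# Hypothetical wrapper around real TensorFlow calls for the kind of settings
# mentioned above: turning off the GPU and quieting the logger.
import tensorflow as tf


def apply_user_config(use_gpu: bool = True, quiet: bool = True) -> None:
    if not use_gpu:
        # "Turn off GPU": hide all GPU devices from TensorFlow.
        tf.config.set_visible_devices([], "GPU")
    if quiet:
        # Reduce TensorFlow log noise, e.g. for long background jobs.
        tf.get_logger().setLevel("ERROR")
```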
