-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
stress_test: Control the level of workload of each components #1
Comments
Hello @da-ekchajzer, we used Stress-ng on our side and we found it pretty flexible to simulate different kinds of workloads and most importantly target a specific CPU load which isn't always easy with other stress tools. To reflect an average usage we tested several options. To be able to run on many platforms it should be coupled with power measurement (software based of physical wattmeters). By default what was missing on our side for CPU was the support of AMD machines but that's easily doable (without memory consumption however for AMD). The same goes for GPUs if looking at ML-oriented hardware (Nvidia and AMD provide tooling for this). |
Thank you for the resource. About power measurement, I'll let you give your opinion or advice here : #2 |
To-do
|
See https://github.com/teads/turbostress for an implementation of stress-ng in an analog context. |
I think we should provide a complete example based on stress-ng, but the list of stress tests could be left to the user. We should just load a list of commands from a file. Something like could work fine :
|
Hey there ! Just had a great discussion with Arne from Green Coding. They have very valuable insights, and also a tool for this, includng very important variables like hyper threading, turboboost, etc.. We should synchronize before moving on a direction by ourselves ! We should have a look to https://github.com/green-coding-berlin/tools and synchronize with them |
I guess we should have an interface to easily implement different types of tests. |
Now that we have decided to use stress-ng. I think we should continue the discussion here #19 |
Problem
We need to automate the collection of point-in-time power consumption measurements per component at different workload levels.
Example for CPU intel xeon platinium
We need to control the level of workload of each components involve in the evaluation.
Solution
I propose to use a stress module to control the workload level. We can start with a step of 10 : 0%, 10%, 20%, ..., 100%.
@github-benjamin-davy, you have already developed this kind of module. Would you have some advice / resources that could be useful ?
Article : https://medium.com/teads-engineering/estimating-aws-ec2-instances-power-consumption-c9745e347959
The text was updated successfully, but these errors were encountered: