
Automate benchmarking and add to CI #34

Closed
3 tasks done
hanno-becker opened this issue May 10, 2024 · 19 comments
Labels
enhancement New feature or request

Comments

@hanno-becker
Contributor

hanno-becker commented May 10, 2024

Depends on: #28

Acceptance criteria:

  • Automated benchmarking of MLKEM-C-AArch64 implementations on the listed platforms (see README) is available and can be run by maintainers and by CI.
  • Automated benchmarking is added to CI.

Steps:

  • Plan access to various benchmarking platforms. For some, the maintainers own boards with the right CPU, but we need to figure out how to access them remotely from CI. For others, such as Graviton instances, we likely need to set up EC2 accounts.
  • Provide scripts for running benchmarks and preparing results.
  • Add benchmarking to CI (see the workflow sketch below for one possible shape).
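
As a rough illustration of the CI piece only (the runner labels, `make bench` target, and `scripts/bench` path below are hypothetical placeholders, not existing parts of the repository):

```yaml
# Hypothetical sketch only -- runner labels, build target and script path are placeholders.
name: Benchmark
on:
  pull_request:
  push:
    branches: [main]
jobs:
  bench:
    strategy:
      matrix:
        target: [graviton2, graviton3]   # assumed self-hosted runner labels
    runs-on: ${{ matrix.target }}
    steps:
      - uses: actions/checkout@v4
      - name: Build and run benchmarks
        run: |
          make bench                                          # assumed build target
          ./scripts/bench > bench-${{ matrix.target }}.txt    # hypothetical script
      - name: Upload results
        uses: actions/upload-artifact@v4
        with:
          name: bench-${{ matrix.target }}
          path: bench-${{ matrix.target }}.txt
```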
@hanno-becker hanno-becker added the enhancement New feature or request label May 10, 2024
@planetf1

A few related items:

  • GitHub does have ARM-based runners. They are currently in private beta and expected to go public in July. (As an enterprise, PQCA can have access.) There are currently costs involved, and a request is open with the PQCA TAC.
  • They are still not appropriate for benchmarking, and I'm not sure what the actual processor support is.
  • If funding is needed for other CI support, we can go through the TAC, but we would need to get some estimates.
  • I'm not sure how applicable the above is, given this is a very hardware-specific area.

@hanno-becker
Contributor Author

@planetf1 Leaving aside the benchmarking platforms for which the maintainers have development boards, we would need budget for AWS EC2 instances (Graviton2, Graviton3, Apple M1) for benchmarking and CI. Is there precedent for how to get funding for this in PQCA/LF?

@planetf1

The process in general would be to raise the issue at the TSC (this becomes more relevant when there are more projects, so we can consolidate and ensure there's awareness; given the active projects are already working together, it's less of an issue).

Then we need to raise it with the PQCA TAC for general review, and they then either approve (if within existing budget) or raise a request with the governing board (if not).

It sounds like administrivia, but apart from the last step I think we can move quickly. That's my broad understanding, at least; we're just getting started, and I do join all the PQCA meetings.

Do you have an estimate on resource usage for the AWS EC2 instances?
Any thoughts on how this may grow over time (so we plan for say a year)?
Are there alternatives to EC2 (we may be asked)?
When do you need it (yesterday?)
What's the impact if you don't have it?

There's a PQCA TAC meeting on Wednesday, so if we can get the info together for that, I can add an agenda item to the discussion.

@hanno-becker
Contributor Author

Do you have an estimate on resource usage for the AWS EC2 instances?

Some rough thoughts:

We'd want to benchmark branches and PR revisions. Every individual benchmark should be fast; I'd imagine a few minutes. Therefore, a single EC2 instance per type should be enough to cover CI demands initially, even with a fair amount of PR activity (which we haven't reached yet).

Following https://aws.amazon.com/ec2/pricing/on-demand/, a Graviton3 instance (c7g.xlarge) is currently $0.1445/hr and a Graviton2 instance (c6g.xlarge) is currently $0.136/hr. If both instances ran 24/7, this would amount to about $209/month (roughly $0.28/hr combined × 744 hours in a 31-day month). The true cost should be much lower, however, since we are unlikely to reach a level of activity any time soon that would keep benchmarking CI busy permanently -- still, the above gives an upper bound for Graviton benchmarking.

There is some flexibility in the choice of instance size (medium, large, xlarge, {2,4,8,16}xlarge) -- the above is for xlarge instances with 4 vCPUs, allowing for fast build times. Instances with 1 vCPU would be cheaper, and while the build would be slower, they would likely be equally suitable for benchmarking since all our code is single-threaded.

This does not yet take into account potential M1 instances.

Any thoughts on how this may grow over time (so we plan for say a year)?

The demand would grow with the frequency with which we make updates to PRs, so hopefully it would grow over time. However, a single instance per type should remain sufficient to cover our needs for the foreseeable future.

Are there alternatives to EC2 (we may be asked)?

We have other benchmarking targets independent of EC2, but Graviton2/3 are only available through EC2.

When do you need it (yesterday?)

When we start optimizing MLKEM-C-AArch64 for performance -- in the coming weeks I'd suppose (the first PR to this effect is #38).

What's the impact if you don't have it?

We have to conduct ad-hoc measurements for our PRs. I would imagine this effectively leads to incomplete benchmarking information per PR, depending on what board/EC2 access the respective maintainer has.

@hanno-becker
Contributor Author

@planetf1 I am not in a position to make promises, but one could also apply for cloud credits from AWS (https://aws.amazon.com/government-education/research-and-technical-computing/cloud-credit-for-research/).

@planetf1

@planetf1 I am not in a position to make promises, but one could also apply for cloud credits from AWS (aws.amazon.com/government-education/research-and-technical-computing/cloud-credit-for-research).

Thanks @hanno-becker. From a quick scan of that page, it seems targeted at individual researchers at academic institutions. Whilst such researchers form part of the community working on PQCA projects, we also have a foundation (with funding from commercial organizations), as well as contributors working for commercial orgs.

@planetf1

Thanks for all the info on EC2 -- do you think GitHub Arm runners would be an alternative? As good? Nearly as good? Not very good? ...

Just asking as I know that topic has already been floated, and the Linux Foundation has been working with GitHub on enterprise access, plus it already funds usage on other projects.

Many questions on this, as it's not quite publicly available yet and I've not seen machine specs, but I can find out.

I do think proper integration into CI, with the resources behind it, is important for an implementation that's focused on performance. Manual benchmarking just adds scope for errors and inconsistency, and makes regressions much harder to spot, whereas a run after each merge even allows automated checking for performance regressions (perhaps with some bounds, given virtual platforms) -- see the sketch below.
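
Purely as an illustration of that last point (the comparison script, file names, and 5% bound below are all hypothetical), such a check could be a single post-merge workflow step:

```yaml
# Hypothetical regression gate -- script, file names and tolerance are placeholders.
- name: Check for performance regressions
  run: |
    # fail the job if any benchmark is more than 5% slower than the stored baseline
    python3 scripts/compare_bench.py \
      --baseline benchmarks/baseline.json \
      --current bench-results.json \
      --max-regression 0.05
```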

@ryjones
Contributor

ryjones commented May 22, 2024

If Graviton is a requirement, it looks like AWS is the only provider. BuildJet Arm builders are something I can do quickly; AWS will take more work. GitHub offers macOS Arm runners as well.

@hanno-becker
Contributor Author

hanno-becker commented May 23, 2024

@ryjones Do we know what hardware underlies the native BuildJet arm builders?

We need to know exactly which hardware we are running on for benchmarking (this is in contrast to the functional tests, which also need an Arm platform, but where any platform will do, including an emulated one).
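
As a side note, a minimal sketch of how to check this from a workflow step (just standard Linux tooling, nothing repository-specific):

```yaml
# Minimal sketch: print the CPU details of whatever host the job lands on.
- name: Identify runner CPU
  run: |
    lscpu
    grep -E 'CPU implementer|CPU part' /proc/cpuinfo | sort -u
```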

@ryjones
Contributor

ryjones commented May 23, 2024

@hanno-becker It looks like AWS is the only source for Graviton. So let's do it bit by bit: get what we can now and add AWS later (by later, I mean weeks or a month, not months).

The thing with AWS is I need to get those resources under the MSA that LF has with Amazon, which takes some paper shuffling.

@hanno-becker
Contributor Author

@ryjones Sounds good to me.

@planetf1

Thanks @ryjones for looking into this

@hanno-becker
Contributor Author

@ryjones

BuildJet arm builders are something I can do quickly.

I think for testing and dynamic analysis that might still be useful. Can you help us / give us some pointers on how to use BuildJet arm runners?

@ryjones
Contributor

ryjones commented May 23, 2024

@hanno-becker Once approved, I will connect them to the org and you can use them like any other runner. Here are the docs.

@ryjones
Contributor

ryjones commented Jun 4, 2024

@planetf1 @hanno-becker The runner name is pqcp-arm64. You could convert this job into a matrix and add that as a target.

see this as an example
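
For illustration only (the job name and build command are placeholders, not the repository's actual workflow), the matrix conversion could look roughly like:

```yaml
# Hypothetical matrix conversion -- job name and build command are placeholders.
jobs:
  build_test:
    strategy:
      matrix:
        runner: [ubuntu-latest, pqcp-arm64]   # pqcp-arm64 is the runner group mentioned above
    runs-on: ${{ matrix.runner }}
    steps:
      - uses: actions/checkout@v4
      - name: Build and test
        run: make test                        # assumed entry point
```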

@planetf1

planetf1 commented Jun 4, 2024

@ryjones So pqcp-arm64 is a BuildJet runner? Is there a link/summary of what it is (CPU, core count, RAM, etc.)?

@ryjones
Contributor

ryjones commented Jun 4, 2024

@planetf1 No, it is a GitHub runner.

Runner group: [pqcp-large-runners](https://github.com/enterprises/post-quantum-cryptography-alliance/settings/actions/runner-groups/3)
Platform: Linux ARM64 Beta
Size: 4 cores · 16 GB RAM · 150 GB SSD
Public IP: Disabled
Network Configuration: Disabled

I can make it larger or smaller as you need.

@hanno-becker
Contributor Author

@ryjones This is great, thank you very much. Leveraged in #49.

@hanno-becker
Contributor Author

EC2 benchmarking added in #99
