[FEAT] GHA workflow to perform tpch benchmarking #3184
Conversation
CodSpeed Performance Report: Merging #3184 will degrade performance by 34.59%.
Codecov Report: All modified and coverable lines are covered by tests ✅

@@ Coverage Diff @@
## main #3184 +/- ##
==========================================
- Coverage 76.54% 76.37% -0.17%
==========================================
Files 685 685
Lines 85269 85135 -134
==========================================
- Hits 65266 65020 -246
- Misses 20003 20115 +112
with:
  aws-region: us-west-2
  role-to-assume: ${{ secrets.ACTIONS_AWS_ROLE_ARN }}
  role-session-name: daft-performance-comparisons
Why is this necessary -- Is this for S3 upload permissions?
Yes, this was here for AWS S3 uploads. I believe writes to the bucket are private (whereas reads are public). May be wrong though.
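For context, here is a minimal sketch of how such a credentials step is typically wired up for OIDC role assumption so that later steps can write to the private S3 bucket; the action version and the `permissions` block are assumptions, while the `with:` values come from the snippet above.

```yaml
permissions:
  id-token: write   # needed so the job can request an OIDC token (assumption)
  contents: read

steps:
  - name: Assume AWS role for S3 uploads
    uses: aws-actions/configure-aws-credentials@v4   # version is an assumption
    with:
      aws-region: us-west-2
      role-to-assume: ${{ secrets.ACTIONS_AWS_ROLE_ARN }}
      role-session-name: daft-performance-comparisons
  # Subsequent steps can then run `aws s3 cp` / `aws s3 sync` against the private bucket.
```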
Force-pushed from 0c8a35f to a71c566.
Example: run by @desmondcheongzx, submitted locally using `gh workflow run build-commit-run-tpch.yaml --ref $BRANCH_NAME -f skip_questions=$SKIP_QUESTIONS`.
Tagging @colin-ho. You recently touched the …
The `benchmarking/tpch/__main__.py` file looks good to me apart from the one nit comment about the `py_modules` arg. Also left some suggestions on the other files.
- slightly different from the existing one
- uv pip install pip ray[default] py-spy
# GitHub Actions workflow will replace all parameters between `<<...>>` with the
# actual values as determined dynamically during runtime of the actual workflow.
- uv pip install https://github-actions-artifacts-bucket.s3.us-west-2.amazonaws.com/builds/<<SHA>>/<<WHEEL>>
nit: I think `{{SHA}}` and `{{WHEEL}}` are more common here for templating.
Can update.
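For illustration, a minimal sketch of how the workflow could render those placeholders at runtime, using the `{{...}}` style suggested above; the step name, output path, and the `WHEEL_NAME` variable are assumptions rather than the PR's exact implementation.

```yaml
- name: Render Ray cluster config from template
  run: |
    # Substitute the templated parameters with concrete values before `ray up`.
    sed -e "s|{{SHA}}|${{ github.sha }}|g" \
        -e "s|{{WHEEL}}|${WHEEL_NAME}|g" \
        .github/assets/benchmarking_ray_config.yaml > /tmp/benchmarking_ray_config.yaml
  env:
    WHEEL_NAME: ${{ inputs.wheel }}   # assumed workflow input carrying the wheel file name
```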
for row in csv_reader:
    row_str = make_md_row(row)
    output.write(row_str)
Ok, but note that this was a nice-to-have that really should have been in a follow-on PR.
options:
  - '2'
  - '32'
  - '32'
I think we should combine SF with partition size, since not all permutations have datasets? I.e. `SF10_part32`, `SF100_part512`, ... Or even simpler, just let people select the URL.
Ya, I was thinking of this as well. Options could be:
- 2-2
- 10-32
- etc.
Will update.
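A minimal sketch of what that combined input could look like; the input name and the exact option list are assumptions, and only `<scale factor>-<partition count>` combinations that actually exist as datasets would be listed.

```yaml
on:
  workflow_dispatch:
    inputs:
      tpch_dataset:
        type: choice
        description: TPC-H dataset as "<scale-factor>-<partition-count>"
        required: true
        default: '2-2'
        options:
          - '2-2'     # SF2 with 2 partitions
          - '10-32'   # SF10 with 32 partitions
```

The workflow could then split the choice back into its two parts, e.g. `SCALE_FACTOR=${CHOICE%-*}` and `PARTITION_SIZE=${CHOICE#*-}`.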
aws s3 sync \
  s3://eventual-dev-benchmarking-fixtures/uncompressed/tpch-dbgen/<<SCALE_FACTOR>>/<<PARTITION_SIZE>>/parquet/ \
  /tmp/data/<<SCALE_FACTOR>>/<<PARTITION_SIZE>>/parquet/ \
  --quiet
Is this intended behavior? Downloading 1TB of data is going to kill the machine. Are we benchmarking off s3 or local?
type: string
description: The wheel artifact to use
required: false
default: getdaft-0.3.0.dev0-cp38-abi3-manylinux_2_31_x86_64.whl
What's this version of Daft and why is this not required?
This is the name of the wheel inside the build's directory in the `github-actions-artifacts-bucket` bucket. The name of the outputted wheel is determined by the `build` job, which then injects the name of the wheel into this step. If you want to run this step without having run the previous one, you'll have to specify the name.
Alternatively, we could have an S3 List step grab the name instead...
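A sketch of that alternative: list the build's prefix in the bucket and pick up the wheel name instead of requiring it as an input. The bucket name and prefix layout are taken from the snippets above; the step itself and its output name are assumptions.

```yaml
- name: Discover wheel name from S3
  id: wheel
  run: |
    # `aws s3 ls` prints "<date> <time> <size> <key>"; take the first .whl key.
    WHEEL=$(aws s3 ls "s3://github-actions-artifacts-bucket/builds/${{ github.sha }}/" \
      | awk '{print $4}' | grep '\.whl$' | head -n 1)
    echo "name=${WHEEL}" >> "$GITHUB_OUTPUT"
  # Later steps could then reference ${{ steps.wheel.outputs.name }}.
```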
aws-region: us-west-2
role-session-name: run-tpch-workflow
- uses: ./.github/actions/install
- run: |
Split this into steps; having everything in one `run:` block makes it difficult to debug if something goes wrong. You probably also want the `ray down` step to "always run" regardless of the success/failure of all the rest of the steps.
  --ray_job_dashboard_url http://localhost:8265 \
  --skip_warmup \
  --pickle_daft_module='false'
fi
Why not just leverage `ray submit` here?
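For reference, a sketch of the `ray submit` alternative being suggested: it copies the script to the head node of the cluster described by the launcher config and runs it there, avoiding the manual SSH port-forward and dashboard URL. The script path is taken from the PR; passing the benchmark's flags through would still need to be worked out.

```yaml
- name: Run TPC-H benchmark via ray submit
  run: |
    # Executes the benchmark entrypoint directly on the Ray head node.
    ray submit .github/assets/benchmarking_ray_config.yaml benchmarking/tpch/__main__.py
```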
ray up .github/assets/benchmarking_ray_config.yaml -y
HEAD_NODE_IP=$(ray get-head-ip .github/assets/benchmarking_ray_config.yaml | tail -n 1)
ssh -o StrictHostKeyChecking=no -fN -L 8265:localhost:8265 -i ~/.ssh/ci-github-actions-ray-cluster-key.pem ubuntu@$HEAD_NODE_IP
export DAFT_ENABLE_RAY_TRACING=1
You might want to make this configurable and default to `true`.
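A sketch of how that could be exposed as a workflow input; the input name is an assumption.

```yaml
on:
  workflow_dispatch:
    inputs:
      enable_ray_tracing:
        type: boolean
        description: Enable Daft's Ray tracing during the benchmark run
        required: false
        default: true

# ...and in the benchmark step, instead of hard-coding `export DAFT_ENABLE_RAY_TRACING=1`:
#   env:
#     DAFT_ENABLE_RAY_TRACING: ${{ inputs.enable_ray_tracing }}
# (mapping true/false onto 1/0 may be needed, depending on how Daft parses the variable)
```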
new_base=${base//:/_}
mv "$filepath" "$dir/$new_base"
done
' _ {} +
What's this?
This converts all of the instances of `:` in path names to `_`. Having `:` in path names will make the GitHub Actions upload-artifacts step fail.
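For completeness, a sketch of the full rename command whose tail is quoted above; the directory being searched is an assumption.

```yaml
- name: Sanitize file names for upload-artifact
  run: |
    # upload-artifact rejects paths containing ':', so replace every ':' with '_'
    # (e.g. "2024-01-01T12:00:00.json" -> "2024-01-01T12_00_00.json").
    find /tmp/ray -type f -name '*:*' -exec bash -c '
      for filepath; do
        dir=$(dirname "$filepath")
        base=$(basename "$filepath")
        new_base=${base//:/_}
        mv "$filepath" "$dir/$new_base"
      done
    ' _ {} +
```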
Closing in favour of #3404.
Overview

Create a new GHA workflow for building a commit and running TPC-H against it.

Notes

There are 2 main workflows:
- `build-commit.yaml`
- `run-tpch.yaml`

The final workflow, `build-commit-run-tpch.yaml`, just runs the above two in sequential order.

I've also made some changes to `benchmarking/tpch/__main__.py`. Namely:
- Added `DAFT` environment variables to the ray-runtime-env variables that are sent during ray-cluster initialization.
- Added an option to send the `daft` module to the ray-cluster during initialization.

I've summarized the workflows individually down below:

build-commit workflow
- Builds the Daft wheel for the requested commit and uploads it to the artifacts bucket.

run-tpch workflow
- Runs the `benchmarking.tpch` benchmark against a built wheel.

build-commit-run-tpch workflow
- Passes the output of the `build-commit` job (1st one) to the input of the `run-tpch` job (2nd one), as sketched below.
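A minimal sketch of how that sequential wiring could look with reusable workflows, assuming `build-commit.yaml` and `run-tpch.yaml` declare `workflow_call` triggers, that the build workflow exposes a `wheel` output, and that the benchmark workflow accepts a `wheel` input (those names are assumptions):

```yaml
# build-commit-run-tpch.yaml (sketch)
name: build-commit-run-tpch
on:
  workflow_dispatch:

jobs:
  build-commit:
    uses: ./.github/workflows/build-commit.yaml
    secrets: inherit

  run-tpch:
    needs: build-commit
    uses: ./.github/workflows/run-tpch.yaml
    with:
      # Pipe the wheel name produced by the build job into the benchmark job.
      wheel: ${{ needs.build-commit.outputs.wheel }}
    secrets: inherit
```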