Conversation

Contributor

@huydhn huydhn commented Jun 6, 2025

This is the script I'm using to upload vLLM benchmark results. It can be retrofitted to upload other benchmark results too, for example torch.compile benchmark results on GB200. I will write a wiki on how to use this script, but the basic usage is:

UPLOADER_USERNAME=<REDACT>
UPLOADER_PASSWORD=<REDACT>
GPU_DEVICE=$(nvidia-smi -i 0 --query-gpu=name --format=csv,noheader | awk '{print $2}')

python upload_benchmark_results.py \
  --repo pytorch \
  --benchmark-name TorchInductor \
  --benchmark-results test-reports \
  --device "${GPU_DEVICE}" \
  --dry-run

This also handles the JSONEachRow format correctly in the read_benchmark_results function, since that is the format used by ClickHouse and DynamoBench.
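For reference, JSONEachRow is simply one JSON object per line. A minimal sketch of how a reader like read_benchmark_results might parse it (the function name comes from the description above; this is an illustrative sketch, not the PR's exact code):

```python
import json


def read_benchmark_results(path: str) -> list[dict]:
    """Read a file in JSONEachRow format: one JSON object per line."""
    records = []
    with open(path) as f:
        for line in f:
            line = line.strip()
            if line:  # tolerate blank lines between records
                records.append(json.loads(line))
    return records
```

Unlike a plain `json.load`, this handles files that are a stream of objects rather than a single top-level array, which is what ClickHouse emits for JSONEachRow.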

cc @zhe-thoughts @ZainRizvi

Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn temporarily deployed to pytorch-x-vllm June 6, 2025 02:45 — with GitHub Actions Inactive
@huydhn huydhn requested a review from ZainRizvi June 6, 2025 02:47
@huydhn huydhn marked this pull request as ready for review June 6, 2025 02:47
Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn temporarily deployed to pytorch-x-vllm June 6, 2025 02:56 — with GitHub Actions Inactive
huydhn added a commit to pytorch/test-infra that referenced this pull request Jun 6, 2025
Courtesy of Claude Code. This is the initial version of an API that I'm
building to allow people to upload benchmark results. This works in
conjunction with
pytorch/pytorch-integration-testing#36

I will need to prepare a Terraform change for the lambda, but the API
works as follows:

```
import requests

# Read the benchmark results file to upload
with open("FILE_TO_BE_UPLOADED.json") as f:
    content = f.read()

json_data = {
    "username": "REDACT",
    "password": "REDACT",
    "content": content,
    "path": "v3/foobar/debug.json",
}

headers = {
    "content-type": "application/json",
}

url = "https://qrr6jzjpvyyd77fkj6mqkes4mq0tpirr.lambda-url.us-east-1.on.aws"
response = requests.post(url, json=json_data, headers=headers)
response.raise_for_status()
```

cc @zhe-thoughts @ZainRizvi

---------

Signed-off-by: Huy Do <huydhn@gmail.com>
Comment on lines 311 to 314
if s3_bucket:
    upload_s3(s3_bucket, s3_path, data)
elif upload_url:
    upload_via_api(upload_url, s3_path, data)
Contributor


It seems like you're expecting the user to always specify a unique S3 bucket to upload the metrics to.

  • Is there a right/wrong format or path for them to be using, or will arbitrary paths work just fine as long as they're unique? Do we need path validation here?
  • Can we generate this bucket path for the user? Ideally in the lambda so that there's no risk of a bad path being passed in

Contributor Author


While the S3 path is generated automatically here as f"v3/{repo_name}/{head_branch}/{head_sha}/{device}/benchmark_results{model_suffix}.json", the lambda has a check that prevents overwriting an existing file: https://github.com/pytorch/test-infra/blob/main/aws/lambda/benchmark-results-uploader/lambda_function.py#L29. I will remove the option to set the bucket name because I want to write only to the benchmark bucket.
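A minimal sketch of the path-generation scheme described above (the function name and the defaulting of model_suffix are illustrative assumptions; the exact code lives in the PR):

```python
def generate_s3_path(
    repo_name: str,
    head_branch: str,
    head_sha: str,
    device: str,
    model_suffix: str = "",
) -> str:
    # Mirrors the format string quoted above: the commit SHA and device
    # make the path unique per upload, so the lambda's no-overwrite check
    # can reject accidental duplicate uploads.
    return (
        f"v3/{repo_name}/{head_branch}/{head_sha}/{device}/"
        f"benchmark_results{model_suffix}.json"
    )
```

For example, generate_s3_path("vllm-project/vllm", "main", "abc123", "H100") yields v3/vllm-project/vllm/main/abc123/H100/benchmark_results.json.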

Contributor


How do you feel about sending the lambda the inputs used to generate the S3 path instead? That feels like a much more reliable approach than trusting input from a client to be in a specific format (especially if the backend takes a dependency on that format).

Comment on lines 117 to 121
uploader.add_argument(
    "--upload-url",
    type=str,
    action=ValidateURL,
    help="the URL to upload the benchmark results to",
Contributor


Is this expected to be the full S3 URL, or just the S3 item key? (The lambda seems to have the bucket name fixed.)

Contributor Author


Let me rework this part. The URL here will be fixed after https://github.com/pytorch-labs/pytorch-gha-infra/pull/705 is deployed, for example https://qrr6jzjpvyyd77fkj6mqkes4mq0tpirr.lambda-url.us-east-1.on.aws. I only want people to upload to the benchmark bucket.
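As an aside, a custom argparse action like the ValidateURL referenced in the quoted snippet above could be sketched as follows (a hypothetical illustration assuming http(s)-only validation; the PR's actual implementation may differ):

```python
import argparse
from urllib.parse import urlparse


class ValidateURL(argparse.Action):
    """Reject argument values that are not well-formed http(s) URLs."""

    def __call__(self, parser, namespace, values, option_string=None):
        parsed = urlparse(values)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            # parser.error prints a usage message and exits with status 2
            parser.error(
                f"{option_string} must be a valid http(s) URL, got {values!r}"
            )
        setattr(namespace, self.dest, values)
```

Attaching the check to the parser via `action=` keeps validation at the CLI boundary, so the rest of the script can assume the URL is well-formed.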

Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn temporarily deployed to pytorch-x-vllm June 9, 2025 18:11 — with GitHub Actions Inactive
@huydhn huydhn requested a review from ZainRizvi June 9, 2025 18:13
@huydhn
Contributor Author

huydhn commented Jun 9, 2025

The failure on llama4 Maverick is expected. That model isn't working yet.

Contributor

@ZainRizvi ZainRizvi left a comment


Approving since, per an offline conversation, the actual paths used don't really matter as long as there are no path collisions.

@zhe-thoughts

Looking forward!

Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn temporarily deployed to pytorch-x-vllm June 10, 2025 01:07 — with GitHub Actions Inactive
@huydhn huydhn merged commit 3478bce into main Jun 10, 2025
6 of 7 checks passed
@huydhn
Contributor Author

huydhn commented Jun 10, 2025

cc @yangw-dev Once this is merged, we will need to finally migrate the https://hud.pytorch.org/benchmark/compilers dashboard to the new benchmark table, because this script will only upload results there. Until now, the two tables have been kept in sync, but that will no longer be the case.

@huydhn
Contributor Author

huydhn commented Jun 10, 2025

> Looking forward!

@zhe-thoughts This has finally landed. I'm writing the wiki on how to use this script locally and will send it your way in the next few hours so that we can try it out tomorrow.

@huydhn huydhn mentioned this pull request Jun 11, 2025