Skip to content

Commit 1ddea76

Browse files
committed
You're an odd one, ain't you?
Signed-off-by: Huy Do <huydhn@gmail.com>
1 parent a7a1664 commit 1ddea76

File tree

2 files changed

+11
-4
lines changed

2 files changed

+11
-4
lines changed

.github/scripts/gpt-oss/run_accuracy_checks.sh

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -30,22 +30,29 @@ fi
3030
pushd vllm-benchmarks/gpt-oss
3131
mkdir -p /tmp/gpqa_openai
3232

33+
# Not sure why this is needed on ROCm image
34+
if [[ "${DEVICE_NAME}" == "rocm" ]]; then
35+
ls -la gpt_oss
36+
ls -la gpt_oss/evals
37+
export PYTHONPATH=$(pwd):$PYTHONPATH
38+
fi
39+
3340
# Low
34-
OPENAI_API_KEY='' python3 -m gpt_oss.evals --base-url http://localhost:8000/v1 \
41+
OPENAI_API_KEY="" python3 -m gpt_oss.evals --base-url http://localhost:8000/v1 \
3542
--model $MODEL \
3643
--eval gpqa \
3744
--reasoning-effort low \
3845
--n-threads $(expr $(nproc) / 2)
3946

4047
# Mid
41-
OPENAI_API_KEY='' python3 -m gpt_oss.evals --base-url http://localhost:8000/v1 \
48+
OPENAI_API_KEY="" python3 -m gpt_oss.evals --base-url http://localhost:8000/v1 \
4249
--model $MODEL \
4350
--eval gpqa \
4451
--reasoning-effort medium \
4552
--n-threads $(expr $(nproc) / 2)
4653

4754
# High
48-
OPENAI_API_KEY='' python3 -m gpt_oss.evals --base-url http://localhost:8000/v1 \
55+
OPENAI_API_KEY="" python3 -m gpt_oss.evals --base-url http://localhost:8000/v1 \
4956
--model $MODEL \
5057
--eval gpqa \
5158
--reasoning-effort high \

.github/workflows/gpt-oss-benchmark.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ jobs:
2222
- runner: linux.dgx.b200.8
2323
model: openai/gpt-oss-120b
2424
docker-image: 'public.ecr.aws/q9t5s3a7/vllm-ci-test-repo:6d8d0a24c02bfd84d46b3016b865a44f048ae84b'
25-
- runner: linux.rocm.gpu.gfx942.4
25+
- runner: linux.rocm.gpu.gfx942.8
2626
model: openai/gpt-oss-120b
2727
docker-image: rocm/vllm-dev:open-mi300-08052025
2828
# gpt-oss-20b

0 commit comments

Comments
 (0)