Skip to content

Conversation

@jakub-sochacki
Copy link
Owner

@jakub-sochacki jakub-sochacki commented Oct 14, 2025

  • Add linux.hpu.gaudi3.8 runner to benchmark matrix (placeholder)
  • Create test-[throughput | latency | serving]-hpu.json files with benchmark configurations
  • Add gaudi3 to default runners list in workflow dispatch
  • Implement HPU device detection using hl-smi command
  • Add HPU-specific Docker image suffix (-hpu)
  • Add special commit selection for HPU that queries vllm-gaudi history
  • Clone vllm-gaudi vllm/last-good-commit-for-vllm-gaudi branch to get compatibility history (N most recent VLLM_STABLE_COMMIT updates)
  • Search through commit history to find most recent vLLM version with existing Docker image built by CI

The commit selection mechanism solves the race condition where VLLM_STABLE_COMMIT might change between CI image builds and benchmark runs (every 12 hours), ensuring benchmarks always find an existing compatible Docker image.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants