GitHub - meta-pytorch/BackendBench: How to ship your LLM-generated kernels to PyTorch

BackendBench

BackendBench is an evaluation suite for testing how well LLMs and humans can write PyTorch backends. It lets developers add custom kernels in an organized directory structure and dynamically override PyTorch's core operators at runtime. The result is a fully functional PyTorch backend that you can pip install and use with existing models without changing them.
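To make the idea of overriding operators at runtime concrete, here is a minimal pure-Python sketch. It is not BackendBench's implementation and does not use PyTorch's dispatcher. The operator table `ops`, the `call` helper, and the `override` context manager are hypothetical names invented for illustration. The point it shows is that model code which looks up operators by name keeps working unchanged when a custom kernel is swapped in.

```python
from contextlib import contextmanager

# Hypothetical stand-in for a framework's operator table.
# This is NOT PyTorch's dispatcher; it only illustrates the concept.
DEFAULT_OPS = {
    "relu": lambda xs: [x if x > 0 else 0.0 for x in xs],
    "add": lambda xs, ys: [x + y for x, y in zip(xs, ys)],
}
ops = dict(DEFAULT_OPS)


def call(name, *args):
    """Look up the operator by name at call time, so overrides take effect immediately."""
    return ops[name](*args)


@contextmanager
def override(custom_kernels):
    """Temporarily replace operators with custom kernels, then restore the originals."""
    saved = dict(ops)
    ops.update(custom_kernels)
    try:
        yield
    finally:
        ops.clear()
        ops.update(saved)


def model(xs):
    """'Existing model' code: it only calls operators by name and never changes."""
    return call("relu", call("add", xs, xs))


def custom_relu(xs):
    """A replacement relu kernel written with max() instead of a conditional."""
    return [max(x, 0.0) for x in xs]
```

Calling `model` inside `with override({"relu": custom_relu}):` routes the relu step through the custom kernel, and the original operator is restored when the block exits.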

Features:

Comprehensive correctness testing via PyTorch's OpInfo and FACTO test suites
Performance benchmarks using real tensor shapes from popular Hugging Face models
Clean path to upstream your kernels to PyTorch (if they pass our tests, they are likely correct enough to merge)
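The correctness testing described above boils down to comparing a candidate kernel against a trusted reference on many inputs. The sketch below shows that idea in pure Python for relu. It does not use OpInfo, FACTO, or any BackendBench API. The functions `reference_relu`, `candidate_relu`, `buggy_relu`, and `check`, the input list, and the tolerances are all assumptions chosen for illustration.

```python
import math


def reference_relu(xs):
    """Trusted reference: the definition of relu, element by element."""
    return [x if x > 0 else 0.0 for x in xs]


def candidate_relu(xs):
    """Hypothetical generated kernel under test."""
    return [max(x, 0.0) for x in xs]


def buggy_relu(xs):
    """Hypothetical kernel with a plausible mistake: it returns abs(x)."""
    return [abs(x) for x in xs]


def allclose(actual, expected, rtol=1e-5, atol=1e-8):
    """Element-wise comparison with relative and absolute tolerances (assumed values)."""
    if len(actual) != len(expected):
        return False
    return all(
        math.isclose(a, e, rel_tol=rtol, abs_tol=atol)
        for a, e in zip(actual, expected)
    )


# Hand-picked inputs, including edge cases: empty input, signed zeros,
# values near zero, and infinities.
TEST_INPUTS = [
    [],
    [0.0, -0.0],
    [-3.5, -1e-9, 1e-9, 2.0],
    [float("inf"), float("-inf")],
]


def check(kernel, reference, inputs):
    """Return the inputs on which the kernel disagrees with the reference."""
    return [xs for xs in inputs if not allclose(kernel(xs), reference(xs))]
```

An empty result from `check(candidate_relu, reference_relu, TEST_INPUTS)` means the candidate agreed with the reference on every input. A non-empty result lists the failing inputs, which is what `buggy_relu` produces for the cases containing large negative values.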

Why it matters: Many kernel optimization efforts struggle with correctness. Our approach ensures your kernels are production-ready by meeting PyTorch's own standards.

Installation:

pip install .

LLM-Based Kernel Generation and Evaluation

Generate and evaluate PyTorch kernels using the Claude API.

Run the LLM evaluation on the smoke test suite (relu operation):

export ANTHROPIC_API_KEY=your_api_key_here
uv run python BackendBench/scripts/main.py --suite smoke --backend llm

License

Source code is made available under a BSD 3-Clause license.
