BackendBench is an evaluation suite for testing how well LLMs and humans can write PyTorch backends. It lets developers add custom kernels in an organized directory structure and dynamically override PyTorch's core operators at runtime, resulting in a fully functional PyTorch backend that you can pip install and use with existing models, with no modeling code changes required.
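The runtime override mechanism can be sketched with PyTorch's public `torch.library` API (a hedged illustration, not necessarily BackendBench's exact internals): a custom kernel is registered against an existing `aten` operator, and every subsequent call dispatches to it.

```python
import torch

# Open the aten namespace for implementation overrides.
lib = torch.library.Library("aten", "IMPL")

def custom_relu(x):
    # Stand-in for a hand-written kernel.
    return torch.where(x > 0, x, torch.zeros_like(x))

# Route aten::relu on CPU tensors to our implementation.
lib.impl("relu", custom_relu, "CPU")

# Existing model code calling torch.relu now runs custom_relu,
# with no modeling code changes.
print(torch.relu(torch.tensor([-1.0, 2.0])))
```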
Features:
- Comprehensive edge case correctness testing via PyTorch's OpInfo and FACTO test suites
- Performance benchmarks using real tensor shapes from popular Hugging Face models
- Clean path to upstream your kernels to PyTorch (if it passes our tests, it's likely correct enough to merge)
Many kernel optimization efforts struggle with correctness. Our approach ensures your kernels are production-ready by holding them to PyTorch's own standards. You can learn more about correctness in our launch blog and launch video.
```bash
pip install .
```
- Create operator directories:
```bash
python -m BackendBench.scripts.setup_operator_directories
```
- Implement kernels: each directory contains an empty op implementation. Fill it out yourself, or get your LLM to do it!
- Test your implementations:
```bash
# Smoke test to make sure everything is in check
python BackendBench/scripts/main.py --suite smoke --backend aten
# OpInfo correctness tests
python BackendBench/scripts/main.py --suite opinfo --backend directory
# TorchBench performance tests
python BackendBench/scripts/main.py --suite torchbench --backend directory
```
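A filled-in kernel might look like the sketch below. This is a hypothetical example: the actual function name and signature come from the stub generated in each operator directory, and `relu_kernel` here is illustrative.

```python
import torch

def relu_kernel(input: torch.Tensor) -> torch.Tensor:
    # Eager reference implementation of relu; a real submission
    # might instead use a Triton or CUDA kernel for performance.
    return torch.clamp_min(input, 0)
```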
See the BackendBench Example for a practical demonstration of using BackendBench for model convergence testing.
Source code is made available under the BSD 3-Clause license.