-
-
Notifications
You must be signed in to change notification settings - Fork 10.1k
Closed
Labels
bugSomething isn't workingSomething isn't working
Description
Your current environment
uv init vllminfer
uv pip install vllm
source .venv/bin/activate
git clone https://github.com/flashinfer-ai/flashinfer.git --recursive
cd flashinfer
export TORCH_CUDA_ARCH_LIST="7.5 8.0 8.9 9.0a 10.0a"
python -m flashinfer.aot # Produces AOT kernels in aot-ops/
python -m pip install --no-build-isolation --verbose .
🐛 Describe the bug
I compiled Flashinfer 0.2.10 AOT mode in my environment. this was reported. I guess the version check thought 0.2.10 < 0.2.3.......
WARNING 08-07 20:24:13 [topk_topp_sampler.py:36] FlashInfer version >= 0.2.3 required. Falling back to default sampling implementation.
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
etvincen and teddythinh
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working