Skip to content

[Bug]: Flashinfer 0.2.10 not supported #22455

@pty819

Description

@pty819

Your current environment

uv init vllminfer
uv pip install vllm
source .venv/bin/activate

git clone https://github.com/flashinfer-ai/flashinfer.git --recursive
cd flashinfer
export TORCH_CUDA_ARCH_LIST="7.5 8.0 8.9 9.0a 10.0a"
python -m flashinfer.aot # Produces AOT kernels in aot-ops/
python -m pip install --no-build-isolation --verbose .

🐛 Describe the bug

I compiled Flashinfer 0.2.10 AOT mode in my environment. this was reported. I guess the version check thought 0.2.10 < 0.2.3.......

WARNING 08-07 20:24:13 [topk_topp_sampler.py:36] FlashInfer version >= 0.2.3 required. Falling back to default sampling implementation.

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions