[Sync] Upstream 20250910 commit e408272
#667
…ject#23831) Signed-off-by: Thomas Parnell <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: zjy0516 <[email protected]> Signed-off-by: Jiangyun Zhu <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
…23971) Signed-off-by: sadeghja1070 <[email protected]> Signed-off-by: Jee Jee Li <[email protected]> Co-authored-by: Claude <[email protected]> Co-authored-by: Jee Jee Li <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
…finally (vllm-project#23758) Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Andy Lo <[email protected]> Co-authored-by: Jee Jee Li <[email protected]>
Signed-off-by: Gabriel Marinho <[email protected]> Signed-off-by: Gabriel Marinho <[email protected]> Signed-off-by: Max de Bayser <[email protected]> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Max de Bayser <[email protected]>
…entifiers. (vllm-project#23394) Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Didier Durand <[email protected]>
Signed-off-by: Roger Wang <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: Or Ozeri <[email protected]>
…TQ and AutoRound-GPTQ) (vllm-project#23994) Signed-off-by: JartX <[email protected]> Signed-off-by: Isotr0py <[email protected]> Co-authored-by: Isotr0py <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: JunHowie <[email protected]> Co-authored-by: JunHowie <[email protected]> Co-authored-by: Isotr0py <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
…tialization (vllm-project#23357) Signed-off-by: Isotr0py <[email protected]>
…oject#23677) Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Benji Beck <[email protected]>
Signed-off-by: Christian Pinto <[email protected]> Signed-off-by: Max de Bayser <[email protected]> Co-authored-by: Max de Bayser <[email protected]>
…LAM Tool Parser (vllm-project#22769) Signed-off-by: Devon Peroutky <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
…L_LEN (vllm-project#20904) Signed-off-by: wang.yuqi <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]> Co-authored-by: Wentao Ye <[email protected]>
Signed-off-by: Didier Durand <[email protected]>
…lm-project#23735) Signed-off-by: NickLucche <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
…4524) Signed-off-by: Tyler Michael Smith <[email protected]>
… attn chunking (vllm-project#24474) Signed-off-by: Yong Hoon Shin <[email protected]>
…roject#24546) Signed-off-by: Chen Zhang <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
… later) (vllm-project#24129) Signed-off-by: ignaciosica <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
…t#24528) Signed-off-by: Nick Hill <[email protected]>
…llm-project#23620) Signed-off-by: ApostaC <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
…enable test for LlamaForCausalLMEagle3 (vllm-project#24392) Signed-off-by: wwl2755 <[email protected]>
…llm-project#24154) Signed-off-by: Wei Wei <[email protected]>
…verrides processing. (vllm-project#24271) Signed-off-by: Chenheli Hua <[email protected]>
…ct#23845) Signed-off-by: Omer Dayan (SW-GPU) <[email protected]> Signed-off-by: Peter Schuurman <[email protected]> Co-authored-by: Omer Dayan (SW-GPU) <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
… the docs (vllm-project#24041) Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Purpose
Sync with upstream to pick up the latest changes and to fix the Compressed Tensor FP8 weight-loading accuracy issue.
Upgrade vLLM version to 0.10.2rc2.dev+ge408272
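
One way to confirm that an installed build matches the synced commit is to read the short hash back out of the version string. The following is a minimal sketch, not part of this PR: `commit_from_version` is a hypothetical helper, and it assumes the version follows the `+g<hash>` local-segment convention seen in the string above.

```python
import re
from typing import Optional


def commit_from_version(version: str) -> Optional[str]:
    """Hypothetical helper: return the short commit hash embedded in a
    version string such as "0.10.2rc2.dev+ge408272", or None if absent.

    Assumes the local segment starts with "g" followed by a hex hash.
    """
    match = re.search(r"\+g([0-9a-f]{7,40})(?:\.|$)", version)
    return match.group(1) if match else None


# Commit named in the PR title and the version string quoted in the Purpose.
EXPECTED_COMMIT = "e408272"
VERSION = "0.10.2rc2.dev+ge408272"

if commit_from_version(VERSION) != EXPECTED_COMMIT:
    raise SystemExit("installed build does not match the synced commit")
```

In an installed environment the string could instead come from `importlib.metadata.version(...)`, assuming the distribution name used by this fork is known.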
Test Plan
Validated on the models of interest.
Test Result