
Conversation

Oliver-ss (Contributor) commented:
  1. baichuan-7b and baichuan-13b declare different architectures in config.json: "BaichuanForCausalLM" for baichuan-13b and "BaiChuanForCausalLM" for baichuan-7b (note the capitalization difference).
  2. The model structures are almost identical except for the position embedding (RoPE for baichuan-7b, ALiBi for baichuan-13b), so most of the code could be shared.
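The two points above can be made concrete with a small sketch: a loader might key on the `architectures` string from `config.json` to choose the position-embedding variant, and the ALiBi path needs per-head slopes (the slope formula below follows the ALiBi paper by Press et al.; the dispatch table is illustrative, not vLLM's actual code):

```python
import math

# Illustrative mapping from the architecture name in config.json to the
# position-embedding variant. The class-name strings are the real ones
# quoted above; the dispatch itself is a sketch, not vLLM's code.
POSITION_EMBEDDING = {
    "BaiChuanForCausalLM": "rope",   # baichuan-7b
    "BaichuanForCausalLM": "alibi",  # baichuan-13b
}

def get_alibi_slopes(num_heads):
    """Per-head ALiBi slopes (Press et al., 2022).

    For a power-of-two head count n, the slopes are the geometric
    sequence 2^(-8/n), 2^(-16/n), ...; for other head counts, extra
    slopes are taken from the sequence for 2n, skipping every other
    element.
    """
    closest_pow2 = 2 ** math.floor(math.log2(num_heads))
    base = 2.0 ** (-8.0 / closest_pow2)
    slopes = [base ** (i + 1) for i in range(closest_pow2)]
    if closest_pow2 != num_heads:
        extra_base = 2.0 ** (-8.0 / (2 * closest_pow2))
        num_extra = num_heads - closest_pow2
        slopes += [extra_base ** (2 * i + 1) for i in range(num_extra)]
    return slopes
```

For example, `get_alibi_slopes(8)` starts at 0.5 and halves each step, while a non-power-of-two head count interleaves slopes from the next-larger power-of-two sequence.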

@zhuohan123 (Member) left a comment:


Thank you for your contribution! Just tested this out with the 13B model and -tp 4, and it works like a charm!
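For reference, a serving command along the lines of the reviewer's test might look like the sketch below; the Hugging Face model name is an assumption, and flags can differ across vLLM versions, so treat this as illustrative rather than the exact command used:

```shell
# Illustrative only: requires 4 GPUs and the Baichuan-13B weights.
python -m vllm.entrypoints.api_server \
    --model baichuan-inc/Baichuan-13B-Chat \
    --trust-remote-code \
    --tensor-parallel-size 4
```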

@zhuohan123 zhuohan123 merged commit 64f23c2 into vllm-project:main Aug 2, 2023
@luohao123 commented:
@zhuohan123 So this resolved the baichuan-13b TP issue?

But I think there was another PR for this merged earlier; which one should we use?

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Jan 24, 2025
Signed-off-by: Chendi Xue <[email protected]>
Signed-off-by: Chendi.Xue <[email protected]>
amy-why-3459 pushed a commit to amy-why-3459/vllm that referenced this pull request Sep 15, 2025
### What this PR does / why we need it?
Part of vllm-project#499 
Add a qwen2.5-vl test on a single NPU. The v1 engine is excluded because qwen2.5-vl currently has some problems with v1. This test also makes vllm-project#639 more credible.

Signed-off-by: wangli <[email protected]>