[Model] Support Tele-FLM Model #15023
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.
It seems this is the only difference from Llama. Can you refactor the model implementation to reduce the duplicated code, just like glm.py and telechat2.py?
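For illustration, a minimal sketch of the kind of refactor being suggested (the class body and import are assumptions, not the code in this PR): subclass the existing Llama implementation and override only what differs, the way telechat2.py builds on llama.py.

```python
# Hypothetical sketch of the suggested refactor: reuse vLLM's Llama code path
# and keep only the Tele-FLM-specific differences in a subclass, similar in
# spirit to telechat2.py. Not the actual implementation in this PR.
from vllm.model_executor.models.llama import LlamaForCausalLM


class TeleFLMForCausalLM(LlamaForCausalLM):
    """Tele-FLM-specific behaviour (e.g. weight naming or output scaling)
    would be overridden here; attention, MLP and the rest of the forward
    pass are inherited from the Llama implementation."""
```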
Is it really necessary to port this config? I think the custom config from the HF dynamic module can work in most cases.
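For context, a small sketch of what relying on the HF dynamic module looks like (the model ID is taken from the PR description; the rest is standard transformers usage): with trust_remote_code=True, the config class shipped with the checkpoint is loaded from the Hub, so a ported copy inside vLLM may not be needed.

```python
from transformers import AutoConfig

# With trust_remote_code=True, transformers imports the config class that
# ships with the checkpoint (the "dynamic module"), instead of requiring a
# config class vendored into vLLM.
config = AutoConfig.from_pretrained(
    "CofeAI/FLM-2-52B-Instruct-2407",
    trust_remote_code=True,
)
print(type(config).__name__)
```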
FYI: the `supported_lora_modules` variable has been deprecated; please remove it.
Hi, remember to update the documentation, see: https://github.com/vllm-project/vllm/blob/main/docs/source/models/supported_models.md
Don't forget updating
All the above suggestions have been adopted, and the corresponding adjustments have been made.
The CI failure occurred in the Entrypoints Test, but it appears unrelated to our code changes.
Can you merge from main to fix the CI failure?
CI failures look unrelated, merging. Thanks for your effort!
Signed-off-by: Naitong Yu <[email protected]> Signed-off-by: jiangxin <[email protected]> Co-authored-by: Jason Fang <[email protected]> Co-authored-by: jiangxin <[email protected]>
This PR adds support for the Tele-FLM model.
Tele-FLM (aka FLM-2) is a 52B open-source multilingual large language model that features a stable, efficient pre-training paradigm and enhanced factual judgement capabilities. Built on a decoder-only transformer architecture, it has been trained on approximately 2T tokens. Tele-FLM demonstrates superior performance at its scale and sometimes surpasses larger models. In addition to sharing the model weights, we provide the core designs, engineering practices, and training details, anticipating their benefits for both academic and industrial communities.
The model collection can be found at Hugging Face (https://huggingface.co/collections/CofeAI/tele-flm-flm-2-669e4dbd2dbf53ccd2454304).
Verified that
vllm serve CofeAI/FLM-2-52B-Instruct-2407 --trust-remote-code --chat-template /examples/tool_chat_template_teleflm.jinja -tp 2
works.
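For reference, a minimal sketch of querying the server started by the command above through vLLM's OpenAI-compatible API (the port and prompt are assumptions; the model name matches the serve command):

```python
from openai import OpenAI

# vLLM's OpenAI-compatible server listens on port 8000 by default.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="CofeAI/FLM-2-52B-Instruct-2407",
    messages=[{"role": "user", "content": "Introduce yourself briefly."}],
)
print(response.choices[0].message.content)
```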