[Frontend] Support tool calling and reasoning parser #14511
Conversation
Force-pushed from 3cef121 to df45cb1
Force-pushed from df45cb1 to d1a57e7
Thanks for the PR. I will help review it. FYI, I have a refactor for the reasoning support in #14428; not sure if you can reuse it.
I have reviewed PR #14428, and I think we can reuse the is_reasoning_end function. After your review, I will proceed with the refactoring.
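As a rough sketch of the gating idea under discussion (the interface below is hypothetical and only borrows the is_reasoning_end name from #14428; it is not the actual vLLM API):

```python
from typing import Protocol, Sequence


class ReasoningParserLike(Protocol):
    """Hypothetical stand-in for the parser interface discussed in #14428."""

    def is_reasoning_end(self, token_ids: Sequence[int]) -> bool: ...


def should_run_tool_parser(parser: ReasoningParserLike,
                           token_ids: Sequence[int]) -> bool:
    # Tool calls are expected only in the final content, so the tool parser
    # is skipped while the model is still emitting reasoning tokens.
    return parser.is_reasoning_end(token_ids)
```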
Review comment on vllm/entrypoints/openai/reasoning_parsers/abs_reasoning_parsers.py (outdated, now resolved):
The streaming logic looks good to me, but does it also support full generation?
Besides this, we could also add some example code and tests.
And we should:
Yes, it supports full generation. I will add some example code and tests.
OK
Force-pushed from d1a57e7 to 96558b1
Force-pushed from 01e43db to 0b9cc13
I think we can merge this before the refactor; I can rebase in that PR.
Force-pushed from 0b9cc13 to 62043bd
I think this PR is ready to merge. @gaocegege @russellb
Force-pushed from 62043bd to 0cc007f
Will this PR be included in version 0.7.4? When will it be released?
@szpnygo I hope so.
Force-pushed from 16e984e to 4b96d9c
Can you change the title to "[Frontend] Support tool calling and reasoning parser"?
OK, done.
Force-pushed from 2e87f90 to 47a7c16
@robertgshaw2-redhat @mgoin Could you please take a look? It blocks some other PRs about the reasoning parser.
cc @simon-mo
When streaming requests to the model, the output tool message is incomplete.
It may be a tool parser problem.
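One thing worth ruling out when debugging this: with the OpenAI-compatible streaming API, tool call arguments arrive as incremental deltas that the client has to concatenate, so an "incomplete" tool message can also come from client-side accumulation rather than the parser. A minimal accumulation sketch (server URL, model name, and tool schema are placeholders, not part of this PR):

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

weather_tool = {  # placeholder tool schema for illustration
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

stream = client.chat.completions.create(
    model="Qwen/QwQ-32B",  # placeholder model name
    messages=[{"role": "user", "content": "What's the weather in Berlin?"}],
    tools=[weather_tool],
    tool_choice="auto",
    stream=True,
)

# Tool call fragments are keyed by index and must be concatenated client-side.
tool_calls = {}
for chunk in stream:
    if not chunk.choices:
        continue
    delta = chunk.choices[0].delta
    for tc in delta.tool_calls or []:
        entry = tool_calls.setdefault(tc.index, {"name": "", "arguments": ""})
        if tc.function and tc.function.name:
            entry["name"] = tc.function.name
        if tc.function and tc.function.arguments:
            entry["arguments"] += tc.function.arguments

print(tool_calls)
```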
Remove the mutual exclusion restriction between tool calling and the reasoning parser, since models like QwQ-32B can support both at the same time. Additionally, parsing tool calls only from content rather than from reasoning_content can improve the accuracy and performance of tool calling.
This PR resolves issue #14490.
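For context, here is a minimal end-to-end sketch of how the combined setup could be exercised once this lands. The model name, parser choices, and server flags below are assumptions based on the QwQ-32B example above and vLLM's existing reasoning/tool-calling options, not something this PR prescribes:

```python
# Assumed server launch (flags and parser names are illustrative):
#   vllm serve Qwen/QwQ-32B \
#     --enable-reasoning --reasoning-parser deepseek_r1 \
#     --enable-auto-tool-choice --tool-call-parser hermes
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

weather_tool = {  # same placeholder tool schema as in the streaming sketch above
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

response = client.chat.completions.create(
    model="Qwen/QwQ-32B",
    messages=[{"role": "user", "content": "What's the weather in Berlin?"}],
    tools=[weather_tool],
    tool_choice="auto",
)

message = response.choices[0].message
# With this change, a reasoning model can return both fields: reasoning_content
# holds the chain of thought, while tool calls are parsed only from content.
print(getattr(message, "reasoning_content", None))
print(message.tool_calls)
```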