Skip to content

Commit 68d5b4f

Browse files
WangErXiaoskyloevil
authored andcommitted
[Doc] Add documentation for GLM-4.5 series models: tool-calling and reasoning parser (vllm-project#24589)
Signed-off-by: WangErXiao <[email protected]>
1 parent 74a072d commit 68d5b4f

File tree

2 files changed

+10
-0
lines changed

2 files changed

+10
-0
lines changed

docs/features/reasoning_outputs.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,7 @@ vLLM currently supports the following reasoning models:
1515
| [IBM Granite 3.2 language models](https://huggingface.co/collections/ibm-granite/granite-32-language-models-67b3bc8c13508f6d064cff9a) | `granite` |||
1616
| [Qwen3 series](https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f) | `qwen3` | `guided_json`, `guided_regex` ||
1717
| [Hunyuan A13B series](https://huggingface.co/collections/tencent/hunyuan-a13b-685ec38e5b46321e3ea7c4be) | `hunyuan_a13b` | `guided_json`, `guided_regex` ||
18+
| [GLM-4.5 series](https://huggingface.co/collections/zai-org/glm-45-687c621d34bda8c9e4bf503b) | `glm45` | `guided_json`, `guided_regex` ||
1819

1920
!!! note
2021
IBM Granite 3.2 reasoning is disabled by default; to enable it, you must also pass `thinking=True` in your `chat_template_kwargs`.

docs/features/tool_calling.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -311,6 +311,15 @@ Flags:
311311
* For non-reasoning: `--tool-call-parser hunyuan_a13b`
312312
* For reasoning: `--tool-call-parser hunyuan_a13b --reasoning-parser hunyuan_a13b --enable_reasoning`
313313

314+
### GLM-4.5 Models (`glm45`)
315+
316+
Supported models:
317+
318+
* `ZhipuAI/GLM-4.5`
319+
* `ZhipuAI/GLM-4.5-Air`
320+
321+
Flags: `--tool-call-parser glm45`
322+
314323
### Models with Pythonic Tool Calls (`pythonic`)
315324

316325
A growing number of models output a python list to represent tool calls instead of using JSON. This has the advantage of inherently supporting parallel tool calls and removing ambiguity around the JSON schema required for tool calls. The `pythonic` tool parser can support such models.

0 commit comments

Comments
 (0)