[Bug] [Spec Decode] Fix model_initialization test and mismatch in aux_hidden_layers #24613
Conversation
Signed-off-by: wwl2755 <[email protected]>
Code Review
This pull request effectively addresses two bugs related to speculative decoding tests. First, it corrects the model initialization to use the proper target model instead of the speculative model. Second, it resolves a shape mismatch for aux_hidden_layers by introducing a mechanism to use the original number of layers from the model config during testing. The changes are logical and well-implemented. I have one suggestion to improve the robustness of how the layer count is determined in the test utilities.
num_layers = getattr(text_config, 'num_layers', 1)
num_hidden_layers = getattr(text_config, 'num_hidden_layers', 1)
This logic for determining num_layers and num_hidden_layers can be fragile. If a model config only defines num_hidden_layers (which is common for many models like Llama and Qwen), num_layers will default to 1. This could lead to issues if other parts of the code rely on config.num_layers. A more robust approach would be to fetch the layer count from either attribute and ensure both num_layers and num_hidden_layers are consistent.
Suggested change:

- num_layers = getattr(text_config, 'num_layers', 1)
- num_hidden_layers = getattr(text_config, 'num_hidden_layers', 1)
+ num_layers = getattr(text_config, "num_hidden_layers", getattr(text_config, "num_layers", 1))
+ num_hidden_layers = num_layers
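To make the concern concrete, here is a minimal standalone sketch (using a hypothetical SimpleNamespace stand-in for the text config, not vLLM's actual config class) showing how the original pattern diverges when only num_hidden_layers is defined, and how the suggested version keeps both attributes in sync:

from types import SimpleNamespace

# Hypothetical config that, like Llama or Qwen, only defines num_hidden_layers.
text_config = SimpleNamespace(num_hidden_layers=32)

# Original pattern: the two lookups are independent, so num_layers
# silently falls back to 1 while num_hidden_layers is 32.
num_layers = getattr(text_config, 'num_layers', 1)                # -> 1
num_hidden_layers = getattr(text_config, 'num_hidden_layers', 1)  # -> 32

# Suggested pattern: prefer num_hidden_layers, fall back to num_layers,
# and keep both values consistent.
num_layers = getattr(text_config, "num_hidden_layers",
                     getattr(text_config, "num_layers", 1))       # -> 32
num_hidden_layers = num_layers                                    # -> 32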
@wwl2755 If I understand correctly - this is just to fix our CI setup, right?
I turned on the ready label to see if it fixes the issue.
Yes. It enables some tests to use the original num_layers instead of 1. All the modifications are under
Signed-off-by: wwl2755 <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Cyrus Leung <[email protected]>
…_hidden_layers (vllm-project#24613) Signed-off-by: wwl2755 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Signed-off-by: Cyrus Leung <[email protected]> Co-authored-by: Roger Wang <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
Fix the bug of model_initialization. The model should be initialized by a target model, which is different from the speculative_model.

After changing to the correct target model, I faced another error, caused by the different shape in hidden_states + residual.

Root cause: num_layers is initialized to 1 in the test (https://github.com/vllm-project/vllm/blob/v0.10.2rc1/tests/models/utils.py#L387). However, the aux_hidden_layer is determined as (2, num_layers // 2, num_layers - 3) (https://github.com/vllm-project/vllm/blob/v0.10.2rc1/tests/models/utils.py#L387). So, before this PR, layer-0 was treated as the aux-layer, entering the code path of hidden_states + residual (where hidden_states is a tensor and residual is None).

Solution: we set some of the eagle3 models to have the original number of layers.
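As a quick illustration (a standalone sketch, not the actual test code), plugging num_layers = 1 into that formula shows why layer-0 gets selected as an aux layer:

# Standalone sketch: how the aux hidden layer indices collapse when the
# test initializes the model with a single layer.
def aux_hidden_layers(num_layers: int) -> tuple:
    return (2, num_layers // 2, num_layers - 3)

print(aux_hidden_layers(1))   # (2, 0, -2): layer 0 is picked as an aux layer
print(aux_hidden_layers(32))  # (2, 16, 29): the intended spread across layers

# With num_layers = 1, layer 0 is in the aux set, so the forward pass hits
# hidden_states + residual while residual is still None, producing the error
# this PR fixes by keeping the original layer count for these tests.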
cc: @DarkLight1337 @WoosukKwon @LiuXiaoxuanPKU @22quinn @benchislett
Tests
All passed.