[Bug]: Qwen2.5: Sliding window for some but all layers is not supported. This model uses sliding window but max_window_layers = 28 is less than num_hidden_layers = 28. Please open an issue to discuss this feature. #15705
When I try to generate a response with Qwen2.5-7B-Instruct, I get a ValueError raised by this line:
Sliding window for some but all layers is not supported. This model uses sliding window but `max_window_layers` = 28 is less than `num_hidden_layers` = 28. Please open an issue to discuss this feature.
The model I am using was fine-tuned with the trl library and flash-attention 2, with sliding window enabled.
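To make the report easier to reproduce, here is a minimal sketch of how the sliding-window settings of the fine-tuned checkpoint could be read. It assumes the checkpoint directory contains a standard Hugging Face `config.json` with the Qwen2-style keys listed below. The helper name `sliding_window_fields` is hypothetical and is not part of vLLM, trl, or transformers.

```python
import json
from pathlib import Path

# Keys as they appear in a Qwen2-style Hugging Face config.json (assumed layout).
SLIDING_WINDOW_KEYS = (
    "use_sliding_window",
    "sliding_window",
    "max_window_layers",
    "num_hidden_layers",
)


def sliding_window_fields(checkpoint_dir):
    """Return the sliding-window related entries of <checkpoint_dir>/config.json.

    Hypothetical helper for illustration only; keys that are missing
    from the file are reported as None.
    """
    config_path = Path(checkpoint_dir) / "config.json"
    config = json.loads(config_path.read_text())
    return {key: config.get(key) for key in SLIDING_WINDOW_KEYS}
```

Calling `sliding_window_fields("path/to/checkpoint")` on both the base model and the fine-tuned model would show whether fine-tuning changed any of these values.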
It looks like there is a TODO tag on this line. Does this check make sense?
I'm curious why everything works fine when I train with trl and vLLM, but vLLM throws this ValueError when I run prediction with the fine-tuned model.