Commit f983d9d

fix the weights loading for the apertus model
1 parent ce30dca commit f983d9d

1 file changed (+5 -0)

vllm/model_executor/models/apertus.py

Lines changed: 5 additions & 0 deletions
@@ -415,6 +415,11 @@ def load_weights(self, weights: Iterable[tuple[str,
             (".qkv_proj", ".v_proj", "v"),
         ]
         params_dict = dict(self.named_parameters())
+
+        # we need to load the buffers for beta and eps (XIELU)
+        for name, buffer in self.named_buffers():
+            params_dict[name] = buffer
+
         loaded_params: set[str] = set()
         for name, loaded_weight in weights:
             if "rotary_emb.inv_freq" in name:
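
Why the change is needed: in PyTorch, named_parameters() does not return tensors registered as buffers; those are only listed by named_buffers(). A lookup table built from parameters alone therefore has no entry for checkpoint tensors that map to buffers, such as the XIELU beta and eps values named in the comment. The sketch below illustrates the pattern without depending on PyTorch or vLLM. TinyModule, the tensor names, and the skip-on-missing behaviour are assumptions made for illustration and are not taken from apertus.py; the real loader may handle an unknown name differently.

```python
# Hypothetical stand-in for a torch.nn.Module: parameters and buffers are kept
# in separate registries, mirroring how PyTorch separates them.
class TinyModule:
    def __init__(self):
        # Names below are illustrative, not the real Apertus tensor names.
        self._parameters = {"mlp.up_proj.weight": [0.0, 0.0]}
        self._buffers = {"mlp.act_fn.beta": [0.0], "mlp.act_fn.eps": [0.0]}

    def named_parameters(self):
        return iter(self._parameters.items())

    def named_buffers(self):
        return iter(self._buffers.items())


def load_weights(module, weights, include_buffers):
    """Copy checkpoint entries into matching targets.

    Returns the sets of loaded and skipped names. Skipping unknown names is
    an assumption of this sketch, chosen so the gap is easy to observe.
    """
    targets = dict(module.named_parameters())
    if include_buffers:
        # Same idea as the commit: make buffers addressable by name too.
        for name, buffer in module.named_buffers():
            targets[name] = buffer

    loaded, skipped = set(), set()
    for name, value in weights:
        if name not in targets:
            skipped.add(name)
            continue
        targets[name][:] = value  # in-place copy into the existing storage
        loaded.add(name)
    return loaded, skipped


checkpoint = [
    ("mlp.up_proj.weight", [1.0, 2.0]),
    ("mlp.act_fn.beta", [0.5]),
    ("mlp.act_fn.eps", [-1e-6]),
]

module_before = TinyModule()
loaded_before, skipped_before = load_weights(
    module_before, checkpoint, include_buffers=False
)

module_after = TinyModule()
loaded_after, skipped_after = load_weights(
    module_after, checkpoint, include_buffers=True
)
```

With include_buffers=False the two activation entries end up in the skipped set and the buffers keep their initial values; with include_buffers=True all three checkpoint entries are copied.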