Skip to content

Commit 88419e1

Browse files
nathanrchneicherseiji
authored andcommitted
Fix weights loading for Apertus (vllm-project#24100)
Signed-off-by: Nathan Ranchin <[email protected]>
1 parent 01157ad commit 88419e1

File tree

1 file changed

+6
-0
lines changed

1 file changed

+6
-0
lines changed

vllm/model_executor/models/apertus.py

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -415,6 +415,12 @@ def load_weights(self, weights: Iterable[tuple[str,
415415
(".qkv_proj", ".v_proj", "v"),
416416
]
417417
params_dict = dict(self.named_parameters())
418+
419+
# we need to load the buffers for beta and eps (XIELU)
420+
for name, buffer in self.named_buffers():
421+
if name.endswith(".beta") or name.endswith(".eps"):
422+
params_dict[name] = buffer
423+
418424
loaded_params: set[str] = set()
419425
for name, loaded_weight in weights:
420426
if "rotary_emb.inv_freq" in name:

0 commit comments

Comments
 (0)