Apertus - Error while loading model

### System Info

Latest vLLM Docker image with the latest `transformers` installed from git, running on an NVIDIA GPU.

### Who can help?

@ArthurZucker @EduardDurech @Cyrilvallez 

### Information

- [ ] The official example scripts
- [ ] My own modified scripts

### Tasks

- [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...)
- [ ] My own task or dataset (give details below)

### Reproduction

Loading the model fails with the following error:

`ValueError: There is no module or parameter named 'model.layers.0.mlp.act_fn.beta' in TransformersForCausalLM`

To reproduce, load `swiss-ai/Apertus-8B-Instruct-2509` with the latest git version of Transformers for inference.

In my case, I used the vLLM Docker image and upgraded `transformers` manually:

```bash
docker run --rm -it --gpus all -e HF_TOKEN=$HF_TOKEN vllm/vllm-openai:latest --model swiss-ai/Apertus-8B-Instruct-2509 --port 8080 --async-scheduling
```

### Expected behavior

The model should load without errors. I don't know what the missing `model.layers.0.mlp.act_fn.beta` module or parameter refers to.
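
One way to narrow this down might be to check whether the checkpoint itself ships a tensor with that name. Below is a minimal sketch, assuming the repository is sharded and has a `model.safetensors.index.json` file with the usual `weight_map` field. `find_keys` and `SAMPLE_INDEX` are hypothetical names, and the sample content is made up for illustration, not copied from the actual checkpoint.

```python
import json


def find_keys(index_json: str, needle: str) -> list:
    """Return the sorted tensor names in a safetensors index that contain `needle`."""
    # The index maps each tensor name to the shard file that stores it.
    weight_map = json.loads(index_json).get("weight_map", {})
    return sorted(name for name in weight_map if needle in name)


# Hypothetical, shortened index content (assumption, not the real file).
SAMPLE_INDEX = json.dumps({
    "metadata": {},
    "weight_map": {
        "model.layers.0.mlp.act_fn.beta": "model-00001-of-00002.safetensors",
        "model.layers.0.mlp.up_proj.weight": "model-00001-of-00002.safetensors",
        "model.layers.0.mlp.down_proj.weight": "model-00001-of-00002.safetensors",
    },
})

if __name__ == "__main__":
    # List every checkpoint tensor that belongs to an activation module.
    for name in find_keys(SAMPLE_INDEX, "act_fn"):
        print(name)
```

If the real index lists `act_fn.*` tensors, the checkpoint carries activation state that the loader cannot map onto the model it built. That would point at the loading side and not at a corrupted download.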

Apertus - Error while loading model #40650

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Apertus - Error while loading model #40650

Description

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions