Conversation

@vasqu (Contributor) commented on Jul 4, 2025

Adding the Ernie 4.5 suite of models.

Progress:

  • Ernie 4.5 pure text model (0.3B)
  • MoE Ernie
    • Loading check with untied weights (tested on a dummy model)
    • TP tests
      • Failing with tied weights; needs to be fixed, then it's done
    • Correction bias clarification
      • Following the Paddle code instead of the remote code - added a note, so this is subject to change (see the routing sketch after the follow-up list below)
    • Update configs on hub
    • (MTP support in training)
  • Integration test ^ (needs slow runs to cross check)
  • Check whether the MoE also needs a rotation conversion (the 0.3B modeling files differ from the other ones regarding RoPE)
    • Yes, they do - turns out they use a similar trick to the one I used in 393c2c7
    • Adapted from GLM, which uses the same RoPE style (see the permutation sketch after this list)
  • Fixup tokenization
    • Conversion: see convert...tokenizer
    • Update on the hub
  • Docs (might need updates based on the tokenizer ^)
  • Update the original hub repos on the Baidu side --> tokenizer + configs
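
As referenced in the rotation-conversion item above, two approaches show up in this PR for the even/odd (interleaved) RoPE used by these checkpoints versus transformers' half/half rotate_half convention: either apply the rotation in interleaved order at runtime (the GLM-style path the MoE ended up on), or permute the rows of the q/k projection weights once at conversion time so the standard half/half path can be reused (roughly the trick used for the 0.3B model). A minimal sketch of that permutation, with illustrative names only (this is not the actual conversion script):

```python
import torch

def permute_for_rotate_half(w: torch.Tensor, num_heads: int, head_dim: int) -> torch.Tensor:
    """Reorder the rows of a q/k projection weight so that half/half rotate_half
    RoPE on the permuted weight matches interleaved (even/odd) RoPE on the
    original weight. Within each head, rows [0, 1, 2, 3, ...] become
    [0, 2, 4, ..., 1, 3, 5, ...]."""
    out_dim, in_dim = w.shape                         # out_dim == num_heads * head_dim
    w = w.view(num_heads, head_dim // 2, 2, in_dim)   # (head, pair, even/odd, in)
    w = w.transpose(1, 2)                             # (head, even/odd, pair, in)
    return w.reshape(out_dim, in_dim)
```

Applied once to the q and k projection weights (and biases, reshaped analogously per head) at conversion time, the permuted checkpoint can then go through the standard Llama-style RoPE code path, assuming the usual cat([freqs, freqs]) cos/sin layout.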

New/Followup PR:

  • MoE Ernie VL
    • The MoE is different (possibly not allowing for the original Mixtral-based MoE formula?)
      • It can have different capacities
      • Different gating 👀 (see the routing sketch after this list)
    • 3D RoPE for image and text (with a different RoPE formulation, ~GLM style: even/odd interleaved instead of half/half)
    • Miscellaneous, as in the other remote code
      • Attention
      • RMS norm
      • Residual
      • Proper padding support etc.
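
On gating and the correction-bias item in the progress list above: a common pattern for correction-bias routing (as in DeepSeek-V3-style auxiliary-loss-free balancing) is to compute the gate scores in float32, add a per-expert bias only when selecting the top-k experts, and then renormalize the unbiased scores of the selected experts for mixing. The sketch below is a hedged illustration of that idea; the names, the softmax choice, and the exact normalization are assumptions, not the Ernie 4.5 implementation:

```python
import torch
import torch.nn.functional as F

def route_with_correction_bias(hidden_states, gate_weight, correction_bias, top_k):
    """Hedged sketch: the correction bias only influences which experts are
    selected, not the weights used to mix their outputs."""
    # keep the gate computation in float32 (the commit log notes the gate needs to stay float)
    scores = F.softmax(hidden_states.float() @ gate_weight.float().t(), dim=-1)
    # bias-corrected selection: the per-expert bias is added for top-k only
    _, selected_experts = torch.topk(scores + correction_bias, k=top_k, dim=-1)
    # mixing weights come from the unbiased scores of the selected experts
    routing_weights = torch.gather(scores, dim=-1, index=selected_experts)
    # renormalize over the selected experts ("works when adding normalization")
    routing_weights = routing_weights / routing_weights.sum(dim=-1, keepdim=True)
    return routing_weights, selected_experts
```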

@vasqu (Contributor, Author) commented on Jul 7, 2025

run-slow: ernie4_5

github-actions bot commented on Jul 7, 2025

This comment contains run-slow, running the specified jobs:

models: ['models/ernie4_5']
quantizations: [] ...

@ArthurZucker (Collaborator) commented:

Let's go! 🚀

Comment on lines +2980 to +2989
# Passing hooks over to the embeddings if needed
# (currently limited to tensor parallel hooks and flags only)
if hasattr(input_embeddings, "_is_hooked") and getattr(input_embeddings, "_hf_tp_plan", None):
    output_embeddings._is_hooked = input_embeddings._is_hooked
    output_embeddings._hf_tp_plan = input_embeddings._hf_tp_plan
    output_embeddings._forward_hooks = input_embeddings._forward_hooks
    output_embeddings._forward_pre_hooks = input_embeddings._forward_pre_hooks
    output_embeddings.__repr__ = (
        lambda: f"{output_embeddings.__repr__()}\nTP Plan: {output_embeddings._hf_tp_plan}"
    )
okay! makes sense!


[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, ernie4_5, ernie4_5_moe

@vasqu merged commit b4115a4 into huggingface:main on Jul 21, 2025 (25 checks passed)
@vasqu deleted the ernie4_5 branch on July 21, 2025 at 17:51
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Jul 22, 2025
* init

* copied from remote

* add proper structure and llama like structure

* fixup

* revert to state that works

* get closer to llama

* slow and steady

* some removal

* masks work

* it is indeed the rope implementation, how dafuq does it mesh with the cache now hmm

* nice

* getting closer

* closer to transformers style

* let's simplify this, batching works now

* simplified

* working version with modular

* it is indeed the rotation per weights, make it complete llama style

* cleanup conversion, next to look at -> tokenizer

* remove llama artefacts

* fix modeling tests (common ones)

* style

* integration test + first look into tokenization (will need more work, focussing on modeling other models first)

* style

* working moe version, based on remote

* lets keep it simple and go step by step - transformers annotations for modular and transformers style rope (complex view)

* more cleanup

* refactor namings and remove addition forXXX classes

* our moe won't cut it it seems, correction bias seems to be missing in remote code version

* tokenization change (remote)

* our moe version works when adding normalization :D

* cleanup moe

* nits

* cleanup modeling -> let's get to modular next

* style

* modular v1

* minor things + attempt at conversion (which doesn't work)

* no conversion follow glm, fixup modular and other nits

* modular cleanup

* fixes

* tests, tests, tests + some moe dtype forcing

* simplify modular, fix fatal fa2 bug, remaining tests

* fix import issue?

* some initial docs, fix bnb faulty behavior --> needs to fix some tests because of gate needing to be float

* fix sdpa test, load on init dtype only

* fixup post merge

* style

* fix doc links

* tokenization cleanup beginnings

* simplify tokenizer by a lot as its basically llama

* tokenizer is full llama with different defaults + extra special tokens

* sync og special tokens of ernie

* fix decoding with numbers (also in remote done what a timing), begin of tok tests

* align with remote and preserve special tokens, adjust tests to ernie legacy behavior, warning for questionable behavior (also in llama)

* nits

* docs

* my daily post merge it is

* check

* tokenization update with explanations and conversion script

* review on modular (til), revert some tokenizer things i did prior, remove mtp comment (low prio)

* post merge fixes

* fixup tokenization, llama fast is the way to go

* more fixups

* check

* import fixes

* correction bias following the paddle code

* fix

* fix TP plan, fix correction bias sharding during forward

* style

* whoops

* fix tied weights

* docs and last nit

* license

* flasky tests

* move repo id, update when merged on the hub
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025