-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Fix in convert_hf_checkpoint
related to Gemma 3
#2062
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
convert_hf_checkpoint
related to Gemma 3
can we have it back compatible, meaning have this depending on |
297b047
to
9596dcd
Compare
It seems these two failing tests are flaky. |
for more information, see https://pre-commit.ci
Seems we may need to skip this test for our RTX3090...
|
Maybe setting |
would NCCL_DEBUG=INFO give interesting information? |
|
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jirka B <[email protected]> Co-authored-by: Jirka Borovec <[email protected]>
The existing test assumes that parameter names for Gemma 3 start with "vision_tower", "language_model", "multi_modal_projector", but they really start with "model.vision_tower", "model.language_model", "model.multi_modal_projector".
I have
transformers==4.52.4
, which is most recent.