Exclude parameters from AdamW's weight decaying #2650

@markub3327

Hello,

I'm working with a Transformer and need to exclude the pos_embed parameters, which are defined in my custom Layer, from weight decay. Weight decay should still be applied to all remaining parameters (every one of them is trainable). This behaviour is implemented here.

Thanks a lot.
Have a nice day.
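
To make the request concrete, here is a minimal sketch of the selection step, assuming the variables to skip can be identified by a substring of their name. The helper `split_decay_vars` and the variable names are hypothetical, and plain objects with a `.name` attribute stand in for the model's trainable variables.

```python
from types import SimpleNamespace


def split_decay_vars(variables, exclude=("pos_embed",)):
    """Partition variables into (decay, no_decay) by substring match on `.name`.

    Hypothetical helper: `exclude` lists name fragments that mark
    variables which should NOT receive weight decay.
    """
    decay, no_decay = [], []
    for var in variables:
        if any(key in var.name for key in exclude):
            no_decay.append(var)
        else:
            decay.append(var)
    return decay, no_decay


# Stand-ins for a model's trainable variables; the names are made up.
variables = [
    SimpleNamespace(name="encoder/pos_embed:0"),
    SimpleNamespace(name="encoder/dense/kernel:0"),
    SimpleNamespace(name="encoder/dense/bias:0"),
]

decay_vars, no_decay_vars = split_decay_vars(variables)
print([v.name for v in decay_vars])
print([v.name for v in no_decay_vars])
```

If I read the TensorFlow Addons documentation correctly, `AdamW.minimize` and `AdamW.apply_gradients` accept a `decay_var_list` argument, so a list like `decay_vars` could be passed there while gradients are still applied to every variable. I have not verified that this covers the custom-Layer case described above.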
