Skip to content

Pull requests: PaddlePaddle/PaddleFormers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

【cherry-pick】Fix attn impl and ernie4.5 for erniekit (#2580)
#2590 opened Sep 11, 2025 by cheng221 Loading…
2 tasks
【cherry-pick】fix pretrained_config save dtype (#2587)
#2589 opened Sep 11, 2025 by cheng221 Loading…
2 tasks
[CI] update codecov contributor
#2588 opened Sep 11, 2025 by Liujie0926 Loading…
2 tasks
Update mergekit with hf lora model contributor
#2585 opened Sep 10, 2025 by llbdyiu66 Loading…
【GptOss】add pp model ,code clear
#2570 opened Sep 9, 2025 by xiaoguoguo626807 Loading…
2 tasks
【Bug】Fix attn_mask_startend_row_indices shape mismatch
#2564 opened Sep 8, 2025 by cheng221 Loading…
2 tasks
support Glm4Moe contributor
#2554 opened Sep 5, 2025 by WYB27 Loading…
【FlexCP】add Flexcp for trainer
#2541 opened Sep 4, 2025 by xiaoguoguo626807 Loading…
2 tasks
use key-value to init dataloader
#2537 opened Sep 3, 2025 by Waynezee Loading…
2 tasks
feat(dsv3):Runnable N1C8 configs
#2525 opened Sep 1, 2025 by hushenwei2000 Loading…
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2524 opened Aug 31, 2025 by chen2016013 Loading…
2 tasks
feat(dsv3):Runnable N1C8 configs
#2523 opened Aug 31, 2025 by chen2016013 Loading…
2 tasks
add moe
#2510 opened Aug 28, 2025 by a31413510 Loading…
fix bug support download ernie model contributor
#2509 opened Aug 28, 2025 by fjjF77 Loading…
fix typos contributor
#2500 opened Aug 28, 2025 by co63oc Loading…
2 tasks
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2496 opened Aug 27, 2025 by chen2016013 Loading…
2 tasks
Update lora layer source contributor
#2489 opened Aug 27, 2025 by emmanuel-ferdman Loading…
1 of 2 tasks
Merge dsv3 tainer part
#2487 opened Aug 27, 2025 by hushenwei2000 Draft
change deepseekv2 model
#2486 opened Aug 26, 2025 by chen2016013 Loading…
2 tasks
add pre_train entrance
#2483 opened Aug 26, 2025 by chen2016013 Loading…
2 tasks
Support GPT-OSS contributor
#2478 opened Aug 25, 2025 by WYB27 Loading…
support general pp model
#2473 opened Aug 25, 2025 by cheng221 Loading…
2 tasks
add moe
#2467 opened Aug 25, 2025 by a31413510 Loading…
2 tasks
ProTip! Add no:assignee to see everything that’s not assigned.