Pull requests: hiyouga/LlamaFactory
Pull requests list
[feat] support HyperParallel PT training and activation optimization
#10370
opened Apr 9, 2026 by
Cui-yshoho
Contributor
Draft
fix: sanitize subprocess call in launcher.py
#10369
opened Apr 9, 2026 by
orbisai0security
3 tasks done
[v1] fix device mesh and clip_grad_norm for ulysses cp
#10366
opened Apr 8, 2026 by
sunyi0505
Contributor
2 tasks done
[model] add gemma4_text template for text-only SFT/DPO training
#10362
opened Apr 7, 2026 by
leivy-dev
2 tasks done
fix: use json.loads with Path.read_text() instead of json.load with Path
#10361
opened Apr 6, 2026 by
satishkc7
[perf] Skip unused lm_head projection and hidden state storage in RM trainer
#10353
opened Apr 5, 2026 by
tonywang1990
4 tasks done
[ray] fix placement group over-allocation and NCCL hang on GPU-less head node
#10349
opened Apr 3, 2026 by
ilover311
2 tasks done
fix: add qwen3_5_moe to MoE configuration in moe.py
invalid
#10307
opened Mar 21, 2026 by
majiayu000
[v1] add deepspeed zero3 trigger for low memory usage weight loading
#10300
opened Mar 19, 2026 by
jiaqiw09
Collaborator
1 of 2 tasks
fix: mutable default arg and bool comparison
#10297
opened Mar 18, 2026 by
LincolnBurrows2017
Contributor
feat: clearer train_result metrics log through calculate_tps function
#10288
opened Mar 17, 2026 by
UmeanNever
1 of 2 tasks
[V1]support resume training from checkpoint
#10280
opened Mar 13, 2026 by
frozenleaves
Collaborator
fix qwen3vl moe fuse on transformers 5.x and update docs about timeout
#10274
opened Mar 12, 2026 by
addsubmuldiv
2 tasks
feat: add LightOnOCR-2 integration for LoRA/QLoRA fine-tuning
#10192
opened Feb 16, 2026 by
johnlockejrr
2 tasks
Fix memory leak on MPS by explicitly clearing cache in trainer step
#10190
opened Feb 14, 2026 by
asebaq
1 of 2 tasks
[v1] Add hyperparams and training docs
#10188
opened Feb 13, 2026 by
frozenleaves
Collaborator
[feat] Add DeepSpeed ZeRO-3 LoRA checkpoint save support
#10124
opened Jan 22, 2026 by
kimberlykang
2 tasks done