-
Notifications
You must be signed in to change notification settings - Fork 9
Pull requests: OpenSparseLLMs/Linear-MoE
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add parameter counting func & add new A1B linear-moe-qwen2 model
#1
by LanDisen
was merged Sep 28, 2024
Loading…
Support expert parallel for Qwen2-MoE with MegaBlocks
#6
by Spico197
was merged Nov 12, 2024
Loading…
[Llama3] Able to run Llama3, support Mixattn in Llama3
#8
by JusenD
was merged Dec 11, 2024
Loading…
Add Linear-MoE evaluation README.md based on lm-evaluation-harness
#12
by LanDisen
was merged Jan 20, 2025
Loading…
Adding RWKV-7 implementation to the Linear RNN model.
#19
by MerCury-Orbit
was merged May 8, 2025
Loading…
ProTip!
Updated in the last three days: updated:>2026-02-17.