2026.01.19
Pre-release
What's Changed
- Add swiglu_oai_triton for GPTOSS by @Todobe in #270
- Optimize sinks attention for prefix cache by @Todobe in #260
- fix little batchsize and int8 quant on ci by @zhuyutong332 in #302
- fix bmm transpose in cann 8.5 by @randgun in #316
- Modify contribution guide by @BourneSun0527 in #315
- Integrate ccache for faster compilation by @randgun in #318
- add dfx for operator FusedDeepMoe by @wangyibo1005 in #317
- [Chore] CANN version bump to 8.5.0 by @iforgetmyname in #326
New Contributors
- @zhuyutong332 made their first contribution in #302
Full Changelog: 2026.01.12...2026.01.19