Checklist
Motivation
As we did in this comment:
#18306
We should profile the actual time breakdown in the update weights from disk.
Ideally speaking, 7B models' update should be within 1s (no considering save to disk time) in this https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial/blob/main/sglang/latency-accelerate-for-weight-updates/readme.md
Related resources
No response