Skip to content

Docs: add torch compile cache#4151

Merged
zhaochenyang20 merged 3 commits intosgl-project:mainfrom
zhaochenyang20:doc_update
Mar 6, 2025
Merged

Docs: add torch compile cache#4151
zhaochenyang20 merged 3 commits intosgl-project:mainfrom
zhaochenyang20:doc_update

Conversation

@zhaochenyang20
Copy link
Collaborator

Motivation

Modifications

Checklist

@zhaochenyang20 zhaochenyang20 merged commit ebddb65 into sgl-project:main Mar 6, 2025
10 of 12 checks passed
aoshen524 pushed a commit to aoshen524/sglang that referenced this pull request Mar 10, 2025
Co-authored-by: ybyang <ybyang7@iflytek.com>
@diggle001
Copy link

Hi, can anyone tell me Why doesn't torch-compile support FP8 models?
When I deployed the deepseek model of fp8, I added the --enable-torch-compile parameter to the startup command, which worked fine, but the startup still took a long time.

@ispobock
Copy link
Collaborator

Hi, can anyone tell me Why doesn't torch-compile support FP8 models? When I deployed the deepseek model of fp8, I added the --enable-torch-compile parameter to the startup command, which worked fine, but the startup still took a long time.

The torch.compile has supported FP8 since torch 2.5. The doc should be updated. @zhaochenyang20

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants