Fix the get_target_compute_version for sm >= 100 #17716

Merged
yongwww merged 1 commit into apache:main from yongwww:fix_compute_versin
Mar 7, 2025


@yongwww yongwww commented Mar 7, 2025

An architecture of sm_100 or higher is currently parsed as 1.0.0, which breaks some dependent passes. The expected major version is 10.
Ref: the GPU compute capability table at https://developer.nvidia.com/cuda-gpus
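
For illustration, here is a minimal sketch of the parsing issue described above. It is not the code from this PR: `naive_version` and `compute_version_from_arch` are hypothetical helpers, and the sketch assumes the last digit of the arch number is the minor version and everything before it is the major version.

```python
import re


def naive_version(arch: str) -> str:
    """Hypothetical per-character split, shown only to reproduce the symptom."""
    # "sm_100" -> "100" -> "1.0.0"
    return ".".join(arch.split("_")[1])


def compute_version_from_arch(arch: str) -> str:
    """Hypothetical helper: map an arch string such as "sm_100" to "major.minor".

    Assumption: the final digit is the minor version; the leading digits
    form the major version. An optional one-letter suffix (as in "sm_90a")
    is accepted and ignored here.
    """
    match = re.fullmatch(r"sm_(\d{2,})([a-z]?)", arch)
    if match is None:
        raise ValueError(f"unrecognized arch string: {arch!r}")
    digits = match.group(1)
    major, minor = digits[:-1], digits[-1]
    return f"{int(major)}.{minor}"


print(naive_version("sm_100"))              # 1.0.0
print(compute_version_from_arch("sm_100"))  # 10.0
print(compute_version_from_arch("sm_86"))   # 8.6
```

Splitting on digit position rather than on string length keeps two-digit architectures such as sm_86 unchanged while giving sm_100 a major version of 10.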

cc: @MasterJH5574 @cyx-6


@MasterJH5574 MasterJH5574 left a comment

LGTM.

@yongwww yongwww merged commit 85ab5ba into apache:main Mar 7, 2025
@yongwww yongwww deleted the fix_compute_versin branch March 7, 2025 22:39
ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025