[relax] Fix tree attention for Qwen2-1.5 models by Hzfengsy · Pull Request #17700 · apache/tvm

Hzfengsy · 2025-03-03T10:13:16Z

Fix the compilation error (mlc-ai/mlc-llm#3143) for Qwen2-1.5 models in the tree attention implementation for the Vulkan backend.

cc @spectrometerHBH @vinx13

Hzfengsy · 2025-03-03T10:18:15Z

One additional note: this PR provides an immediate fix for the issue, but it doesn't address the underlying problem: the arithmetic simplifier can itself cause integer overflow. For illustration, here's a minimal reproducible example:

```python
import tvm

x = tvm.tir.Var("x", "int32")
# Creating an expression that triggers integer overflow during simplification
expr = (tvm.tir.Div(x + 1073741826, 3) - 357913942) * 1536
ana = tvm.arith.Analyzer()
print(ana.simplify(expr))
```
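To make the overflow concrete, here is a small arithmetic sketch in plain Python (no TVM required). It assumes, purely for illustration, that a rewrite distributes the multiplication by 1536 over the subtraction; the PR does not say which simplifier rule is actually responsible. The helper `wrap_int32` and the names `offset`, `bias`, and `scale` are hypothetical and exist only in this example.

```python
# Plain-Python check of the constants in the reproducer above.
# Assumption: some rewrite forms the constant product 357913942 * 1536
# in int32; which rule does so is not stated in this thread.

INT32_MIN, INT32_MAX = -(2**31), 2**31 - 1


def wrap_int32(value: int) -> int:
    """Reduce an unbounded Python int to its two's-complement int32 value."""
    return (value + 2**31) % 2**32 - 2**31


offset = 1073741826  # constant added to x before the division
bias = 357913942     # constant subtracted after the division
scale = 1536         # outer multiplier

# The added offset is exactly 3 * bias, so the offset and bias cancel
# algebraically after the division by 3.
offset_is_three_times_bias = offset == 3 * bias

# Distributing the multiplication produces this intermediate constant.
product = bias * scale
fits_in_int32 = INT32_MIN <= product <= INT32_MAX
wrapped = wrap_int32(product)

print("offset == 3 * bias:", offset_is_three_times_bias)
print("bias * scale      :", product)
print("fits in int32     :", fits_in_int32)
print("wrapped to int32  :", wrapped)
```

Every constant in the original expression fits in int32 on its own; the out-of-range value only appears once an intermediate constant such as `bias * scale` is formed.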

cc @tqchen

MasterJH5574

Thanks for the fix!

[relax] Fix tree attention for Qwen2-1.5 models

5e19119

github-actions Bot requested review from spectrometerHBH and vinx13 March 3, 2025 10:14

vinx13 approved these changes Mar 3, 2025

MasterJH5574 approved these changes Mar 3, 2025

MasterJH5574 merged commit c286638 into apache:main Mar 3, 2025

ysh329 mentioned this pull request Apr 19, 2025

[Release] v0.20.0 Release Candidate Notes #17860

Closed

ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025

[relax] Fix tree attention for Qwen2-1.5 models (apache#17700)

38b8ef8

kurisu6912 mentioned this pull request Sep 5, 2025

kurisu add assume attr patch 1 tile-ai/tvm#8

Closed

MasterJH5574 merged 1 commit into apache:main from Hzfengsy:fix_qwen_1.5b
