Skip to content

[Unity][DLight] Use less shared memory for gemv#15482

Merged
junrushao merged 1 commit intoapache:unityfrom
Hzfengsy:dlight_gemv_for_large_workload
Aug 4, 2023
Merged

[Unity][DLight] Use less shared memory for gemv#15482
junrushao merged 1 commit intoapache:unityfrom
Hzfengsy:dlight_gemv_for_large_workload

Conversation

@Hzfengsy
Copy link
Copy Markdown
Member

@Hzfengsy Hzfengsy commented Aug 4, 2023

This PR fixes the issue of the GEMV rule uses too much shared memory on llama-70B model.

May have perf regression w/o #15471. (Actually not sure)

cc @cyx-6

This PR fixes the issue of the GEMV rule uses too much shared memory on
llama-70B model.
@tvm-bot
Copy link
Copy Markdown
Collaborator

tvm-bot commented Aug 4, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@github-actions github-actions Bot requested a review from cyx-6 August 4, 2023 06:30
@junrushao junrushao merged commit a8218b3 into apache:unity Aug 4, 2023
@Hzfengsy Hzfengsy deleted the dlight_gemv_for_large_workload branch November 5, 2023 09:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants