[Unity][FX] Add support for PT2.0 scaled_dot_product_attention #14841
vinx13 merged 7 commits into apache:unity
Conversation
@masahi torch is installed in the CI GPU container, but not in the CI CPU container.

Oh, I thought the CPU image had PT as well. It is a bit ironic, since even for the GPU image we install the CPU build of PT (we only use PT for reference). Is there a reason the CPU image cannot have PT? It is pretty bad if we have to add
I also noticed this before. I don't know why the PT CPU version was used in the GPU container, and I don't know why the CPU image doesn't have PT (and ONNX) either.

Because their GPU build often causes trouble when we upgrade the PT version in our CI. It seems our CPU image has TFLite, MXNet, and Caffe installed; I see no reason PT cannot be installed. I'll work on that next week.

I just tried to install torch 2.0 in my local CPU container (run from the same image as CI), and it works well.
Okay, after #14842 lands and the new image is published, I'll update our CPU image.
Note that a float mask that's added to the score can be supported via
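For reference, here is a minimal sketch of the semantics of such an additive float mask in plain PyTorch (illustrative shapes, not the converter code): the mask is broadcast against the score tensor and added to the scaled scores before the softmax.

```python
import math
import torch
import torch.nn.functional as F

def sdpa_with_float_mask(q, k, v, mask):
    # q, k, v: (batch, num_heads, seq_len, head_dim); the float mask
    # broadcasts against the (batch, num_heads, seq_q, seq_k) scores.
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.shape[-1])
    return torch.softmax(scores + mask, dim=-1) @ v

q, k, v = (torch.randn(2, 8, 16, 64) for _ in range(3))
mask = torch.randn(2, 8, 16, 16)
ref = F.scaled_dot_product_attention(q, k, v, attn_mask=mask)
torch.testing.assert_close(sdpa_with_float_mask(q, k, v, mask), ref,
                           rtol=1e-4, atol=1e-4)
```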
Force-pushed from a180871 to 987656c.
Cleaned up the test cases quite a bit.
diffusers started to use `scaled_dot_product_attention` as of v0.16 in the SD VAE. See the doc: https://pytorch.org/docs/stable/generated/torch.nn.functional.scaled_dot_product_attention.html. We cannot support the attention mask, dropout, or the causal-mask optimization for now.

The PT attention op requires a different input format than our attention op, so we need to transpose the inputs. Luckily, those transposes can be cancelled in practice, since diffusers transposes q/k/v before calling `scaled_dot_product_attention` anyway (a design mistake?): https://github.com/huggingface/diffusers/blob/909742dbd6873052995dc6cd5f4150ff238015d2/src/diffusers/models/attention_processor.py#L906-L908 (see the layout sketch below).

I'm also cleaning up the FX test cases. We shouldn't need `@tvm.testing.requires_gpu`, since the tests don't execute anything on GPU. I'll also try removing the local `import torch` if possible (CI should have PT installed, right?)

@MasterJH5574 @jinhongyii @cyx-6
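To make the cancellation concrete, here is a small sketch. Treating our attention op as taking `(batch, seq_len, num_heads, head_dim)` is an assumption for illustration, based on the description above rather than the converter code:

```python
import torch

# diffusers holds q/k/v in (batch, seq_len, num_heads, head_dim) and
# transposes to PT's SDPA layout (batch, num_heads, seq_len, head_dim)
# right before the call; a converter targeting an op that expects the
# original layout would transpose straight back, so the two transposes
# compose to the identity and both can be eliminated.
q = torch.randn(2, 16, 8, 64)   # (batch, seq_len, num_heads, head_dim)
pt_in = q.transpose(1, 2)       # what diffusers feeds to SDPA
our_in = pt_in.transpose(1, 2)  # what the converter would feed our op
assert torch.equal(our_in, q)   # the pair of transposes cancels
```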