Add kernelize to transformers by MekkCyber · Pull Request #38205 · huggingface/transformers

MekkCyber · 2025-05-19T15:13:55Z

What does this PR do?

Instead of dynamically switching the forward methods using a decorator, we are exploring a new approach that performs this replacement statically within modeling_utils.py. This allows us to modify the forward methods at load time, which makes the kernels compatible with torch.compile.

There is also no need to check whether torch is compiling, since use_kernels is False by default and kernelize only switches forwards if the kernel is compatible with compile.
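For readers unfamiliar with the idea, here is a minimal sketch of swapping forward methods once at load time rather than deciding on every call. It is not the code from this PR: `RMSNorm`, `fast_forward`, `KERNEL_REGISTRY`, and `swap_forwards` are hypothetical names, and plain Python objects stand in for torch modules.

```python
import types


class RMSNorm:
    """Stand-in for a model layer (a real one would be a torch.nn.Module)."""

    def forward(self, x):
        # Placeholder for the reference implementation.
        return [v * 2 for v in x]


def fast_forward(self, x):
    """Hypothetical optimized kernel with the same signature as forward."""
    return [v + v for v in x]


# Hypothetical registry: layer class name -> replacement forward.
KERNEL_REGISTRY = {"RMSNorm": fast_forward}


def swap_forwards(layers, use_kernels=False):
    """Replace forward once, up front, and return how many layers were swapped.

    Nothing is swapped unless use_kernels is explicitly enabled, so the
    default path never needs a runtime "are we compiling?" check.
    """
    if not use_kernels:
        return 0
    swapped = 0
    for layer in layers:
        kernel = KERNEL_REGISTRY.get(type(layer).__name__)
        if kernel is not None:
            # Bind the kernel to this instance so later calls go straight to it.
            layer.forward = types.MethodType(kernel, layer)
            swapped += 1
    return swapped
```

Because the replacement happens before any call is traced, the call path seen at compile time contains no per-call branching on which implementation to use.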

This PR should be merged after huggingface/kernels#87.

HuggingFaceDocBuilderDev · 2025-05-19T15:27:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

gante

Sounds like a good plan to me 👍

(plz wait for Arthur's feedback before merging :P)

ArthurZucker

nice!

ArthurZucker · 2025-06-02T14:11:06Z

+            if torch.cuda.is_available():
+                kernelize(model, device=Device(type="cuda"))
+            # only cuda supported for now
+            else:
+                kernelize(model, device=Device(type="cpu"))


Why not pass model.device? Or device_map, since it holds the device for each layer?

MekkCyber · Jun 3, 2025

We can't use device_map, because when it's set to "auto", it only contains the indexes of the accelerators used. This means we would still have to rely on torch.cuda.is_available() to check whether CUDA is available.

But indeed, we can simply use model.device to get the type of device being used (e.g., "cuda" or "cpu").
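A rough sketch of what deriving the device type from model.device could look like. The `Device` dataclass below is only a stand-in for the `Device(type=...)` object in the diff, and `device_for` is a hypothetical helper, not part of transformers or kernels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Device:
    """Stand-in for the Device(type=...) object used in the diff."""

    type: str


def device_for(model_device) -> Device:
    """Build a Device from a torch.device-like object or a string such as "cuda:0".

    Assumption: the object either exposes a `.type` attribute (as torch.device
    does) or has a string form of "<type>" or "<type>:<index>".
    """
    device_type = getattr(model_device, "type", None)
    if device_type is None:
        # Fall back to parsing the string form and dropping any ":<index>" suffix.
        device_type = str(model_device).split(":", 1)[0]
    return Device(type=device_type)
```

With a helper like this, the if/else on torch.cuda.is_available() in the diff above could collapse into a single call along the lines of kernelize(model, device=device_for(model.device)); the exact form that was merged is not shown in this conversation.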

ArthurZucker

happy to merge

ArthurZucker · 2025-06-24T13:10:06Z

+    if past_key_values is not None and hasattr(past_key_values, "is_sliding"):
+        for i, is_sliding in enumerate(past_key_values.is_sliding):
+            if not is_sliding:
+                layer_idx = i
+                break


This is unrelated and should be reverted.

MekkCyber · Jun 24, 2025

Nope, it isn't! Using index was not compiling.

ArthurZucker · Jun 24, 2025

OK, can you try casting is_sliding to a torch.tensor?
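The loop in the diff above picks the index of the first layer that does not use sliding-window attention. A standalone sketch of that search follows; `first_full_attention_layer` and its `default` parameter are hypothetical, and the fallback value is an assumption since the diff does not show how layer_idx is initialized.

```python
def first_full_attention_layer(is_sliding, default=0):
    """Return the index of the first layer whose is_sliding flag is False.

    Uses an explicit loop over the flags, mirroring the diff, and falls back
    to `default` when every layer is sliding (or the list is empty).
    """
    for i, sliding in enumerate(is_sliding):
        if not sliding:
            return i
    return default
```

For example, first_full_attention_layer([True, True, False, True]) returns 2.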

ArthurZucker · 2025-06-24T13:10:57Z

Update kernel pin as well!

ArthurZucker · 2025-06-24T15:20:30Z

Thanks 🫡

MekkCyber requested review from ArthurZucker and gante May 20, 2025 14:15

gante approved these changes May 20, 2025

ArthurZucker reviewed Jun 2, 2025

MekkCyber force-pushed the add_kernelize branch from ea83565 to 5321efa Compare June 3, 2025 14:24

MekkCyber force-pushed the add_kernelize branch from 2429fda to ad2d4c2 Compare June 16, 2025 10:45

ArthurZucker approved these changes Jun 24, 2025

MekkCyber force-pushed the add_kernelize branch from 00a0d42 to 0043afe Compare June 24, 2025 13:31

MekkCyber added 9 commits June 24, 2025 13:36

fix (c4d5f71)
fix (143497f)
fix flow (b7b1d56)
remove non compiling path (f5ba19a)
change (004420b)
style (c8a34fb)
fix (52fe165)
update (98f2bf6)
update pin (13b6f98)

MekkCyber force-pushed the add_kernelize branch from 18c22e0 to 13b6f98 Compare June 24, 2025 13:36

MekkCyber and others added 2 commits June 24, 2025 16:43

Merge branch 'main' into add_kernelize (5c0fad6)
revert (275d648)

MekkCyber merged commit 08bf7f1 into main Jun 24, 2025
21 checks passed

MekkCyber deleted the add_kernelize branch June 24, 2025 15:38

sayakpaul mentioned this pull request Sep 23, 2025

What kernels should we integrate in Diffusers? huggingface/diffusers#12375

Open

Add kernelize to transformers #38205: MekkCyber merged 11 commits into main from add_kernelize