I downloaded the checkpoints from Hugging Face:

L0:
L1:
L2:
The L2 model is extremely unstable when running inference under FP16 AMP: it outputs NaNs. This does not happen with L0/L1. I inspected the activations and found extremely large values.
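For reference, a minimal sketch of the failing setup. The model loading and preprocessing are elided here (`build_l2_model` and `load_input` are placeholders, not the actual API); the relevant part is running the forward pass under `torch.autocast` with FP16:

```python
import torch

# Placeholder: build the L2 architecture and load the Hugging Face checkpoint.
model = build_l2_model()
model.eval().cuda()

# Placeholder: preprocess the input image into a CUDA tensor.
x = load_input("image_8.tif")

# Inference under FP16 AMP -- this is where the NaNs appear for L2.
with torch.no_grad(), torch.autocast(device_type="cuda", dtype=torch.float16):
    out = model(x)

print("NaNs in output:", torch.isnan(out).any().item())
```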
I also inspected XL0 and XL1. They are even worse: extremely large activation values appear throughout the model, not just in LiteMLA.
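In case it helps, this is roughly how I collected the activation statistics (a sketch; `model` and `x` are the objects from the snippet above). A forward hook on every module records the largest absolute output value, which makes potential FP16 overflow easy to spot since float16 saturates around 65504:

```python
import torch

stats = {}

def make_hook(name):
    # Record the max |activation| seen at this module's output.
    def hook(module, inputs, output):
        if torch.is_tensor(output):
            stats[name] = max(stats.get(name, 0.0),
                              output.detach().abs().max().item())
    return hook

handles = [m.register_forward_hook(make_hook(n))
           for n, m in model.named_modules()]

# Run once in FP32 (no autocast) to read the raw activation range.
with torch.no_grad():
    model(x)

for handle in handles:
    handle.remove()

# Print the worst offenders, e.g. the LiteMLA blocks in L2 / XL0 / XL1.
for name, value in sorted(stats.items(), key=lambda kv: -kv[1])[:10]:
    print(f"{name}: {value:.1f}")
```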
Input image: image_8.tif