
Bfloat16 quantization#1229

Merged
mrariden merged 3 commits into main from bfloat16_quantization
Jun 9, 2025

Conversation


mrariden (Collaborator) commented Jun 6, 2025

CPSAM's transformer weights are stored at unnecessarily high precision (32-bit); 16-bit is sufficient for prediction. Switching to 16-bit brings multiple benefits, not least freeing up RAM for loading more images during evaluation and reducing OOM issues.

This PR:

  • Sets the default dtype to bfloat16; this can be changed at CellposeModel instantiation via the use_bfloat16 flag.
  • Reduces model size from 1.2GB to 580MB
  • Reduces runtime by ~20%
  • Retains all segmentation accuracy
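The size and accuracy numbers above follow from the bfloat16 format itself: it keeps float32's full 8-bit exponent (so dynamic range is unchanged) and simply drops the low 16 mantissa bits, halving storage per weight. A stdlib-only sketch of that truncation, for illustration (Cellpose would cast weights via framework dtypes rather than bit-twiddling like this):

```python
import struct

def float32_to_bfloat16_bits(x: float) -> int:
    """Truncate a float32 to its top 16 bits (bfloat16), round-to-nearest-even."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    # Bias the discarded low 16 bits so the kept bits round to nearest even.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF

def bfloat16_bits_to_float32(b: int) -> float:
    """Re-expand bfloat16 bits to float32 by zero-filling the low mantissa bits."""
    (x,) = struct.unpack("<f", struct.pack("<I", b << 16))
    return x

# Same exponent width as float32, so range is preserved; only ~3 decimal
# digits of mantissa precision remain -- enough for inference, and storage
# halves (cf. the 1.2 GB -> 580 MB model-size reduction in this PR).
w = 0.123456789
w16 = bfloat16_bits_to_float32(float32_to_bfloat16_bits(w))
```

Values whose mantissa already fits in 7 bits (1.0, -2.5, powers of two) round-trip exactly; everything else lands within about 0.4% relative error.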

Testing:

  • Notebook verification for API
  • GUI testing
  • CLI testing

mrariden self-assigned this Jun 6, 2025
mrariden merged commit 7e194bc into main Jun 9, 2025
7 checks passed
