Per-loss logits_scale_factor by jlamypoirier · Pull Request #516 · ServiceNow/Fast-LLM

jlamypoirier · 2026-05-15T22:16:38Z

Summary

Add a logits_scale_factor field to LanguageModelLossConfig, applied on top of the head's logits_scale_factor for that loss only. Every loss subclass picks it up automatically via self._logits_scale_factor, so no per-subclass changes are needed.

Primary use case: in RL losses, set to 1 / actor_temperature so new log-probabilities are computed at the same scale as the actor's stored old log-probabilities (importance ratio at step 0 is no longer offset by the actor's sampling temperature).
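
To make the temperature argument concrete, here is a minimal standalone sketch in plain Python (not Fast-LLM code). The logits, the temperature value and the `log_softmax` helper are all made up for illustration. It assumes the actor samples from softmax(logits / T) and that the trainer starts from the same weights, so both see the same logits at step 0.

```python
import math


def log_softmax(logits, scale=1.0):
    """Numerically stable log-softmax of `scale * logits` (illustrative helper)."""
    scaled = [scale * x for x in logits]
    m = max(scaled)
    log_z = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_z for x in scaled]


# Hypothetical values, chosen only for illustration.
logits = [2.0, 0.5, -1.0, 0.0]
actor_temperature = 0.7
token = 1  # index of the sampled token

# Assumption: the actor samples from softmax(logits / T) and stores this log-probability.
old_log_prob = log_softmax(logits, scale=1.0 / actor_temperature)[token]

# Trainer at step 0: same logits, without and with the extra per-loss scale.
new_unscaled = log_softmax(logits)[token]
new_scaled = log_softmax(logits, scale=1.0 / actor_temperature)[token]

# Importance ratio new / old, computed in log space.
ratio_unscaled = math.exp(new_unscaled - old_log_prob)
ratio_scaled = math.exp(new_scaled - old_log_prob)

print(f"ratio without scale: {ratio_unscaled:.3f}")
print(f"ratio with scale 1/T: {ratio_scaled:.3f}")
```

With the extra scale set to 1 / actor_temperature the step-0 ratio is 1. Without it, the ratio is offset by the temperature mismatch even though the weights are identical.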

Reimplemented from PR #502's temperature field on the GRPO config: the field is moved to the base config (so all losses can opt in), renamed to match the existing head field, and given multiplier semantics (extra scale, default 1.0, stacked on top of the head's scale).
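
The multiplier semantics can be sketched with a toy class hierarchy. This is an illustration only: every class name ending in `Sketch`, the `epsilon` field and the constructor signature are hypothetical and do not mirror Fast-LLM's actual classes. In particular, it assumes `self._logits_scale_factor` holds the product of the head's scale and the per-loss scale, which the description does not spell out.

```python
from dataclasses import dataclass


@dataclass
class LossConfigSketch:
    # Extra multiplier for this loss only; 1.0 leaves the head's scale unchanged.
    logits_scale_factor: float = 1.0


@dataclass
class GRPOLossConfigSketch(LossConfigSketch):
    # Hypothetical subclass-specific field; the base field is inherited as-is.
    epsilon: float = 0.2


class LossSketch:
    def __init__(self, config: LossConfigSketch, head_logits_scale_factor: float = 1.0):
        # Assumption: the effective scale is the product of the two factors.
        self._logits_scale_factor = head_logits_scale_factor * config.logits_scale_factor


class GRPOLossSketch(LossSketch):
    # No override needed: the subclass reads self._logits_scale_factor from the base.
    pass


default_loss = GRPOLossSketch(GRPOLossConfigSketch(), head_logits_scale_factor=0.5)
scaled_loss = GRPOLossSketch(
    GRPOLossConfigSketch(logits_scale_factor=2.0), head_logits_scale_factor=0.5
)
```

In this sketch `default_loss` keeps the head's scale of 0.5, while `scaled_loss` ends up with an effective scale of 1.0.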

Test plan

Existing loss tests still pass at default logits_scale_factor=1.0.
Manual: setting logits_scale_factor=2.0 on a GRPO loss config produces softmax outputs equivalent to doubling the head's logits_scale_factor.
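
The manual check above relies on a simple identity: applying the head's scale and then a per-loss scale of 2.0 gives the same softmax as doubling the head's scale. The sketch below illustrates this in plain Python. The logits and the head scale are made-up values, and `softmax` is an illustrative helper rather than the function used in the repository.

```python
import math


def softmax(logits, scale=1.0):
    """Numerically stable softmax of `scale * logits` (illustrative helper)."""
    scaled = [scale * x for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical values, chosen only for illustration.
logits = [1.5, -0.3, 0.8]
head_scale = 0.5
loss_scale = 2.0

# Apply the head's scale first, then the per-loss scale on top.
scaled_by_head = [head_scale * x for x in logits]
stacked = softmax(scaled_by_head, scale=loss_scale)

# Equivalent single scale: the head's scale doubled.
doubled_head = softmax(logits, scale=2.0 * head_scale)

# Default per-loss scale of 1.0 reproduces the head-only result.
default = softmax(scaled_by_head, scale=1.0)
head_only = softmax(logits, scale=head_scale)

max_diff = max(abs(a - b) for a, b in zip(stacked, doubled_head))
print(f"max difference between stacked and doubled head scale: {max_diff:.2e}")
```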

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

loss: per-loss logits_scale_factor stacked on top of the model's

18980d3

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

jlamypoirier merged commit 8c6b67c into main May 15, 2026
2 of 3 checks passed

jlamypoirier deleted the jlp_loss-logits-scale-factor branch May 15, 2026 22:28

jlamypoirier mentioned this pull request May 15, 2026

Add GSPO loss #502

Open

4 tasks

jlamypoirier merged 1 commit into main from jlp_loss-logits-scale-factor
