Skip to content

Latent Interest Aggregates from Dwell Signals #16052

@dips7189

Description

@dips7189

Problem

Engagement-based ranking under-serves "silent" users who read content but do not
explicitly like, reply, or retweet. The system already logs and labels multiple
high-intent dwell signals (tweet detail, profile, link, fullscreen video), and
aggregates them in real time, but does not expose a persistent user preference
representation derived from those signals.

As a result, latent interest (interest without explicit engagement) is not
captured as a first-class signal.

Proposed solution

Introduce online, decayed latent-interest aggregates derived from existing dwell
labels, keyed by (userId, sourceAuthorId), to represent implicit long-term
interest in authors.

This change:

  • Defines AuthorLatentInterestEngagements using high-intent dwell signals
    (profile dwell, tweet detail dwell, long link dwell)
  • Adds authorLatentInterestRealTimeAggregates with exponential decay
  • Registers the aggregate group in ProdAggregateGroups
  • Ensures outputs are not filtered via aggregates_to_drop.txt
  • Mirrors existing user+author aggregate flag defaults (e.g. includeAnyFeature)

The change is additive and does not alter existing ranking or scoring behavior.
It provides a foundation for future use in scoring, mixing, or exploration to
better serve silent readers.

Future work (not included):

  • Consume this aggregate in scoring or mixing
  • Tune decay/weights via params
  • Extend to topic/entity latent interest

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions