fix(feedback): enforce a max message size from all sources by aliu39 · Pull Request #79326 · getsentry/sentry

aliu39 · 2024-10-17T21:54:09Z

Decided on a limit of 4096, well below the LLM request, Postgres, and Kafka size limits described in the ticket. Messages that are too large are truncated (or rejected, for the crash report modal); they skip spam detection and are automatically marked as spam.

Includes a small refactor of create_feedback.py, moving stuff around and commenting a bit. ~~Renamed auto_ignore_spam_feedback to set_feedback_ignored.~~
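
The behavior described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `MAX_MESSAGE_SIZE`, `SizeCheck`, and `enforce_max_size` are hypothetical names, and flagging an oversized message as spam without running the detector is one reading of the description.

```python
from dataclasses import dataclass

MAX_MESSAGE_SIZE = 4096  # limit named in the PR description


@dataclass
class SizeCheck:
    """Hypothetical result type, not part of the Sentry codebase."""

    message: str              # message to store, possibly truncated
    is_spam: bool             # True when the message was auto-marked as spam
    run_spam_detection: bool  # False when the detector should be skipped


def enforce_max_size(
    message: str, max_size: int = MAX_MESSAGE_SIZE, reject: bool = False
) -> SizeCheck:
    """Truncate (or reject) an oversized message and flag it as spam."""
    if len(message) <= max_size:
        # Within the limit: leave the message alone and let detection run.
        return SizeCheck(message, is_spam=False, run_spam_detection=True)
    if reject:
        # Assumed behavior for the crash report modal: refuse the submission.
        raise ValueError(f"message exceeds {max_size} characters")
    # Everywhere else: keep the first max_size characters, skip the detector.
    return SizeCheck(message[:max_size], is_spam=True, run_spam_detection=False)
```

Calling `enforce_max_size("a" * 5000)` returns a 4096-character message flagged as spam, while passing `reject=True` raises instead of truncating.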

aliu39 · 2024-10-17T23:19:29Z

+            "feedback.large_message",
            tags={
-                "is_spam": is_message_spam,
+                "pow2_size_bucket": 2 ** math.ceil(math.log2(len(feedback_message))),


Limits the granularity of this tag. I'd rather do this for viewing in Datadog, instead of logging each msg size.

Can't we just use a distribution metric type?

Actually I'll add a log too so we can see the org/project id.
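
For reference, the `pow2_size_bucket` expression in the diff rounds a message length up to the next power of two, so the tag takes only a handful of distinct values. A minimal sketch follows; the helper name and the guard for empty messages are assumptions added for this example (`math.log2(0)` raises `ValueError`).

```python
import math


def pow2_size_bucket(message: str) -> int:
    """Round len(message) up to the nearest power of two."""
    size = len(message)
    if size == 0:
        # Assumed guard: log2 is undefined for 0, so use a dedicated bucket.
        return 0
    return 2 ** math.ceil(math.log2(size))


# Lengths just over the limit share a bucket instead of each
# producing its own tag value.
for size in (3000, 4096, 4097, 10_000):
    print(size, "->", pow2_size_bucket("x" * size))
```

A length of 4096 stays in the 4096 bucket, while 4097 moves up to 8192. A distribution metric, as suggested in the thread, records the raw length and leaves the bucketing to the metrics backend.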

github-actions · 2024-10-17T23:36:43Z

This PR has a migration; here is the generated SQL for src/sentry/migrations/0778_userreport_comments_max_length.py

--
-- Alter field comments on userreport
--
-- (no-op)

wedamija

Migration lgtm

markstory · 2024-10-21T14:04:40Z

+register(
+    "feedback.message.max-size",
+    type=Int,
+    default=4096,


Do you need an option if the max-length is also in the schema?

The max length schema is for legacy feedbacks only, aka user reports. User reports are shimmed to feedback issues, but not vice versa. IMO it makes sense to have a separate option for new feedback.

In the long-term it makes more sense to enforce a maximum on the upstream envelopes, but I need more time to look into that. This just plugs all holes for now, resolving that sentry issue. We can gather some metrics and have a flexible limit for now, using the option. Wdyt?

> In the long-term it makes more sense to enforce a maximum on the upstream envelopes, but I need more time to look into that. This just plugs all holes for now, resolving that sentry issue.

That makes sense to me.
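
To illustrate the trade-off discussed in this thread, here is a minimal sketch of a limit that can be changed at runtime, in contrast to a fixed schema `max_length`. The `register`, `set_option`, and `get_option` helpers below are hypothetical stand-ins written for this example, not Sentry's options API; only the key `feedback.message.max-size` and the default of 4096 come from the diff above.

```python
from typing import Any

# Hypothetical in-memory option store (a stand-in, not sentry.options).
_DEFAULTS: dict[str, Any] = {}
_OVERRIDES: dict[str, Any] = {}


def register(key: str, default: Any) -> None:
    """Declare an option together with its default value."""
    _DEFAULTS[key] = default


def set_option(key: str, value: Any) -> None:
    """Override a registered option at runtime, with no migration needed."""
    if key not in _DEFAULTS:
        raise KeyError(f"unknown option: {key}")
    _OVERRIDES[key] = value


def get_option(key: str) -> Any:
    """Return the override if one is set, otherwise the registered default."""
    return _OVERRIDES.get(key, _DEFAULTS[key])


register("feedback.message.max-size", default=4096)


def is_too_large(message: str) -> bool:
    # The limit is read on every call, so changing it takes effect immediately.
    return len(message) > get_option("feedback.message.max-size")
```

With the default in place, a 5000-character message is too large; after `set_option("feedback.message.max-size", 8192)` the same message passes, which is the flexibility the option is meant to provide while metrics are gathered.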

JoshFerge

some feedback: this PR could have been broken up for easier reviewing, but overall looks good. 👍🏼

aliu39 · 2024-10-22T17:35:06Z

> some feedback: this PR could have been broken up for easier reviewing, but overall looks good. 👍🏼

Got it, thanks for lmk!

codecov · 2024-10-22T18:03:32Z

Codecov Report

Attention: Patch coverage is 80.48780% with 8 lines in your changes missing coverage. Please review.

✅ All tests successful. No failed tests found.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/sentry/feedback/usecases/create_feedback.py | 72.41% | 7 Missing and 1 partial ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #79326      +/-   ##
==========================================
+ Coverage   78.33%   78.35%   +0.01%     
==========================================
  Files        7125     7123       -2     
  Lines      314677   314900     +223     
  Branches    51431    51464      +33     
==========================================
+ Hits       246515   246739     +224     
+ Misses      61698    61690       -8     
- Partials     6464     6471       +7

Closes #76298. Closes [SENTRY-3B86](https://sentry.sentry.io/issues/5552524761/).

sentry · 2024-10-30T11:18:21Z

Suspect Issues

This pull request was deployed and Sentry observed the following issues:

‼️ VertexRequestFailed: Response 429: { sentry.tasks.store.save_event_feedback

Truncate large msgs in create_feedback and skip spam detection. +some refactoring

6429272

aliu39 changed the title ~~Truncate large msgs in create_feedback and skip spam detection. +some refactoring~~ fix(feedback): enforce a max message size from all sources Oct 17, 2024

github-actions Bot added the Scope: Backend label Oct 17, 2024

aliu39 added 3 commits October 17, 2024 15:35

Add create_feedback tests

852f56a

Add metric (with fancy tag)

dfae0ea

Rename size tag

16ca14d

vercel Bot deployed to Preview October 17, 2024 22:43 View deployment

aliu39 added 2 commits October 17, 2024 15:53

Truncate in save_userreport

54d97c8

Set length limit in crash report Form class

a325ab7

vercel Bot deployed to Preview October 17, 2024 22:57 View deployment

Add save_userreport and crash report form coverage

ab32aba

aliu39 marked this pull request as ready for review October 17, 2024 23:09

aliu39 requested review from a team as code owners October 17, 2024 23:09

aliu39 requested a review from JoshFerge October 17, 2024 23:10

vercel Bot deployed to Preview October 17, 2024 23:11 View deployment

aliu39 commented Oct 17, 2024

Fix typing

6c54428

vercel Bot deployed to Preview October 17, 2024 23:23 View deployment

Make migration

b517c67

aliu39 requested a review from a team as a code owner October 17, 2024 23:34

vercel Bot deployed to Preview October 17, 2024 23:37 View deployment

wedamija reviewed Oct 18, 2024

Move auto_ignore_feedback back to create_feedback to keep kafka logic in one file + rename/comment

9af5645

vercel Bot deployed to Preview October 18, 2024 18:26 View deployment

Revert name for auto_ignore_spam_feedbacks

561c38e

JoshFerge reviewed Oct 18, 2024

Comment thread src/sentry/ingest/userreport.py Outdated

vercel Bot deployed to Preview October 18, 2024 19:02 View deployment

Use metrics.distribution

bf7b53a

vercel Bot deployed to Preview October 18, 2024 19:06 View deployment

aliu39 requested a review from JoshFerge October 18, 2024 20:52

markstory reviewed Oct 21, 2024

Add logs and sentry msg

c558d46

aliu39 requested review from JoshFerge and markstory and removed request for JoshFerge October 22, 2024 17:24

JoshFerge approved these changes Oct 22, 2024

vercel Bot deployed to Preview October 22, 2024 17:27 View deployment

bruno-garcia mentioned this pull request Oct 22, 2024

SDK dev docs for User Feedback v2 getsentry/sentry-docs#11635

Closed

2 tasks

aliu39 merged commit 477e69f into master Oct 22, 2024

aliu39 deleted the aliu/limit-feedback-size branch October 22, 2024 20:21

aliu39 mentioned this pull request Oct 22, 2024

Enforce size limits in feedback envelope processors #79568

Closed

4 tasks

github-actions Bot locked and limited conversation to collaborators Nov 14, 2024
