feat: match substring on known public aliases by shruthilayaj · Pull Request #104786 · getsentry/sentry

shruthilayaj · 2025-12-11T18:36:09Z

Additionally:
removes intersecting filter from the ranked request because I'm not quite certain how it works.
adds some tags to see how the intersecting filter might impact the number of attrs returned

…ic-aliases

src/sentry/api/endpoints/organization_trace_item_stats.py

                    continue
-
+                if attr.name in additional_substring_matches:
+                    additional_substring_matches.remove(attr.name)


cursor · 2025-12-11T18:44:41Z

src/sentry/api/endpoints/organization_trace_item_stats.py

+                    sentry_sdk.set_tag(
+                        "num_attrs_without_intersecting_filter",
+                        len(attrs_without_intersecting_filter.attributes),
+                    )


Bug: Debug code doubles RPC calls in production

The code block labeled as "debug code" makes an additional attribute_names_rpc call whenever a query_filter is present. This effectively doubles the RPC load for requests with query filters, increasing latency and backend load in production. The comment explicitly identifies this as debug code for checking the intersecting filter behavior, suggesting it may have been unintentionally left in the production codebase.

src/sentry/api/endpoints/organization_trace_item_stats.py

+                if attr.name in additional_substring_matches:
+                    # dedupe additional known attrs on the first offset
+                    if offset == 0:


sentry · 2025-12-11T18:53:09Z

src/sentry/api/endpoints/organization_trace_item_stats.py

+                if attr.name in additional_substring_matches:
+                    # dedupe additional known attrs on the first offset
+                    if offset == 0:
+                        additional_substring_matches.remove(attr.name)
+                    # we've already shown this attr in the first offset, so
+                    # don't show it again
+                    else:
+                        continue


Bug: The additional_substring_matches set is not reset across paginated calls, causing attributes that appear on pages other than the first to be incorrectly skipped.
_{Severity: MEDIUM | Confidence: High}

🔍 Detailed Analysis

The deduplication logic for substring-matched attributes has a flaw related to pagination. The additional_substring_matches set is initialized once. On the first page (offset == 0), matching attributes are correctly processed and removed from the set. However, on subsequent pages (offset > 0), any attribute that is still in the set (because it didn't appear on page 1) is incorrectly skipped via a continue statement. This leads to missing attributes in the final results if they don't appear on the first page of the paginated response.

💡 Suggested Fix

The logic for handling additional_substring_matches should be re-evaluated. One option is to not skip attributes on subsequent pages, but instead perform the removal and add them to the results, ensuring consistent behavior across all pages. Alternatively, collect all attributes from all pages first before performing this deduplication logic.

🤖 Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid. Location: src/sentry/api/endpoints/organization_trace_item_stats.py#L196-L203 Potential issue: The deduplication logic for substring-matched attributes has a flaw related to pagination. The `additional_substring_matches` set is initialized once. On the first page (`offset == 0`), matching attributes are correctly processed and removed from the set. However, on subsequent pages (`offset > 0`), any attribute that is still in the set (because it didn't appear on page 1) is incorrectly skipped via a `continue` statement. This leads to missing attributes in the final results if they don't appear on the first page of the paginated response.

_{Did we get this right? 👍 / 👎 to inform future reviews.}
_{Reference ID: 7207400}

cursor · 2025-12-11T18:54:51Z

src/sentry/api/endpoints/organization_trace_item_stats.py

+                    # we've already shown this attr in the first offset, so
+                    # don't show it again
+                    else:
+                        continue


Bug: Pagination skips known attributes appearing on later pages

The additional_substring_matches set is recreated fresh on each HTTP request (pagination call). When offset > 0, any RPC attribute whose internal_name exists in this set gets skipped via continue. Since the set is never modified for offset > 0, legitimate attributes that happen to match known public aliases will be excluded from page 2 onwards. For example, if sentry.op appears in the RPC response on page 2, it would be skipped because it's in additional_substring_matches.

Additional Locations (1)

src/sentry/api/endpoints/organization_trace_item_stats.py#L129-L136

Abdkhan14

Unblocking for testing, a lot of cursor comments to take a look at

codecov · 2025-12-11T19:02:44Z

Codecov Report

❌ Patch coverage is 94.73684% with 1 line in your changes missing coverage. Please review.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
...try/api/endpoints/organization_trace_item_stats.py	94.73%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           master   #104786   +/-   ##
========================================
  Coverage   80.47%    80.47%           
========================================
  Files        9363      9363           
  Lines      401849    401867   +18     
  Branches    25833     25833           
========================================
+ Hits       323389    323420   +31     
+ Misses      78018     78005   -13     
  Partials      442       442

shruthilayaj · 2025-12-11T19:27:32Z

Unblocking for testing, a lot of cursor comments to take a look at

we only care about known public aliases in the first offset, so comments about subsequent pagination doesn't apply

shruthilayaj added 3 commits December 11, 2025 13:33

feat: match substring on known public aliases

6147fcd

removed the wrong intersecting filter

e2a3130

Merge branch 'master' into shruthi/feat/match-substring-on-known-publ…

3cca8d6

…ic-aliases

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Dec 11, 2025

shruthilayaj marked this pull request as ready for review December 11, 2025 18:36

shruthilayaj requested review from a team as code owners December 11, 2025 18:36

sentry bot reviewed Dec 11, 2025

View reviewed changes

src/sentry/api/endpoints/organization_trace_item_stats.py Outdated

continue

if attr.name in additional_substring_matches:

additional_substring_matches.remove(attr.name)

This comment was marked as outdated.

Sign in to view

vercel bot deployed to Preview December 11, 2025 18:39 View deployment

better de-dupe

e86c982

cursor bot reviewed Dec 11, 2025

View reviewed changes

sentry bot reviewed Dec 11, 2025

View reviewed changes

src/sentry/api/endpoints/organization_trace_item_stats.py

Comment on lines +196 to +198

if attr.name in additional_substring_matches:

# dedupe additional known attrs on the first offset

if offset == 0:

This comment was marked as outdated.

Sign in to view

check allowlist

94b2833

vercel bot deployed to Preview December 11, 2025 18:47 View deployment

vercel bot deployed to Preview December 11, 2025 18:51 View deployment

sentry bot reviewed Dec 11, 2025

View reviewed changes

cursor bot reviewed Dec 11, 2025

View reviewed changes

Abdkhan14 approved these changes Dec 11, 2025

View reviewed changes

shruthilayaj merged commit 03d24fe into master Dec 11, 2025
67 checks passed

shruthilayaj deleted the shruthi/feat/match-substring-on-known-public-aliases branch December 11, 2025 19:27

github-actions bot locked and limited conversation to collaborators Dec 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: match substring on known public aliases#104786

feat: match substring on known public aliases#104786
shruthilayaj merged 5 commits intomasterfrom
shruthi/feat/match-substring-on-known-public-aliases

shruthilayaj commented Dec 11, 2025

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Dec 11, 2025

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

sentry bot Dec 11, 2025

Uh oh!

cursor bot Dec 11, 2025

Uh oh!

Abdkhan14 left a comment

Uh oh!

codecov bot commented Dec 11, 2025 •

edited

Loading

Uh oh!

shruthilayaj commented Dec 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

shruthilayaj commented Dec 11, 2025

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Dec 11, 2025

Choose a reason for hiding this comment

Bug: Debug code doubles RPC calls in production

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

sentry bot Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

cursor bot Dec 11, 2025

Choose a reason for hiding this comment

Bug: Pagination skips known attributes appearing on later pages

Uh oh!

Abdkhan14 left a comment

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

shruthilayaj commented Dec 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Dec 11, 2025 •

edited

Loading