
feat(sentry apps): Add context manager for sentry apps and impl for event webhooks #86136

Open · wants to merge 16 commits into master
Conversation

@Christinarlong Christinarlong commented Feb 28, 2025

Per https://www.notion.so/sentry/Sentry-App-SLOs-19f8b10e4b5d805dacb9f80beccd1c65?pvs=4#1a68b10e4b5d807a80fbde13daf0ff7b

We're separating out the preparation and sending of webhooks.

Preparation

  • process_resource_change_bound

Sending

  • send_webhooks
  • send_resource_change_webhook
  • send_and_save_webhook_request

With the current implementation we will receive the following metrics:

  • sentry_app.prepare_webhook.{outcome} w/ event_type: "process_resource_change.{sender}_{action}"

    • The outcome of this metric is independent of the sending tasks
    • sender would be Error, Group
    • action would be "created"
  • sentry_app.send_webhook.{outcome} w/ event_type: issue.created or error.created - potentially recorded 2 times (is this bad?)

    • We declare the context manager in send_resource_change_webhook and send_and_save_webhook_request (see the sketch just below this list).
    • Having the context manager declared twice could lead to some ambiguity about where something failed (?) But I don't think it's that major, since the logic in send_resource_change_webhook & send_webhooks is pretty minimal (also we have the logs to pinpoint where it failed).
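
Roughly, the placement looks like this. This is an illustrative sketch only: the .capture() call, the SEND_WEBHOOK member name, and the task signatures are assumptions inferred from the metric names above; only the constructor call and the lifecycle record_* methods appear verbatim in the diff.

# Sketch of where the two lifecycle scopes sit. Task bodies are elided;
# only the metric boundaries are shown.

def process_resource_change_bound(action, sender, instance_id, **kwargs):
    # Preparation: emits sentry_app.prepare_webhook.{outcome}
    with SentryAppInteractionEvent(
        operation_type=SentryAppInteractionType.PREPARE_WEBHOOK,
        event_type=f"process_resource_change.{sender}_{action}",
    ).capture() as lifecycle:  # .capture() is an assumption, not shown in the diff
        ...  # build the payload, then hand off to the sending tasks


def send_resource_change_webhook(installation_id, event, data, **kwargs):
    # Sending: emits sentry_app.send_webhook.{outcome}; the same scope is
    # declared again inside send_and_save_webhook_request, hence the
    # "potentially recorded 2 times" note above.
    with SentryAppInteractionEvent(
        operation_type=SentryAppInteractionType.SEND_WEBHOOK,  # assumed member name
        event_type=event,  # e.g. "issue.created" or "error.created"
    ).capture() as lifecycle:
        ...  # look up the installation and call send_webhooks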

@github-actions github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Feb 28, 2025
logger.info("process_resource_change.event_missing_event", extra=extra)
with SentryAppInteractionEvent(
    operation_type=SentryAppInteractionType.PREPARE_WEBHOOK,
    event_type=f"process_resource_change.{sender}_{action}",
Contributor Author:

We don't have an 'event' string at this point, so we're making something up

    lifecycle.record_failure(e)
    return None
except (ApiHostError, ApiTimeoutError, RequestException, ClientError) as e:
    lifecycle.record_halt(e)
Contributor Author:

ApiHostError & ApiTimeoutError are raised from send_and_save_webhook_request when the response is 503 or 504.

RequestExceptions occur when we get a Timeout or ConnectionError; I also wouldn't consider these our server's fault.

ClientErrors are anything <500 and also wouldn't be our fault, because it's the third party's responsibility to properly consume our requests.

We re-raise since the TASK_OPTIONS already have the retry logic specified (i.e. which errors to retry or ignore).
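
A minimal sketch of that classification (the exception names, record_halt/record_failure, and the re-raise come from the diff and the comment above; the surrounding function, the stub definitions, and the except ordering are illustrative, not the PR's actual code):

from requests import RequestException

# Stand-ins so the sketch is self-contained; in the PR these exception
# classes and send_and_save_webhook_request come from sentry's own modules.
class ApiHostError(Exception): ...
class ApiTimeoutError(Exception): ...
class ClientError(Exception): ...

def send_and_save_webhook_request(sentry_app, event, url):  # stub for the real sender
    ...

def deliver(lifecycle, sentry_app, event, url):
    # `lifecycle` is the object yielded by the SentryAppInteractionEvent scope.
    try:
        return send_and_save_webhook_request(sentry_app, event, url)
    except (ApiHostError, ApiTimeoutError, RequestException, ClientError) as e:
        # 503/504 from the receiver, timeouts/connection errors, or any <500
        # client error: the third party (or the network) is at fault, so we
        # record a halt and re-raise; TASK_OPTIONS decides which errors retry.
        lifecycle.record_halt(e)
        raise
    except Exception as e:
        # Anything else is on us: record a failure instead.
        lifecycle.record_failure(e)
        return None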

Contributor Author:

JFYI, we will not encounter this block for NOT published apps (e.g. internal & unpublished), as we only propagate errors up from send_and_save_webhook_request for published apps.


codecov bot commented Feb 28, 2025

Codecov Report

Attention: Patch coverage is 94.82759% with 12 lines in your changes missing coverage. Please review.

✅ All tests successful. No failed tests found.

Files with missing lines                       Patch %   Lines
src/sentry/sentry_apps/tasks/sentry_apps.py    89.65%    6 Missing ⚠️
src/sentry/utils/sentry_apps/webhooks.py       82.85%    6 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           master   #86136       +/-   ##
===========================================
+ Coverage   45.90%   87.92%   +42.01%     
===========================================
  Files        9709     9721       +12     
  Lines      551059   551742      +683     
  Branches    21527    21401      -126     
===========================================
+ Hits       252943   485095   +232152     
+ Misses     297730    66263   -231467     
+ Partials      386      384        -2     

Comment on lines +103 to +104
MockResponseInstance = MockResponse({}, b"{}", "", True, 200, raiseStatusFalse, None)
MockResponse404 = MockResponse({}, b'{"bruh": "bruhhhhhhh"}', "", False, 404, raiseException, None)
Contributor Author:

These need to be bytes for json.loads()
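
For reference, json.loads parses bytes directly (Python 3.6+), so the byte-string bodies round-trip as expected:

import json

# json.loads accepts bytes as well as str, so the mock response bodies
# above parse without an explicit decode step.
assert json.loads(b"{}") == {}
assert json.loads(b'{"bruh": "bruhhhhhhh"}') == {"bruh": "bruhhhhhhh"}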

@Christinarlong Christinarlong marked this pull request as ready for review March 4, 2025 18:43
@Christinarlong Christinarlong requested review from a team as code owners March 4, 2025 18:43
lifecycle.record_failure(
    failure_reason="process_resource_change.event_missing_event", extra=extra
)
return
Member:

Should these be raised as exceptions and caught inside process_resource_change_bound? Then they would automatically be recorded as failures, and you can pick which exceptions to retry the task with.

Contributor Author:

🤷‍♀️ ehhhh, they could? I think the benefit of the current code is that it's clear if and why we're recording X outcome or retrying the task. Having process_resource_change_bound do the exception handling would make my life easier since I could just raise an error, but I think this way's clearer, so ye

Member:

make your life easier c:

Member:

tl;dr I think it's better to manually record a halt than a failure

@@ -110,3 +112,12 @@ def assert_middleware_metrics(middleware_calls):
    assert end1.args[0] == EventLifecycleOutcome.SUCCESS
    assert start2.args[0] == EventLifecycleOutcome.STARTED
    assert end2.args[0] == EventLifecycleOutcome.SUCCESS


def assert_count_of_metric(mock_record, outcome, outcome_count):
Member:

For asserting failure, it might help to be more specific by checking the message that the lifecycle terminates with, like here:

def assert_failure_metric(mock_record, error_msg):
    (event_failures,) = (
        call for call in mock_record.mock_calls if call.args[0] == EventLifecycleOutcome.FAILURE
    )
    if isinstance(error_msg, Exception):
        assert isinstance(event_failures.args[1], type(error_msg))
    else:
        assert event_failures.args[1] == error_msg

Currently, when you assert a single failure metric, you're only asserting on the count and not on the exact line of code that failed, which you could pin down by checking which exception or message it failed with.
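
For illustration, using both helpers together in a test might look like this (a sketch only; how mock_record is patched in and the exact failure-reason string follow the rest of this PR's tests):

# Hypothetical assertion block: exactly one FAILURE outcome was recorded,
# and it carried the expected failure reason.
assert_count_of_metric(
    mock_record, outcome=EventLifecycleOutcome.FAILURE, outcome_count=1
)
assert_failure_metric(
    mock_record, "process_resource_change.event_missing_event"
)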

Contributor Author:

We can do both; currently I do assert_failure_metric alongside assert_count_of_metric for most of the tests, but yeah, I can make that consistent and do both for all.
