feat: Send GenAI spans as V2 envelope items by alexander-alderman-webb · Pull Request #6079 · getsentry/sentry-python

alexander-alderman-webb · 2026-04-15T11:15:28Z

Description

Issues

Reminders

Please add tests to validate your changes, and lint your code using tox -e linters.
Add GH Issue ID & Linear ID (if applicable)
PR title should use conventional commit style (feat:, fix:, ref:, meta:)
For external contributors: CONTRIBUTING.md, Sentry SDK development docs, Discord community

github-actions · 2026-04-15T11:15:50Z

Semver Impact of This PR

🟡 Minor (new features)

📋 Changelog Preview

This is how your changes will appear in the changelog.
Entries from this PR are highlighted with a left border (blockquote style).

New Features ✨

(ci) Cancel in-progress PR workflows on new commit push by joshuarli in #5994

Send GenAI spans as V2 envelope items by alexander-alderman-webb in #6079

Add db.driver.name spans to database integrations by ericapisani in #6082

Bug Fixes 🐛

(google_genai) Redact binary data in inline_data and fix multi-part message extraction by ericapisani in #5977
(grpc) Add isolation_scope to async server interceptor by robinvd in #5940
(profiler) Stop nulling buffer on teardown by ericapisani in #6075

Internal Changes 🔧

(celery) Remove unused NoOpMgr from utils by sentrivana in #6078
(pydantic-ai) Remove dead Model.request patch by alexander-alderman-webb in #5956
(tests) Replace deprecated enable_tracingwith traces_sample_rate by sentrivana in #6077
Set explicit base-branch for codecov action by ericapisani in #5992

_{🤖 This preview updates automatically when you update the PR.}

github-actions · 2026-04-15T11:16:47Z

Codecov Results 📊

✅ 142 passed | Total: 142 | Pass Rate: 100% | Execution Time: 20.99s

📊 Comparison with Base Branch

Metric	Change
Total Tests	—
Passed Tests	—
Failed Tests	—
Skipped Tests	—

✨ No test changes detected

All tests are passing successfully.

✅ Patch coverage is 81.40%. Project has 14170 uncovered lines.
✅ Project coverage is 34.06%. Comparing base (base) to head (head).

Files with missing lines (2)

File	Patch %	Lines
`client.py`	58.97%	⚠️ 272 Missing and 88 partials
`consts.py`	99.43%	⚠️ 2 Missing

Coverage diff

@@            Coverage Diff             @@
##          main       #PR       +/-##
==========================================
+ Coverage    33.75%    34.06%    +0.31%
==========================================
  Files          190       190         —
  Lines        21365     21490      +125
  Branches      7068      7158       +90
==========================================
+ Hits          7211      7320      +109
- Misses       14154     14170       +16
- Partials       700       725       +25

Generated by Codecov Action

sentry-warden

Sorting key uses 'name' twice instead of 'name' and 'description' (tests/integrations/google_genai/test_google_genai.py:330)

The sorting lambda in test_generate_content_with_tools was changed from key=lambda t: (t.get("name", ""), t.get("description", "")) to key=lambda t: (t.get("name", ""), t.get("name", "")). This appears to be an accidental duplication error. While this may not break the test currently (since tool names are unique in this test), it defeats the purpose of the secondary sort key and could cause non-deterministic test ordering if tools have the same name but different descriptions.

Orphaned _meta after GenAI spans are split from transaction (tests/integrations/openai/test_openai.py:3758)

In test_openai_message_truncation, the test accesses event["_meta"]["spans"]["0"] to verify truncation metadata for the GenAI span. However, with the V2 envelope changes, GenAI spans are now split out of the transaction via _split_gen_ai_spans() in client.py and sent as separate envelope items. The _meta is generated during serialization (line 848) before the span split occurs (line 1104), leaving orphaned metadata that references a span no longer present in the transaction. The test may pass but validates stale metadata that doesn't correspond to any span in the actual transaction payload.

Identified by Warden find-bugs

sentry-warden

Test assertions check orphaned _meta data after GenAI spans are extracted (tests/integrations/openai/test_openai.py:3756)

After GenAI spans are sent as separate V2 envelope items, the transaction's spans array no longer contains them. However, the test at lines 3757-3760 still asserts against event["_meta"]["spans"]["0"] which contains stale metadata referring to a span that's no longer in the transaction. The _meta path references span index "0" but if all spans were GenAI spans, the transaction's spans array will be empty while _meta["spans"]["0"] still exists from before the split.

Identified by Warden find-bugs

sentry-warden

Test assumes first span has error status without validation (tests/integrations/langchain/test_langchain.py:940)

In test_span_status_error, the assertion assert spans[0]["status"] == "error" assumes the first span in the list is the one with the error. However, langchain integration can produce multiple spans (agent, chat, tool execution), and the order may not be deterministic. Unlike similar tests in other integrations (e.g., anthropic tests verify GEN_AI_SYSTEM attribute, pydantic_ai tests assert len(spans) == 1), this test doesn't validate it's examining the correct span. This could lead to flaky tests.

test_async_exception_handling patches wrong client (embeddings instead of completions) (tests/integrations/litellm/test_litellm.py:866)

In test_async_exception_handling, the mock patches client.embeddings._client._client but the test calls litellm.acompletion() which uses the completions endpoint. This causes the mock to not actually intercept the API call, making the test unreliable. The sync version test_exception_handling correctly patches client.completions._client._client.

Test accesses non-existent 'data' key instead of 'attributes' on capture_items span payload (tests/tracing/test_misc.py:628)

The test uses capture_items("span") which transforms span payloads to have an 'attributes' key (see conftest.py lines 361-367), but the test accesses spans[0]["data"] which doesn't exist. Other tests using capture_items("span") consistently access span["attributes"] (e.g., test_google_genai.py). This will cause the test to fail with a KeyError at runtime.

Identified by Warden find-bugs

sentry-warden

Test accesses orphaned _meta after gen_ai span is removed from transaction (tests/integrations/openai/test_openai.py:3758)

After gen_ai spans are split from the transaction and sent as V2 envelope items, the transaction's spans list no longer contains the gen_ai span. However, the test still accesses event["_meta"]["spans"]["0"]["data"] expecting truncation metadata. Since the span at index 0 has been moved to the V2 envelope, _meta["spans"]["0"] now references metadata for a span that no longer exists in the transaction's spans array. This test will likely fail or assert against orphaned/stale metadata.

Test expects V2 span envelope for non-gen_ai op span, will fail (tests/tracing/test_misc.py:618)

The test test_conversation_id_propagates_to_span_with_gen_ai_operation_name was modified to use capture_items("span") which captures V2 envelope span items. However, the span being created has op="http.client", and _split_gen_ai_spans() in client.py only splits spans where op starts with gen_ai.. This span will NOT be sent as a V2 envelope item - it will remain in the transaction event. The test will fail because spans list will be empty or not contain the expected span.

Identified by Warden find-bugs

This reverts commit 6c5c812.

sentry-warden

Test asserts against stale _meta path after GenAI spans are extracted to V2 envelope items (tests/integrations/langchain/test_langchain.py:1381)

Line 1381 asserts tx["_meta"]["spans"]["0"]["data"]["gen_ai.request.messages"][""]["len"] == 5 but with _experiments={"gen_ai_as_v2_spans": True} enabled (line 1313), the GenAI span is extracted from the transaction and sent as a separate envelope item. After extraction, the transaction's spans array no longer contains the GenAI span at index 0, making the _meta path invalid or pointing to a different span. This test will fail at runtime when the extracted spans leave behind mismatched _meta indices.

Identified by Warden find-bugs

sentry-warden

Async exception handling test mocks wrong client endpoint (tests/integrations/litellm/test_litellm.py:878)

In test_async_exception_handling, the mock patches client.embeddings._client._client on line 878-879, but the test calls litellm.acompletion() which uses the completions endpoint, not embeddings. The sync version test_exception_handling correctly mocks client.completions._client._client. This mismatch means the mock may not properly intercept the request, causing the test to potentially fail or not test what it intends.

Test checks wrong key 'attributes' instead of 'data' for transaction context (tests/integrations/openai_agents/test_openai_agents.py:3592)

The test test_no_conversation_id_when_not_provided checks transaction["contexts"]["trace"].get("attributes", {}) at lines 3592-3594, but all other tests in this file check transaction["contexts"]["trace"]["data"] for transaction span attributes (see lines 3389 and 3528). This inconsistency means the test could pass even if gen_ai.conversation.id is incorrectly present in the data key of the transaction context.

Identified by Warden find-bugs

feat: Send GenAI spans as V2 envelope items

2be94ca

sentry-warden bot reviewed Apr 15, 2026

View reviewed changes

Comment thread sentry_sdk/client.py Outdated

Comment thread sentry_sdk/client.py Outdated

sentry-warden bot reviewed Apr 15, 2026

View reviewed changes

Comment thread sentry_sdk/client.py Outdated

alexander-alderman-webb added 4 commits April 15, 2026 15:42

.

01f479a

.

80e6a10

.

0622cf4

.

7c75da1

sentry-warden bot reviewed Apr 15, 2026

View reviewed changes

Comment thread sentry_sdk/client.py

alexander-alderman-webb added 3 commits April 15, 2026 16:08

update

54a9b07

.

d1aa07c

.

117a6c9

sentry-warden bot reviewed Apr 15, 2026

View reviewed changes

Comment thread sentry_sdk/client.py Outdated

.

83c36b5

sentry-warden bot reviewed Apr 15, 2026

View reviewed changes

Comment thread sentry_sdk/client.py Outdated

openai tests

f71e0ce

sentry-warden bot reviewed Apr 16, 2026

View reviewed changes

Comment thread sentry_sdk/client.py Outdated

alexander-alderman-webb added 2 commits April 16, 2026 11:43

anthropic tests

1fab632

google-genai tests

f44316d

sentry-warden bot reviewed Apr 16, 2026

View reviewed changes

Comment thread tests/integrations/google_genai/test_google_genai.py

alexander-alderman-webb added 2 commits April 17, 2026 09:52

test litellm

ff9c5ec

test huggingface_hub

b92ae36

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/litellm/test_litellm.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

alexander-alderman-webb added 2 commits April 17, 2026 10:31

test langchain

907ca1d

test langgraph

b254297

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/huggingface_hub/test_huggingface_hub.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

accept any as sdk version

6f7a054

alexander-alderman-webb added 4 commits April 17, 2026 13:46

fix openai-agents tests

41e409d

fix common tests

8bf77f0

client handle None

7c3da4f

fix item_count

06c2a40

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/openai_agents/test_openai_agents.py Outdated

Comment thread tests/integrations/pydantic_ai/test_pydantic_ai.py

alexander-alderman-webb added 4 commits April 17, 2026 14:02

fix common tests

204b980

fix common tests

00733f9

common tests

a54cab4

tests

4b0c47b

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/openai_agents/test_openai_agents.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/openai_agents/test_openai_agents.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/openai_agents/test_openai_agents.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/langchain/test_langchain.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/tracing/test_misc.py Outdated

Comment thread tests/integrations/langchain/test_langchain.py

Comment thread tests/integrations/langchain/test_langchain.py

alexander-alderman-webb added 2 commits April 17, 2026 14:46

add experimental v2 option

6c5c812

push experiment

51a07ff

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

alexander-alderman-webb added 3 commits April 17, 2026 14:52

fix tests

bab7567

client changes

3e55795

simplify client logic

6d1d7ed

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread sentry_sdk/client.py

alexander-alderman-webb added 3 commits April 17, 2026 15:07

Revert "add experimental v2 option"

6bf4006

This reverts commit 6c5c812.

retry adding experimental option to tests

700e8a1

add experimental option to langgraph tests

9b20bd2

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/pydantic_ai/test_pydantic_ai.py Outdated

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/pydantic_ai/test_pydantic_ai.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Comment thread tests/integrations/pydantic_ai/test_pydantic_ai.py

sentry-warden bot reviewed Apr 17, 2026

View reviewed changes

Conversation

alexander-alderman-webb commented Apr 15, 2026

Description

Issues

Reminders

Uh oh!

github-actions bot commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Semver Impact of This PR

New Features ✨

Bug Fixes 🐛

Internal Changes 🔧

Uh oh!

github-actions bot commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Results 📊

📊 Comparison with Base Branch

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sentry-warden bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sentry-warden bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sentry-warden bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sentry-warden bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sentry-warden bot left a comment

Choose a reason for hiding this comment

Uh oh!

sentry-warden bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions bot commented Apr 15, 2026 •

edited

Loading

github-actions bot commented Apr 15, 2026 •

edited

Loading