feat(profiling): Add pipeline workflow to perfetto profiling #5932
markushi wants to merge 2 commits into feat/markushi/perfetto-profiling-support
Conversation
Dav1dde left a comment:
Thanks, this addresses my biggest concern from the other PR.
I think there is still some stricter typing we could do, and maybe some of the filtering logic to de-duplicate, but that is getting into maybe-not-worth-it territory at this time. I tried some things locally and quickly realized it would need more changes.
So I ended up just leaving some nits.
Regarding the order of PRs: happy to merge them separately into master, as each of the PRs is functional standalone.
We should also give other reviewers some time to take a look!
```rust
    }
}

#[derive(Clone, Debug, Serialize)]
```
Looks like this is only Serialize for a test.
It's also used for the kafka message. But granted, it feels a bit hacky, as it also relies on flattening being enabled in order to produce top-level kafka message attributes. Happy to decouple this.
Since it's only two fields, I would prefer the decoupling and specify the kafka message separately.
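A minimal sketch of what that decoupling could look like. The field names here are assumptions for illustration, not the PR's actual fields:

```rust
use serde::Serialize;

// Internal type: no longer needs to derive `Serialize` itself.
#[derive(Clone, Debug)]
struct ProfileChunk {
    organization_id: u64,
    project_id: u64,
}

// Dedicated Kafka message with explicit top-level attributes,
// removing the reliance on serde flattening.
#[derive(Debug, Serialize)]
struct ProfileChunkKafkaMessage {
    organization_id: u64,
    project_id: u64,
}

impl From<&ProfileChunk> for ProfileChunkKafkaMessage {
    fn from(chunk: &ProfileChunk) -> Self {
        Self {
            organization_id: chunk.organization_id,
            project_id: chunk.project_id,
        }
    }
}
```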
Cursor Bugbot has reviewed your changes and found 2 potential issues.
Reviewed by Cursor Bugbot for commit 4108a6e.
```rust
        }
        q
    }
}
```
Non-aggregated quantities may cause incorrect rate limiting
Medium Severity
ExpandedProfileChunks::quantities() uses q.extend(), which appends without merging duplicate categories (e.g., three backend chunks produce [(ProfileChunk, 1), (ProfileChunk, 1), (ProfileChunk, 1)]). The equivalent SerializedProfileChunks::quantities() aggregates into [(ProfileChunk, 3)]. The CountRateLimited enforce loop calls try_consume once per entry; with Redis rate limiting, calling try_consume(1) three times versus try_consume(3) once yields different results, since quota is actually consumed on each call. The [T]::quantities() implementation in counted.rs uses a BTreeMap for proper aggregation and could be followed here.
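A self-contained sketch of the aggregation Bugbot describes, with a stand-in category enum instead of Relay's actual DataCategory:

```rust
use std::collections::BTreeMap;

// Stand-in for Relay's DataCategory, for illustration only.
#[derive(Clone, Copy, Debug, PartialEq, Eq, PartialOrd, Ord)]
enum DataCategory {
    ProfileChunk,
}

/// Aggregates per-item quantities by category, mirroring the BTreeMap
/// approach of `[T]::quantities()` in counted.rs: three chunks yield
/// a single `(ProfileChunk, 3)` entry instead of three `(_, 1)` entries.
fn aggregate_quantities(
    items: impl IntoIterator<Item = (DataCategory, usize)>,
) -> Vec<(DataCategory, usize)> {
    let mut map = BTreeMap::new();
    for (category, quantity) in items {
        *map.entry(category).or_insert(0) += quantity;
    }
    map.into_iter().collect()
}

fn main() {
    let per_chunk = vec![
        (DataCategory::ProfileChunk, 1),
        (DataCategory::ProfileChunk, 1),
        (DataCategory::ProfileChunk, 1),
    ];
    // A single entry lets the enforce loop call try_consume(3) once
    // instead of try_consume(1) three times.
    assert_eq!(
        aggregate_quantities(per_chunk),
        vec![(DataCategory::ProfileChunk, 3)]
    );
}
```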
@Dav1dde out of scope for this PR, but what if we redefine Quantities as a BTreeMap of category -> usize? I encountered this question a lot when refactoring the transaction processor. That is, who does the deduplication, the caller of quantities or quantities itself?
We should use an enumap!
I personally like the small vec overall (usability, and it is actually small and on the stack); there is also no need to de-duplicate as long as it is somewhat bounded. Then again, a *Map feels slightly more appropriate.
I feel like we should make it a newtype, give it a nice API, and then the backing structure can be whatever is easiest (probably a BTreeMap). Seems like a good improvement!
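A rough sketch of that newtype idea, again with an illustrative category enum standing in for Relay's DataCategory:

```rust
use std::collections::BTreeMap;

// Stand-in for Relay's DataCategory, for illustration only.
#[derive(Clone, Copy, Debug, PartialEq, Eq, PartialOrd, Ord)]
enum DataCategory {
    ProfileChunk,
    Transaction,
}

/// Newtype over a BTreeMap so callers get aggregation for free;
/// the backing structure stays an implementation detail.
#[derive(Debug, Default)]
struct Quantities(BTreeMap<DataCategory, usize>);

impl Quantities {
    /// Adds `quantity` to the running total for `category`,
    /// de-duplicating at insertion time.
    fn add(&mut self, category: DataCategory, quantity: usize) {
        *self.0.entry(category).or_insert(0) += quantity;
    }

    /// Iterates aggregated (category, total) pairs in category order.
    fn iter(&self) -> impl Iterator<Item = (DataCategory, usize)> + '_ {
        self.0.iter().map(|(&c, &q)| (c, q))
    }
}
```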
```rust
} else {
    item.set_payload(ContentType::Json, chunk.payload);
}
item
```
Reconstructed items missing platform header in serialization
Low Severity
When Expanded profile chunks are serialized back to an envelope in serialize_envelope, the reconstructed Item never has its platform set. The original items had a platform header (now required for perfetto items), but it is lost during expansion. If this path is ever used for forwarding, the downstream relay would reject perfetto items, because profile_type() returns None, which this PR explicitly makes an error condition.
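A self-contained sketch of a possible fix, using stand-in types; the chunk's `platform` field and the `set_platform` setter are assumptions here, not Relay's confirmed API:

```rust
// Stand-ins for Relay's envelope types, for illustration only.
#[derive(Clone, Debug)]
struct ExpandedChunk {
    payload: Vec<u8>,
    platform: Option<String>,
}

#[derive(Debug, Default)]
struct Item {
    payload: Vec<u8>,
    platform: Option<String>,
}

impl Item {
    fn set_payload(&mut self, payload: Vec<u8>) {
        self.payload = payload;
    }

    fn set_platform(&mut self, platform: String) {
        self.platform = Some(platform);
    }
}

/// Rebuilds an envelope item from an expanded chunk, re-attaching the
/// platform header so a downstream relay's profile_type() lookup does
/// not come back empty for perfetto items.
fn reconstruct_item(chunk: ExpandedChunk) -> Item {
    let mut item = Item::default();
    item.set_payload(chunk.payload);
    if let Some(platform) = chunk.platform {
        item.set_platform(platform);
    }
    item
}
```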
```rust
let items = e
    .chunks
    .into_iter()
    .map(|chunk| {
        let mut item = Item::new(ItemType::ProfileChunk);
        if let Some(raw_profile) = chunk.raw_profile {
            let meta_length = chunk.payload.len() as u32;
            let mut compound = bytes::BytesMut::with_capacity(
                chunk.payload.len() + raw_profile.payload.len(),
            );
            compound.extend_from_slice(&chunk.payload);
            compound.extend_from_slice(&raw_profile.payload);
            item.set_payload(raw_profile.content_type, compound.freeze());
            item.set_meta_length(meta_length);
        } else {
            item.set_payload(ContentType::Json, chunk.payload);
        }
        item
    })
    .collect();
Envelope::from_parts(e.headers, items)
})),
```
Wouldn't it be a bug to call serialize_envelope on Self::Expanded? Expanded is only for processing relays, which call forward_store, so it seems like we could do the reverse of the below and raise an internal_error in this case?
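A minimal sketch of that suggestion, with stand-in types for the PR's enum and error handling:

```rust
// Stand-ins for the PR's actual types, for illustration only.
#[derive(Debug)]
enum ProfileChunks {
    Serialized(Vec<u8>),
    Expanded(Vec<Vec<u8>>),
}

#[derive(Debug)]
struct InternalError(&'static str);

impl ProfileChunks {
    /// Expanded chunks only exist in processing relays, which forward
    /// via the store; re-serializing them would indicate a logic bug,
    /// so surface an internal error instead of reconstructing items.
    fn serialize_envelope(self) -> Result<Vec<u8>, InternalError> {
        match self {
            Self::Serialized(bytes) => Ok(bytes),
            Self::Expanded(_) => Err(InternalError(
                "expanded profile chunks must be forwarded to the store",
            )),
        }
    }
}
```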
```rust
    }),
    quantities,
})
} else {
```
nit: This if-else expression is long enough to put the two branches in separate functions IMO.
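Illustrating the nit with hypothetical helper names (not the PR's actual code):

```rust
// Hypothetical types, for illustration only.
struct Context;
struct Output;

fn expand_for_processing(_ctx: &Context) -> Output {
    // ... long processing-relay branch ...
    Output
}

fn keep_serialized(_ctx: &Context) -> Output {
    // ... long pass-through branch ...
    Output
}

// The top-level if-else stays short and scannable.
fn process(ctx: &Context, processing_enabled: bool) -> Output {
    if processing_enabled {
        expand_for_processing(ctx)
    } else {
        keep_serialized(ctx)
    }
}
```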


As a follow-up to https://github.com/getsentry/relay/pull/5659/changes/BASE..db555e68ad45debd66f46d28e84aa6952b7498b7#r3167376271, this introduces a pipeline workflow instead of doing everything in one place.