Preliminary work to integrate Performance Monitoring #276

aramperes · 2020-10-03T00:44:01Z

Hello! I'm the lead developer at Wavy Labs Inc. We're integrating Rust more and more into our stack and we've been using Sentry for over a year. We'd love to start integrating Performance Monitoring, but it seems this SDK doesn't support it yet.

I'd like to kickstart the discussion on this and contribute some developer time for this. I've been reading the Guidelines for Tracing Support and peeking into the Python SDK for reference. I've started some preliminary work on adding the new types (Span and Transaction), but I'll need some guidance on integrating with Hub. Feel free to start reviewing at any time!

Swatinem

I really appreciate you taking first steps here! Tracing is on our radar, we just wanted to gather more experience with our other SDKs before integrating it into Rust together will all the learnings from the other SDKs. That being said, I haven’t really looked at the implementation from other SDKs, or how it would best fit into Rust.

Nevertheless, looking at your draft and the Guidelines, I think that the Transaction type should rather live in sentry-core, as it should be able to "submit itself" to the client/transport. So it needs to have at least an Arc<Client> on itself.

Instead of "extending" the existing Event type, it should rather use composition, as in:

struct Transaction {
event: Event,
span: Span,
client: Arc<Client>,
}

It can then flatten all the event/span properties when serializing.
It can also have a DerefMut impl for both Event and Span.

I would also give it its own EnvelopeItem type.

Roping in @HazAT and @rhcarvalho as they should have more experience with the tracing API.

sentry-types/src/protocol/v7.rs

aramperes · 2020-10-05T21:20:01Z

Glad to help! I understand that this may not have been on your team's roadmap in the immediate future. Nevertheless we'll try to contribute as much as possible.

Since EnvelopeItem lives in sentry-types, and Client lives in sentry-core, I don't think it's possible to include a client reference in Transaction.

This seems more like the sort of thing that could use a trait and Hub::current, e.g.

// In `sentry-types`

#[derive(Clone, Debug, Serialize)]
pub struct Transaction<'a> {
    event: Event<'a>,
    span: Span,
}

// In `sentry-core`

// Trait in case other types of envelopes can 'submit themselves'
pub trait Sendable { // [or another name]
	pub fn send();
}

impl<'a> Sendable for Transaction<'a> {
    fn send(&self) {
        Hub::current().send_transaction(&self)
    }
}

Evidently, apps would have to import the Sendable trait to be able to dispatch transactions like this.

I'm a bit flakey on the terminology (finishing, dispatching, sending, etc.) so feel free to correct me.

* Added EnvelopeItem entry for Transaction * Revert changes to Event type and add Transaction type as a combination of Span and Event

Swatinem

This looks much better now.

Ah yes, I moved all the Envelope-related things into sentry-types. We had a similar case with Session, and decided to split the POD-bits into the types crate, and a small wrapper into core. See #251
I think something similar can be done here as well.

This PR on its own looks very promising already, and I would be happy to land this incrementally, meaning landing the bits from types even if they are not yet fully usable.

aramperes · 2020-10-06T13:13:02Z

Sounds good!

How about I finish this PR by:

implementing TraceContext
documenting the new structs and fields
adding unit tests for serializing Transaction and Span to make sure everything is proper
reverting the changes in Scope to be done in a later PR

Swatinem · 2020-10-06T13:35:28Z

Yes that sounds very good

rhcarvalho · 2020-10-06T14:45:07Z

FWIW we've updated the developer docs to include the tracesSampler option that gives more control over the sampling decision. Not a must have for a first pass, but something to keep in mind.

https://develop.sentry.dev/sdk/unified-api/tracing#sdk-configuration

(might be easier to read the diff -- getsentry/develop#165)

aramperes · 2020-10-06T16:08:35Z

@rhcarvalho Thanks for the update! I went through it and it's much clearer than before.

Just to double-check that this is essentially what the JSON structure of the envelope looks like for a Transcation event:

// Envelope Headers
{"event_id": "...", "sent_at": "..."}

// Item Headers
{"type": "transaction", ...}

// Item payload 
{
    // Transaction struct
    "type": "transaction",
    "event_id": "...",
    "sdk": "...",
    "platform": "...",
    "transaction": "...",
    "tags": { ... },
    "timestamp": "...",
    "start_timestamp": "...",
    "spans": [
        { // Span struct
            "trace_id": "...",
            "span_id": "...",
            "parent_span_id": "...",
            "op": "...",
            "description": "...",
            "timestamp": "...",
            "start_timestamp": "...",
            "same_process_as_parent": false,
            "status": "...", // Optional
            "tags": { ... }, // Optional
            "data": { ... }, // Optional
        },
        // ...
    ],
    "contexts": {
        "trace": { // TraceContext struct
            "trace_id": "...",
            "span_id": "...",
            "parent_span_id": "...",
            "op": "...",
            "description": "...",
            "status": "..." // Optional
        }
    }
}

This makes the Transaction struct a bit more complex than inheriting Span + Event. It's essentially a Span, but some fields are at the top level, and others are in TraceContext. Others are omitted (data, same_process_as_parent). It also has a type field always set to "transaction", plus some Event fields (event_id, sdk, platform, transaction). The meaning of timestamp is also swapped from Event: it becomes the end timestamp, and start_timestamp replaces the purpose of the timestamp field.

Makes me think Transaction, Event, Span, and TraceContext should just be their own struct instead of merging types together.

Another approach, with inheriting Event (although initialising this type could be a bit awkward):

pub struct Transaction<'a> {
   #[serde(flatten)]
   event: Event<'a>,

   type: Option<String>, // Always `"transaction"` -- possibly an enum and move into `Event`
   spans: Vec<Span>,
   start_timestamp: DateTime<Utc>,
}

In that case, there's a complication because Event#timestamp is set when the event is created, whereas it should be undefined until the Transaction is completed. This would make timestamp an Option property. Could also have a field end_timestamp instead of start_timestamp, and use timestamp as start_timestamp, but serializing would have to handle this case.

Swatinem · 2020-10-07T07:16:35Z

It also has a type field always set to "transaction"

I’m actually not sure if that is supposed to live on the transaction type itself, or if its meant to be just the envelope header.

aramperes · 2020-10-07T14:27:51Z

I'll exclude it, but it does appear to be sent in other SDKs. It is used to identity the type of event in SDKs that don't have strict typing, e.g. in Python 'Event' is just a Dict, so this field is used to send the envelope differently. It wouldn't be necessary in Rust since Transaction has a different struct.

https://github.com/getsentry/sentry-python/blob/91c7a8fcb8e94b37e7dba74e66f7d0992f3cf145/sentry_sdk/client.py#L337-L352

It's possible that it gets ignored on the server-side however.

* Reverted change in Scope. * Added fields for Transaction, Span, and TraceContext with documentation

Swatinem

after talking to @rhcarvalho, it looks like we need a type on the transaction object after all (at least when its serialized to json) I will research this myself to know how this should look like

sentry-types/src/protocol/v7.rs

* Removed unnecessary Span into_owned() method. * Renamed Transaction 'transaction' to 'name'.

Swatinem

Sorry for the delay here. Please rebase this on master again, the clippy error should be gone then.

This looks good from my perspective, I will ask @rhcarvalho for another review however, as he should know all the data structures related to tracing a bit better than me.

Swatinem · 2020-10-17T10:43:58Z

Thanks a lot for doing this!

I’m wondering if we should maybe gate all of this behind an unstable feature flag, to indicate that we would want to iterate on these types a bit more before we consider them to be "stable" before we put them in a release.

* master: (59 commits) fix: Correctly apply environment from env (#293) fix: Make Rust 1.48 clippy happy (#294) docs: Document integrations and the Hub better (#291) ref: Remove deprecated error-chain and failure crates (#290) release: 0.21.0 meta: Update Changelog feat: End sessions with explicit status (#289) fix: Scope transaction name can be overriden in sentry-actix (#287) fix: sentry-actix should not capture client errors (#286) fix: Clean up sentry-actix toml (#285) ref: Remove empty integrations (#283) feat: Add support for actix-web 3 (#282) feat: Preliminary work to integrate Performance Monitoring (#276) ref: Introduce a SessionFlusher (#279) fix: Set a default environment based on debug_assertions (#280) ref: Rearchitect the log and slog Integrations (#268) ref: Deprecate public fields on Integrations (#267) ci: Make testfast actually fast (#273) fix: Update surf and unbreak CI (#274) ci: Use smarter cache action (#272) ...

Preliminary work to integrate Performance Monitoring in Rust SDK

ad62e49

aramperes force-pushed the wavy/sentry-tracing branch from be8043f to ad62e49 Compare October 3, 2020 00:45

Swatinem reviewed Oct 5, 2020

View reviewed changes

sentry-types/src/protocol/v7.rs Outdated Show resolved Hide resolved

sentry-types/src/protocol/v7.rs Show resolved Hide resolved

tracing: Changes based on review

5d5383e

* Added EnvelopeItem entry for Transaction * Revert changes to Event type and add Transaction type as a combination of Span and Event

Swatinem reviewed Oct 6, 2020

View reviewed changes

aramperes added 5 commits October 7, 2020 15:32

Changes based on discussion

8f2e69c

* Reverted change in Scope. * Added fields for Transaction, Span, and TraceContext with documentation

Lint fixes

2553eb0

Add some missing documentation in TraceContext

71d73a8

Add test for Envelope/Transaction serialization

b84875f

Lint fix

8ba686a

aramperes changed the title ~~Preliminary work to integrate Performance Monitoring in Rust SDK~~ Preliminary work to integrate Performance Monitoring Oct 7, 2020

aramperes marked this pull request as ready for review October 7, 2020 20:22

Swatinem reviewed Oct 8, 2020

View reviewed changes

sentry-types/src/protocol/v7.rs Outdated Show resolved Hide resolved

sentry-types/src/protocol/v7.rs Outdated Show resolved Hide resolved

Clean-up types.

5d55206

* Removed unnecessary Span into_owned() method. * Renamed Transaction 'transaction' to 'name'.

Swatinem approved these changes Oct 16, 2020

View reviewed changes

Swatinem requested a review from rhcarvalho October 16, 2020 11:11

aramperes and others added 2 commits October 16, 2020 08:57

Merge remote-tracking branch 'upstream/master' into wavy/sentry-tracing

57d32dc

Merge branch 'master' into wavy/sentry-tracing

6c891ff

Swatinem merged commit c8ef5fe into getsentry:master Oct 17, 2020

aramperes deleted the wavy/sentry-tracing branch October 17, 2020 17:38

Swatinem mentioned this pull request Oct 19, 2020

tracing integration #180

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preliminary work to integrate Performance Monitoring #276

Preliminary work to integrate Performance Monitoring #276

aramperes commented Oct 3, 2020 •

edited

Loading

Swatinem left a comment

aramperes commented Oct 5, 2020 •

edited

Loading

Swatinem left a comment

aramperes commented Oct 6, 2020 •

edited

Loading

Swatinem commented Oct 6, 2020

rhcarvalho commented Oct 6, 2020

aramperes commented Oct 6, 2020 •

edited

Loading

Swatinem commented Oct 7, 2020

aramperes commented Oct 7, 2020

Swatinem left a comment

Swatinem left a comment

Swatinem commented Oct 17, 2020

Preliminary work to integrate Performance Monitoring #276

Preliminary work to integrate Performance Monitoring #276

Conversation

aramperes commented Oct 3, 2020 • edited Loading

Swatinem left a comment

Choose a reason for hiding this comment

aramperes commented Oct 5, 2020 • edited Loading

Swatinem left a comment

Choose a reason for hiding this comment

aramperes commented Oct 6, 2020 • edited Loading

Swatinem commented Oct 6, 2020

rhcarvalho commented Oct 6, 2020

aramperes commented Oct 6, 2020 • edited Loading

Swatinem commented Oct 7, 2020

aramperes commented Oct 7, 2020

Swatinem left a comment

Choose a reason for hiding this comment

Swatinem left a comment

Choose a reason for hiding this comment

Swatinem commented Oct 17, 2020

aramperes commented Oct 3, 2020 •

edited

Loading

aramperes commented Oct 5, 2020 •

edited

Loading

aramperes commented Oct 6, 2020 •

edited

Loading

aramperes commented Oct 6, 2020 •

edited

Loading