Same thing. OpenTelemetry grew up from Traces, but Metrics and Logs are much bet...

PeterCorless · 2025-01-10T23:33:33 1736552013

Cramer wants to get traces out of OTel. Which is ironic because he's one of the creators of OpenTracing.

https://cra.mr/the-problem-with-otel/

deepsun · 2025-01-13T21:44:43 1736804683

He also started Sentry, so must know a thing or two on the topic.

incangold · 2025-01-10T17:54:47 1736531687

I think giving metrics and logging a location in a trace is really useful.

But I still dislike OTel every time I have to deal with it.

hinkley · 2025-01-10T18:01:26 1736532086

You can’t do fine-grained tracing in OTel because if you hit 500 spans in a single trace it starts dropping the trace. Basically a toy solution for brownfield work.

IneffablePigeon · 2025-01-10T19:33:16 1736537596

This is just not true. We have traces with hundreds of thousands of spans. Those are not very readable but that’s another problem.

PeterCorless · 2025-01-10T23:35:40 1736552140

How are you storing them, and what do you use to read/visualize/analyze them? I'd imagine just putting them up in a UI becomes a needle-in-a-haystack issue. Are you programmatically analyzing them?

IneffablePigeon · 2025-01-11T10:19:19 1736590759

Honeycomb. For shorter traces (most of them), a waterfall view is great. For the long ones, we try to split them up if it makes sense, but you can also just run queries scoped to that trace to answer questions about it (how many of the spans are db queries, how many are this query, are they quick, etc.).
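To make "queries scoped to that trace" concrete, here is a rough sketch in plain Python. It is not Honeycomb's query API; the span record fields (`trace_id`, `kind`, `query`, `duration_ms`) and the `summarize_trace` helper are made up for illustration.

```python
from collections import Counter

def summarize_trace(spans, trace_id):
    """Answer a few questions about one trace from a flat list of span records."""
    # Scope everything to a single trace first (hypothetical record layout).
    in_trace = [s for s in spans if s["trace_id"] == trace_id]
    db_spans = [s for s in in_trace if s["kind"] == "db"]
    return {
        "total_spans": len(in_trace),
        "db_spans": len(db_spans),
        # How many times each distinct query ran inside this trace.
        "by_query": Counter(s["query"] for s in db_spans),
        # Slowest db span, or 0 if the trace has none.
        "slowest_db_ms": max((s["duration_ms"] for s in db_spans), default=0),
    }

spans = [
    {"trace_id": "t1", "kind": "http", "query": None, "duration_ms": 120},
    {"trace_id": "t1", "kind": "db", "query": "SELECT user", "duration_ms": 4},
    {"trace_id": "t1", "kind": "db", "query": "SELECT user", "duration_ms": 9},
    {"trace_id": "t1", "kind": "db", "query": "SELECT cart", "duration_ms": 31},
    {"trace_id": "t2", "kind": "db", "query": "SELECT user", "duration_ms": 2},
]
summary = summarize_trace(spans, "t1")
```

The point is that a huge trace stops being a needle-in-a-haystack problem once you aggregate over it instead of scrolling a waterfall.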

pranay01 · 2025-01-10T18:27:51 1736533671

As mentioned by phillipcarter below, 500 spans is a very small number. I have seen customers send thousands of spans in a trace very easily.

phillipcarter · 2025-01-10T18:10:29 1736532629

...huh? I work with customers who (through a mistake) have created literally multi-million span traces using OTel. Are you referring to a particular backend?

hinkley · 2025-01-10T18:13:31 1736532811

phillipcarter · 2025-01-10T18:25:22 1736533522

Well that's a shame, I'm going to ask some folks about that. 500 spans per trace is ridiculously small and I can't imagine any good reason to have that limitation since it's just not that big of a footprint.

OTel doesn't define any limits on the # of spans in a trace (nor the # of attributes on a span!) but it will be bound by the limits of whatever backend you use. In the case of the one I work for, we do limit the total size of a span to be 1MB or less with 64KB per attribute before truncation. Other backends have different limitations. This is the first I've heard of such a small limitation on the total number of spans in a trace though. Traces are just (basically) collections of structured logs with in-built correlation IDs. I can't imagine why you'd limit them like this.
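For a sense of what that kind of backend-side limit amounts to, a minimal sketch using the 64KB-per-attribute and 1MB-per-span numbers above. The function names, the dict-shaped span, and the JSON-based size estimate are assumptions for illustration, not any backend's actual ingest code.

```python
import json
from typing import Optional

MAX_ATTR_BYTES = 64 * 1024     # per-attribute cap before truncation
MAX_SPAN_BYTES = 1024 * 1024   # cap on the total size of one span

def truncate_attr(value: str, limit: int = MAX_ATTR_BYTES) -> str:
    """Cut a string attribute to at most `limit` UTF-8 bytes."""
    raw = value.encode("utf-8")
    if len(raw) <= limit:
        return value
    # errors="ignore" drops a multi-byte character split by the cut.
    return raw[:limit].decode("utf-8", errors="ignore")

def enforce_span_limits(span: dict) -> Optional[dict]:
    """Hypothetical ingest check: truncate attributes, reject oversized spans."""
    attrs = {
        key: truncate_attr(val) if isinstance(val, str) else val
        for key, val in span.get("attributes", {}).items()
    }
    limited = {**span, "attributes": attrs}
    # Rough size estimate; a real backend would measure its own encoding.
    size = len(json.dumps(limited).encode("utf-8"))
    return limited if size <= MAX_SPAN_BYTES else None
```

Note that nothing here counts spans per trace: the limits apply to each span on its own, so the number of spans in a trace is unbounded as far as this check is concerned.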

hinkley · 2025-01-10T18:35:40 1736534140

That was two years ago (we tried spans before metrics), so it’s fuzzy. I believe the collector sidecar was fine with it but the backend was not, which complicated debugging. There’s not a clear feedback path in OpenTelemetry that we could find. I completely forgot to mention the tendency toward silent failures. That’s a cardinal sin for telemetry. I would take it out back and shoot it for that fact alone.

The other problem I noticed looking at the wire protocol was that the data for the parent trace doesn’t seem to get sent until the trace closes. That seems like a bookkeeping nightmare to me. There should be a start-of-trace packet and an update at the end. I shouldn’t have finished spans showing up before the parent trace has been registered. And that’s what it looked like in the dumps my ops people sent me to debug.
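A toy model of that ordering, assuming each span is only sent once it has ended (which matches what the dumps looked like). The `Span` and `Exporter` classes are hypothetical stand-ins, not the OTel SDK.

```python
import itertools
from dataclasses import dataclass, field
from typing import Optional

_ids = itertools.count(1)

@dataclass
class Span:
    name: str
    trace_id: int
    parent_id: Optional[int] = None
    span_id: int = field(default_factory=lambda: next(_ids))

class Exporter:
    """Hypothetical stand-in for the wire: records spans in the order sent."""
    def __init__(self):
        self.sent = []

    def export(self, span):
        self.sent.append(span)

def run_request(exporter):
    root = Span("GET /checkout", trace_id=1)
    for name in ("auth", "db.query"):
        child = Span(name, trace_id=1, parent_id=root.span_id)
        exporter.export(child)  # child ends first, so it is sent first
    exporter.export(root)       # root ends last, so it is sent last
    return root

exporter = Exporter()
run_request(exporter)
print([s.name for s in exporter.sent])
# prints ['auth', 'db.query', 'GET /checkout']
```

So the receiver sees children that point at a `parent_id` it has not been told about yet, and it has to hold on to them until the root arrives, if it ever does.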

mdaniel · 2025-01-11T03:31:45 1736566305

Practically a given outcome, then; we could knock their Managed Prometheus offering off the Internet on the regular. It was just laughable for a company that prides itself on one trillion IAM transactions to 429 some metric ingest.