UUIDv7 is a nice idea, and should *probably* be what people use by default inste...

kijin · on Oct 2, 2023

100 years sounds short-sighted for something that's supposed to be "universally" unique. We're already having problems with the 32-bit Unix timestamp not being large enough. If you're willing to use 160-bit (or longer) identifiers, you might as well give a few more bits to the timestamp. Round it up to an even number of base-62 characters, too. That part of KSUID has always struck me as a weird decision.
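
A rough back-of-the-envelope sketch of what "a few more bits" buys. The helper names are made up for illustration. It assumes a 32-bit timestamp counted in seconds, which is my understanding of KSUID's layout; that gives closer to 136 years than 100.

```python
# Hypothetical helpers, not part of any ID library.
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def span_years(bits, ticks_per_second=1):
    """Years covered by an unsigned timestamp of the given width and resolution."""
    return 2 ** bits / ticks_per_second / SECONDS_PER_YEAR


def base62_length(bits):
    """Smallest number of base-62 characters that can hold `bits` bits."""
    n = 0
    while 62 ** n < 2 ** bits:
        n += 1
    return n


if __name__ == "__main__":
    print(f"32-bit seconds:      {span_years(32):,.0f} years")
    print(f"40-bit seconds:      {span_years(40):,.0f} years")
    print(f"48-bit milliseconds: {span_years(48, 1000):,.0f} years")
    print(f"160 bits need {base62_length(160)} base-62 characters")
```

One extra byte of timestamp at second resolution already pushes the rollover tens of thousands of years out, and 160 bits come out to 27 base-62 characters, which is the odd length being complained about here.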

I wish UUIDv7 pulled the version/variant bits up front, though, just to make sure that the identifiers don't all start with null bytes.

wolletd · on Oct 2, 2023

Apparently, humanity is damned to repeat its mistakes over and over again.

"100 years should be enough" is what led us to a mountain of Y2K issues, because when would a two-digit year ever be ambiguous?

But I guess it's a psychological issue. Unless you're a megalomaniac, it's just natural to assume that your decisions won't matter much outside of your life and lifetime. And in that case, 100 years totally is enough because I probably won't live that long. And even more, in a lot of cases, it's also the correct assumption and the project won't live longer than a few years.

So, thinking about it, unless you are developing a novel standard or something that you want the world to adopt, 100 years probably IS fine. Unfortunately, KSUID wants to be a novel standard, so there's an issue.

8organicbits · on Oct 2, 2023

The timestamp is first.

https://www.ietf.org/archive/id/draft-peabody-dispatch-new-u...

seabass · on Oct 2, 2023

If the version bits were up front, then switching to a hypothetical UUIDv8 in several years would be guaranteed to break the sortability. So I see that decision as a bit of future proofing.
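
A toy illustration of the point, not the real UUID bit layout: it packs a millisecond timestamp, a 4-bit version, and 76 random bits into 128 bits in two different orders (the variant bits are ignored), and assumes v7 and hypothetical v8 IDs are being generated during the same time window.

```python
import os


def rand76():
    """76 random bits (10 random bytes with the low 4 bits dropped)."""
    return int.from_bytes(os.urandom(10), "big") >> 4


def timestamp_first(ts_ms, version, rand):
    # 48-bit timestamp | 4-bit version | 76 random bits
    return (ts_ms << 80) | (version << 76) | rand


def version_first(ts_ms, version, rand):
    # 4-bit version | 48-bit timestamp | 76 random bits
    return (version << 124) | (ts_ms << 76) | rand


# (creation time in ms, version) for IDs created in this order.
events = [(1000, 7), (1001, 8), (1002, 7), (1003, 8)]


def sorted_creation_times(layout):
    """Sort the IDs numerically and report the creation times in that order."""
    ids = [(layout(ts, ver, rand76()), ts) for ts, ver in events]
    return [ts for _, ts in sorted(ids)]


if __name__ == "__main__":
    print("timestamp first:", sorted_creation_times(timestamp_first))
    print("version first:  ", sorted_creation_times(version_first))
```

With the timestamp first, the mixed v7/v8 IDs still sort by creation time. With the version first, every v7 ID sorts before every v8 ID regardless of when it was made.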

kiitos · on Oct 2, 2023

Second precision is too coarse for many (most?) use cases.

travisjungroth · on Oct 2, 2023

How so? It seems like the only real use case for these timestamps is to get data from around the same time together. A second is fine for that. It's not about concurrency or avoiding collisions. A second can't handle that, but neither can a millisecond.

kiitos · on Oct 2, 2023

> It seems like the only real use case for these timestamps is to get data from around the same time together.

Yep.

> A second is fine for that.

Not when you're doing O(1k-1M) operations per second, it isn't!

travisjungroth · on Oct 3, 2023

I’d think that the locality would only matter at the scale of your query. I’m sure someone has queries with a window less than a second and so much traffic, but it seems niche enough to not optimize the standard for it.

I could definitely be off. I work at a company that gets those levels of traffic but don’t deal with it directly.

kiitos · on Oct 3, 2023

For me the whole value prop for ULIDs is that they can be generated by any node in a distributed system without coordination, while roughly preserving time order. "Roughly" meaning: all IDs will be globally ordered at millisecond precision, subject to the accuracy of each node's system clock; and IDs from a specific node will be locally ordered, subject to the details of the monotonicity part of the ID generator. This is important for me, because most of the things I attach IDs to will happen many many many times per second.
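
A minimal sketch of what I mean by the monotonicity part, assuming the usual ULID shape of a 48-bit millisecond timestamp followed by 80 random bits, encoded as 26 Crockford base32 characters. The class and function names are made up, the handling of a clock that steps backwards is my own choice, and it is not thread-safe.

```python
import os
import time

CROCKFORD = "0123456789ABCDEFGHJKMNPQRSTVWXYZ"


def encode128(value):
    """Encode a 128-bit integer as 26 Crockford base32 characters."""
    chars = []
    for _ in range(26):
        chars.append(CROCKFORD[value & 0x1F])
        value >>= 5
    return "".join(reversed(chars))


def wall_clock_ms():
    return time.time_ns() // 1_000_000


class MonotonicIdGenerator:
    """Per-node generator: no coordination with other nodes is needed."""

    def __init__(self, clock_ms=wall_clock_ms):
        self._clock_ms = clock_ms
        self._last_ms = -1
        self._last_rand = 0

    def new_id(self):
        now_ms = self._clock_ms()
        if now_ms <= self._last_ms:
            # Same millisecond (or the clock stepped backwards, an
            # assumption of this sketch): reuse the last timestamp and
            # bump the random part so IDs from this node keep increasing.
            self._last_rand += 1
            if self._last_rand >> 80:
                raise OverflowError("random part exhausted in this millisecond")
        else:
            self._last_ms = now_ms
            self._last_rand = int.from_bytes(os.urandom(10), "big")
        return encode128((self._last_ms << 80) | self._last_rand)


if __name__ == "__main__":
    gen = MonotonicIdGenerator()
    for _ in range(3):
        print(gen.new_id())
```

Because the alphabet is in ASCII order and every ID has the same length, plain string sorting matches numeric order: across nodes that is only as good as each node's clock, while within a node the increment keeps IDs strictly ordered even inside a single millisecond.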

contravariant · on Oct 2, 2023

If you need more than second precision then millisecond doesn't get you much further. The fact that the epoch ends in 120 years is a bit more worrying, but is also just about non-critical enough that it will be ignored for at least the next century.

Also, to all future historians of 2150, sorry about the mess, but yes we knew this was going to happen. Whatever it was.
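
For the historians, a quick sketch of where the 2150 figure comes from. It assumes an unsigned 32-bit count of seconds and a custom epoch offset of 1,400,000,000 seconds after the Unix epoch, which is the value I recall from the KSUID README; treat both as assumptions.

```python
from datetime import datetime, timedelta, timezone

# Assumed KSUID epoch offset, in seconds since 1970-01-01 UTC.
KSUID_EPOCH = 1_400_000_000

UNIX_EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def rollover(epoch_offset, bits=32):
    """Moment at which an unsigned seconds counter of `bits` bits wraps."""
    return UNIX_EPOCH + timedelta(seconds=epoch_offset + 2 ** bits)


if __name__ == "__main__":
    print("KSUID-style timestamp wraps in", rollover(KSUID_EPOCH).year)
    print("Plain unsigned 32-bit Unix time wraps in", rollover(0).year)
```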

kiitos · on Oct 2, 2023

> If you need more than second precision then millisecond doesn't get you much further.

It gets you precisely 100x further.

HALtheWise · on Oct 2, 2023

*1000x

kiitos · on Oct 2, 2023

Oops, yes, this one.