Windows feature that resets system clocks based on random data is wreaking havoc

soyagi@yiffit.net · 1 year ago

Windows feature that resets system clocks based on random data is wreaking havoc

nickwitha_k (he/him)@lemmy.sdf.org · edit-2 1 year ago

That’s a hideous, over-engineered attempt at solving something that has already been solved better in open standards and FOSS. I’d argue that bare NTP is more secure on merit of not trusting random TLS certs to be accurate.

EDIT: Found my comment to be too negative. I will stand by the fact that it comes across as hackish but can understand the logic in how it is supposed to work in theory, so, not stupid, just fundamentally insecure and terrible by merit of its design not paying enough attention to context.

Loulou@lemmy.mindoki.com · 1 year ago

Or as they say, every probabilistic curve ends somewhere.

If it works 999.999 time out of a million, then every millionth windows will break.

What an awful way to try to figure out the time. I mean it could at least pop a big error if, lol, the time seems off by a week!

TonyTonyChopper@mander.xyz · 1 year ago

no no, clearly we’re in 2159

Loulou@lemmy.mindoki.com · 1 year ago

Tss +/- a hundred years or two? The planet is over 4 Billion years!

Droechai@lemm.ee · 1 year ago

In that context it’s basically accurate and the errors rounding level. Good way to stay positive!

Loulou@lemmy.mindoki.com · 1 year ago

You are right; life is short, no time for un-needed pessimism!

Cheers to you!

towerful · 1 year ago

NTP is touched on in the article, and a quick Google shows that the largest difference NTP can correct before exiting in a panic is 1000s.
However there is an argument/flag to run ntpd once in a “just fix it” mode. So, having to use cert timestamps to “rough” the clock and allow NTP to “fine” it isn’t necessary.

It does seem strange to essentially create an out-of-band/off-label/out-of-scope time management system, when there are already open standards that work well for it.

nickwitha_k (he/him)@lemmy.sdf.org · 1 year ago

Agreed. I think that the problem that I have with this is similar to the problem of orgs in the US using an SSN as a form of universal ID. The Social Security Administration clearly states that that is not the purpose and will not provide verification because of this. X.509 certs are not meant for this purpose and their implementation does not take this use case into consideration as would be required for its use in a secure manner.

Max-P@lemmy.max-p.me · 1 year ago

If they’re going to use heuristics like that to get an approximate time, they could at least use it to validate connections to NTP servers or something that can actually sync the time properly. Get approximate time for initial sync, then contact a Microsoft server to get a more accurate time over HTTPS (which is what this supposedly meant to address), then use NTP to get accurate time and validate that it’s close enough, and only then when everything checks out, set the system clock to that time.

Richard@lemmy.world · 1 year ago

Stupid of them to use a Windows server in the first place…

Excel@lemmy.megumin.org · edit-2 1 year ago

Sounds like the heuristic is taking multiple samples only uses them if they are within some consistency threshold, to hedge against the cases where the field has random data.

The reason it only fails rarely and randomly is because it only happens when multiple actually random timestamps happen to line up around the same time.

Sort of like how several applications (cough git cough) have failure modes when two different files happen to have the same hash.

Turns out developers are bad at statistics and probabilities and don’t understand the birthday paradox.

towerful · edit-2 1 year ago

Hmm, the birthday problem alludes to what’s going on, except the birthday problem discards the year and the time.
If it’s 2x 32bit random timestamps that have to align within a 10 minute window (600 seconds) it’s a probability of 600 in 4.3 billion (uint32 max).
Still vanishingly small.
However, if a server makes 10 requests as part of STS, and you have 5000 servers, then there is a significantly higher chance of having to deal with the fallout.

That is, of course, assuming all server clocks slip enough to trigger this, and that all STS timestamps are random 32bit.
And there might be something in the way that 32bit timestamp is randomised. As it’s part of a cryptography system, it would make sense to be cryptographically secure. But seeing as it’s not directly part of the cryptographic process, it could be a cheaper/faster RNG.

candybrie@lemmy.world · 1 year ago

The server clocks don’t actually have to slip at all to trigger this. They just have to not match up with whatever the STS comes up with as the time.

AutoTL;DR@lemmings.world · 1 year ago

This is the best summary I could come up with:

A few months ago, an engineer in a data center in Norway encountered some perplexing errors that caused a Windows server to suddenly reset its system clock to 55 days in the future.

The engineer relied on the server to maintain a routing table that tracked cell phone numbers in real time as they were being moved from one carrier to the other.

“With these updated routing tables, a lot of people were unable to make calls, as we didn’t have a correct state!” the engineer, who asked to be identified only by his first name, Simen, wrote in an email.

Simen had experienced a similar error last August when a machine running Windows Server 2019 reset its clock to January 2023 and then changed it back a short time later.

The mechanism, Microsoft engineers wrote, “helped us to break the cyclical dependency between client system time and security keys, including SSL certificates.”

Simen and Ken, who both asked to be identified only by their first names because they weren’t authorized by their employers to speak on the record, soon found that engineers and administrators had been reporting the same time resets since 2016.

I’m a bot and I’m open source!

XEAL@lemm.ee · 1 year ago

feature in 2016 as a way to ensure that system clocks were accurate

Oh, boy…