Web fingerprinting is worse than I thought (bitestring.com)
620 points by Bright_Machine on March 21, 2023 | hide | past | favorite | 510 comments


Fingerprinting enables terrible things in big-tech data collection, and at the same time it's excruciatingly hard to protect against bots, spammers, fraudsters, etc. without it.

Few people seem to try to reconcile this, since neither side cares about the other.

I personally think that discussing fingerprinting as raw tech, without mentioning the size of the company collecting the data or the purpose, is meaningless, and only leads to a few tech-savvy users having less data collected on them.

Most people want to use JavaScript, use the default settings, and not be afraid of clicking on links. I can't really see a good solution without coordination between regulation and tech standards, so I'm at least hopeful for decent solutions.


You don't need to precisely identify users across sessions without their consent to detect bots. Advanced anti-bot systems make heavy use of behavioral biometrics and don't rely too heavily on fingerprinting, mostly because fingerprints are easy to spoof in general; generating human-like mouse data is a bigger challenge.


Sure, but on the other hand, a lot of anti-fingerprinting efforts strive to reduce the info available including things like mouse movement data.

Mouse movement data is a fairly potent fingerprinting vector. Bucketing the average mouse speed and acceleration rates could provide useful information. This may imply specific OS speed settings, or physical mouse DPI. A machine learning system would likely be able to distinguish traditional mouse, vs trackpoint, vs touchpad, vs trackball. Etc.

Also, it is not just bots that have non-human-like mouse movement. Many assistive technologies would have no mouse movement, or would auto-snap the mouse to the relevant spot. That is actually quite powerful for fingerprinting, since assistive technology users are a pretty small subset of internet users, so only a relatively small amount of additional data is needed to uniquely fingerprint that user/machine.
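As a sketch of how such bucketing could work, here is a hypothetical feature extractor over (time, x, y) pointer samples; the bucket width is an arbitrary choice for illustration:

```python
import math

def motion_features(samples, bucket=50.0):
    """Bucket average pointer speed from (t, x, y) samples.

    `samples` is a list of (time_seconds, x_px, y_px) tuples; `bucket`
    is the bucket width in px/s. Narrow buckets leak more identifying
    bits, wide buckets fewer.
    """
    speeds = []
    for (t0, x0, y0), (t1, x1, y1) in zip(samples, samples[1:]):
        dt = t1 - t0
        if dt > 0:
            speeds.append(math.hypot(x1 - x0, y1 - y0) / dt)
    if not speeds:
        return None
    avg_speed = sum(speeds) / len(speeds)
    # Coarse bucketing: the returned integer is one fingerprint feature.
    return int(avg_speed // bucket)
```

A real system would add acceleration, curvature, and pause statistics on top of this, but even a single coarse bucket narrows the user population.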


I wonder if that would be enough to precisely identify a single user among millions like regular fingerprinting can already do, but yeah, it's still a big fingerprinting vector.



Bézier curves are easily detected by machine learning models as non-human; that software won't work on Akamai or any decent anti-bot.


I wonder if you could use a chicken like in the old chicken tic-tac-toe machines to mimic real user behavior.


Disabling JavaScript does not stop fingerprinting either. HTTP headers are sufficient to construct unique user identifiers. Passing that data via API to a FaaS provider would enable cross-site tracking that's invisible to the visitor.

Edit: The required FaaS implementation is trivial too. I could launch an endpoint that performs exactly this function in 30-60 minutes.
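A minimal sketch of such a header-based identifier, assuming a hypothetical tracker that hashes a stable subset of passively received headers (the chosen subset is illustrative):

```python
import hashlib

def header_fingerprint(headers):
    """Hash a stable subset of request headers into an identifier.

    The header subset here is illustrative; a real tracker would also
    fold in TLS and TCP-level signals for extra entropy.
    """
    keys = ("User-Agent", "Accept", "Accept-Language", "Accept-Encoding")
    material = "\n".join(f"{k}:{headers.get(k, '')}" for k in keys)
    return hashlib.sha256(material.encode()).hexdigest()[:16]
```

Any two visitors whose browsers send identical values for these headers collide, which is why headers alone give a coarse fingerprint; combined with IP and CSS-derived signals the buckets get much smaller.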


In fact, the disablement of JavaScript itself is a very identifying characteristic.


It's one added bit versus countless bits that can no longer be probed. Yeah, disabling JS alone is not enough, but it is not useless either.


Not all bits are equally distributed. If very few people have that bit set, then it is very differentiating.


> I can't really see a good solution without a coordination of regulation...

Totally agree that this is perfectly within the government's purview, and they should be doing something about it. But, as with anything else in the US, until a Fortune 100, some few 1%-ers, or the deep state MIC wants it, we're not going to be getting it.


Until everyday people realize they’re being stalked, I don’t know what will change. I am seriously thinking about trying to go through the proposition process in my state to forbid selling of data (this should already run afoul of wiretapping laws, imho).

I thought having an ad campaign that targeted subgroups very specifically and boldly might be enough to drum up public interest. Something like: “Hello $name from $city. How did $recent_embarrassing_purchase work out? I hope you enjoy your birthday in $birth_month.” And then a link to the proposed policy.

Unfortunately, marketers have neither scruples nor the ability to control themselves and have captured an asymmetric advantage. Technologists do what they do, preoccupied with whether or not they could, not stopping to think if they should. It seems like legislation may be the only remaining option.


Pretty much what Signal did a few years ago [1], but on a bigger scale. Sadly, Facebook banned their ads account and they couldn't take it further; it would be interesting if someone tried the same.

[1] https://signal.org/blog/the-instagram-ads-you-will-never-see...


People realise they're being stalked; they just don't know what that means.

Techie people are convinced non-techie people don't know they're being tracked. They do! Ask your smart non-techie friends what they think about online privacy. I guarantee you they'll say something like "yeah, I know it's probably tracking me, but whachya gonna do".

Thanks to this disconnect, we have so many privacy campaigns with a message like "Did you know you can be uniquely identified on the web?", but so few (none?) that actually proceed to explain why that's bad, and what someone could do with that information. That's the missing piece. Give average people an actual reason to dislike or fear tracking, not just the mere curio that it exists.


I will admit that it always confused me why the browser has access to detailed hardware information. I can understand OS. I can understand resolution. I can rationalize GPU. I don't understand, though, why it should be able to access... well, everything about the machine.

edit: It is still impressive. Even with the firefox settings on, the website was able to identify me. I am not entirely certain how I want to approach this.


> I can understand OS. I can understand resolution. I can rationalize GPU.

None of these should be available to websites by default. The first two come from simpler times when people were not as concerned with privacy implications. The third has been and continues to be pushed by advertising companies (Google, Apple, Microsoft).


edit/ update from original post cuz i cant edit anymore

So quick update since I am mildly obsessive.

I was sure it was either GPU, CPU or addons that were giving me away ( I do have a mildly unique setup ).

I ran a few tests in a VM, and the moment I dropped GPU passthrough (leaving CPU passthrough), I was no longer (based on that website, anyway) tracked across sessions.

In other words, cat and mouse game continues.


Because the browser has become a vendor neutral, architecture neutral app engine and people want to do things like play MIDI instruments, use serial ports, use proprietary USB check scanners for accounting/ERP apps that work on the web and don't need SCCM to manage, etc.


yeah, but it should ask you if you want to allow the website to know this kind of stuff instead of just allowing it by default


> people want to

Some people want to do those things and for very specific websites. Most people don't even know what MIDI or serial ports are.


I would assume for more advanced browser features, like 4k video playing, that hardware information could tell the player whether your machine is capable of playing back 4k video without stuttering.


It got me on iPhone through a VPN change, cleared cache, private window, and reboot.

I know what to think about this… I fucking hate it.


So I just found that the "SnowHaze" browser prevents fingerprinting on fingerprint.com and https://browserleaks.com/canvas

https://apps.apple.com/nl/app/snowhaze/id1121026941?l=en

https://github.com/snowhaze/SnowHaze-iOS


Note that there's also:

  Settings > Safari > Advanced > Experimental Features
where you can disable OpenGL and such (I haven't tested yet).


That I can relate to, but the more immediate question is whether you are willing to adjust your habits to nullify its impact. Most people would not.


> Until everyday people realize they’re being stalked, I don’t know what will change.

FTFY: People already know; nothing will change.

Many of the things that are happening (at least in the US) are deeply, deeply unpopular, but are not changing, and show no signs that they are even susceptible to change. Fortune-sized companies, the 1%, and the deep state are calling the shots, despite how much can be seen in real time through things like Twitter and TikTok. I've actually had to pull back from Twitter because of all the things that are obviously beyond the pale, yet will never change. (Snowden, Assange, et al.)


That’s why I, unfortunately, think legislation is necessary. My state allows citizen proposals with 250k signatures to get on ballot and >50% support to become law that cannot be overturned by the legislature (that has its own issues, but in this case it would be binding).


>> I thought having an ad campaign that targeted subgroups very specifically and boldly

This has been tried by a guy who placed Facebook ads like these. FB blocked his account within a few hours.

So: good in theory, won't work in practice.


I would consider donating to such an effort. I'm sure there are others like me.


Yeah man, I think that's the only way anything is going to change

People are such dumb fucking cattle that they'll lash out at you rather than the data brokers or the software vendors who ratted them out though


> they'll lash out at you

Not only that, but they might have a legal case against you. I've been slowly working through Seek and Hide: The Tangled History of the Right to Privacy, and my main takeaways have been:

(1) The constitutional right to free speech and a free press is not as broad as most people probably think.

(2) Truth is not necessarily an air-tight defense in a case of libel, as courts at various times and places have decided against publishers for true but embarrassing things intended to humiliate or harm.


Maybe the trick would be to put it into a security envelope so you don't disclose anything... although I personally love the idea of printing it on a postcard, since it's practically public record once a data broker gets his filthy paws on it anyway.


Ad platforms aren't that stupid.


Do it.


Ha! I followed the instructions and went to fingerprint.com and it all 'crashed' because I had JavaScript turned off—that's my normal default setting.

I have five different browsers on my smartphone and three on the PC all sans JS and none of them are Chrome. Also, normal operation is to automatically delete all cookies at session's end.

My smartphone and PCs are de-googleized and firewalled and I never see ads in my browsers nor in apps. The apps are mainly from F-Droid and sans ads and the few Playstore ones I use are via Aurora Store and are firewalled from the internet when in use. Honestly, I cannot remember when I last saw an app display an ad, it has to be years back.

In the past I used to go to more extensive measures to stop the spying but I found it was unnecessary as the spy leakage was essentially negligible with much less stringent efforts.

It's pretty easy to render one's online personal data essentially worthless if one wants to. On the other hand, if you insist on using JS, Gmail, Google search, Facebook, etc., then you're fair game and you only have yourself to blame if your personal data is stolen.


Before you get all jubilant, note that they have fingerprinting techniques which don't use JS[0]. It was able to identify me. Contrary to popular opinion, disabling JS doesn't protect you from fingerprinting.

They describe their approach[1]. They use HTTP headers and conditional requests triggered by CSS media queries to gather data. Something like @media(...) {background: url(/tracking/$clientid)}. But in principle, they could also try to fingerprint the TCP/IP stack or the TLS implementation. I'm not sure it would get them more data than OS+browser, though.

[0] https://noscriptfingerprint.com/

[1] https://fingerprint.com/blog/disabling-javascript-wont-stop-...
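A rough sketch of the CSS-beacon idea described above, written as a hypothetical Python generator (the endpoint path and probe set are invented for illustration; each rule assumes a matching element exists on the page, so a matching media query makes the browser fetch a URL encoding one probed feature, no JS required):

```python
def tracking_css(client_id, beacon_base="/tracking"):
    """Generate CSS whose media queries trigger identifying requests.

    `beacon_base` is a hypothetical logging endpoint. The server learns
    which probes matched by observing which URLs each client fetches.
    """
    probes = {
        "dark": "(prefers-color-scheme: dark)",
        "wide": "(min-width: 1920px)",
        "hidpi": "(min-resolution: 2dppx)",
    }
    rules = []
    for name, query in probes.items():
        rules.append(
            "@media %s { #probe-%s { background: url(%s/%s/%s); } }"
            % (query, name, beacon_base, client_id, name)
        )
    return "\n".join(rules)
```

Each media feature contributes roughly one bit; stacking dozens of probes (fonts, viewport ranges, pointer type, color gamut) is what makes the no-JS fingerprint viable.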


"Before you get all jubilant, note that they have fingerprinting techniques which don't use JS[0]. It was able to identify me."

I didn't detail every protection I've put in place or the post would have been too long. However, I'd suggest that spreading my browsing over at least eight browsers (and I actually use more than two machines and do so at different locations and with different ISPs) effectively reduces my profile across the net.

I also use randomized browser user agents and clean links, occasionally I'll even cut-and-paste links between multiple browsers in a single session. I often do this on HN not to hide from HN but for convenience when multitasking. (Having worked in surveillance professionally, this modus operandi just comes naturally, it's now second nature for me to work this way.)

Working with multiple browsers and multiple machines also solves the problem when on rare occasions I have to use JS. That said, I never watch YouTube with a JS-enabled browser, instead I'll use NewPipe or similar. There are other measures I could list but you get the idea. Oh, and I never use the internet on a smartphone with a SIM enabled, instead the SIM resides in a separate portable router and my 'real' phone is a dumb feature phone, it's only capable of making phone calls.

I really don't care if some stuff leaks but I've satisfied myself it's pretty trivial, as frankly, I've not had one indication over the past 20 or so years that I've been targeted as a result of fingerprinting. It's not necessary to make things completely watertight, I'm not trying to hide from the NSA or GCHQ, etc. (and it'd be unsuccessful and a complete waste of time to bother trying).

Moreover, even if something were to leak, I'm simply not a revenue-making target—that means I never respond to any targeted marketing because I simply never receive any.


FF mobile gives me different IDs each time I run a new private session on both the JS and non-JS demos (I usually run without JS AND have the resistFingerprinting setting enabled).


It’s one thing to generate the same hash for the same client, but it would be interesting to know how often the hashes collide, too.

I also notice that the no-JS hash changes when I move the window to a different monitor.


To me this seems extremely elitist. Non-technical people deserve to have their personal data stolen because they don't know about javascript for example?


Technical defenses are never perfect. In a sense they provide security through obscurity, as evinced by the comments above regarding Stallman's use of wget. If everyone applied technical defenses equally then workarounds would quickly be found, and everyone would be equally vulnerable. So privacy is a scale, and being in the minority provides its own defense. If in aggregate each individual is equally valuable, then the value of breaching a minority's technical defenses is some inverse multiplier of the minority's size. Personally my threat model is to put in just enough work to never be the juiciest target.


I run a similar setup to the OP when browsing the modern web, but I think it is in a way our responsibility as professionals to help the less tech-inclined navigate the sea of monsters the modern web has become.

For example: I have set up the systems of family members, for whom I am some sort of digital janitor, with a nice collection of Firefox plugins to get rid of the worst offenders.


I get where you're coming from, but...

If you continue to willingly use socials like FB, TikTok, et al, your complaints about stolen personal data fall on deaf ears. Show me that you don't have those apps installed or do not visit their websites, then we can talk about being serious on deserving to not have data stolen.


"To me this seems extremely elitist."

Right, it probably is. But the issue of stolen personal data has been around for so long that nontechnical people have had years to develop political lobbying and to swing elections to put a stop to it.

The fact is that most people don't give a damn about such matters, if most did then the problems would be behind us by now.

Thus, unfortunately, with the internet it's every man and woman for him or herself. QED!


Have you ever tried to talk to "non-technical" people about this subject? They treat you like you're one of those tinfoil hat crazies.

At this point I'm 100% OK with us being the only ones able to protect ourselves. We warned them and they didn't care. Allow them to remain uncaring. We don't have to help everyone. People must want to be helped.


"Allow them to remain uncaring. We don't have to help everyone. People must want to be helped."

When people don't understand the implications or full ramifications then governments and lawmakers have to step in as they have a duty of care to protect citizens. It's one of the principal reasons for having government.

There are any number of examples, regulating the use of poisons, putting protection fences around cliff top lookouts, specifying the breaking strain of elevator cables, aircraft compliance design, removing lead from petroleum, and so on.

Unfortunately, governments have failed to act despite many warnings about these privacy matters.

Incidentally, there's an uncanny parallel between governments failing to act even when in possession of the facts and my last example. In 1923, when Thomas Midgley and cohorts—engine makers and petroleum companies—sought permission to put tetraethyllead in fuel, governments already knew the dangers of lead poisoning. Not only did they ignore all scientific warnings about the dangers of using the additive, but they also embraced Big Business and approved the move at the citizenry's great expense.


What they want is for things to be easy and require a low to nonexistent cognitive load. You start confusing them with details of what could happen and all the gyrations they have to do to avoid it, and they tune out and look at you like a tinfoil-hat crazy (are you sure they aren’t right?).

As the techno elite, it’s actually our job to create the underlying reality everyone else participates in when using technology. So, it is our responsibility to care, if you care. It’s not theirs - they’re just here for the party. But that doesn’t mean they’re sheep for slaughter, because there are plenty of folks ready to slaughter them for money.

It’s our ability to understand the issues and to actually improve them that uniquely makes it important for us to care. But we can’t expect people to turn off the cat video long enough to listen to us nerd at them, and we really can’t expect them to do something complex to avoid something they don’t understand or care about. Our challenge is this: how do we improve internet technologies sufficiently that everyone enjoys what we know is important, without requiring them to care? That’s how you build a better emergent reality.

I’m glad to have had a hand in the launches of Netscape and Mozilla, and have watched Firefox for years with pride. It is the closest thing to a mainstream everyman product that even remotely cares. WebKit Safari is a close second. I hope we all find ways to develop tech platforms that protect as well.


> are you sure they aren’t right?

Yes, I'm absolutely sure. Do I really need to justify myself here on HN of all places? On a thread about the fingerprinting implements of the surveillance capitalism industry?

> that doesn’t mean they’re sheep for slaughter

Welp. If they don't want to be slaughtered like sheep, they better start caring then. I'm done with that.

At this point what I really care about is strengthening my own privacy by having more users in the anonymity set. The more indistinguishable users there are, the more effectively we are protected. I figure that if they're apathetic enough to allow corporations to exploit them with absolute impunity, they're also apathetic enough to join the anonymity set. Browsers just need to make that choice for them. It needs to be the new default.

> we can’t expect people to turn off the cat video for long enough to listen to us

I can and I do. What we're saying about this matter is important. People should listen, join the discussion even. When we reach out to people about matters we consider important, we do it with the best of intentions. We expect they'll at least put some thought into it. If not that, we expect they'll at least treat us with some respect, not like some schizophrenic off his meds. Can't expect anyone to continue caring after multiple instances of that.

> Our challenge is this: how do we improve internet technologies sufficiently that everyone enjoys what we know is important, without requiring them to care?

Someone's gonna need to have the balls to make the choice for them. I don't have the resources to just make a better browser though. I do what I can by installing uBlock Origin on every single browser I come across. Everyone loves it and tells me that the web "feels" much better, though they can't quite explain why.


> Non-technical people deserve to have their personal data stolen

Nobody said that. "My defenses work" != "my defenses should be necessary".


"On the other hand if you insist on using JS, Gmail, Google search, Facebook etc. then you're fair game and you only have yourself to blame if your personal data is stolen."


... okay, I reread it for a third time and you're right. Not sure how I managed to miss it the first two times I read the comment. Yeah, that's nonsense.


Uh… they definitely said that. They specifically said that people were “fair game”.


Yeah? If they don't know how to operate a computer, then they shouldn't be operating one. I'd feel the same if someone without a licence crashed their car.


Using the web while being unfamiliar with Javascript is not analogous to driving without a license. It's closer to driving without being a mechanic.


But when my mechanic tells me that the grinding noise while braking means I need the brakes fixed, that doesn't excuse me from continuing to drive without fixing them, and it doesn't magically get fixed by turning up the radio until the noise goes away. To extend your comparison: devs are the mechanics, and devs have been screaming that operating browsers without blockers is like not getting the squeaking noises fixed. Everyone just keeps turning up the volume until the underlying noise goes away.


The more you customize the more unique your session becomes.


Not if you disable JS, because then the website can't see any of these customizations.


Except that disabling JavaScript is an anomaly all on its own. The dozens of users running without JavaScript might not be individually fingerprintable, but it's still a small enough cohort that I don't know how much I'd lean on that. Figure in the user agent string and it's probably a unique enough subgroup to sell ads to.


> it's probably unique enough a subgroup to sell ads to

I have been browsing with JavaScript disabled by default for the past 6 months. Based on my experience, no-JavaScript ads are rarer than a four-leaf clover.


More common than you think

https://amiunique.org/fpnojs


Probably not a representative sample though.


Also, that cuts down the group so much that I imagine other things that are usually too coarse-grained to be useful suddenly become much more useful. E.g., GeoIP location or Accept-Language headers.


> Figure in the user agent string and it's probably unique enough a subgroup to sell ads to.

But if you never see ads how do you sell ads to them and how do you meaningfully discover enough about the person to feed them valuable ads?


But ads don't work without javascript.


Having JS off probably puts you in the <0.1% of users bucket. Unless you additionally are:

* Routinely moving between IPs
* Modifying your headers to avoid giving away info (user agent, etc.)
* Defeating all the other non-JS things that fingerprinters probably look for

then you are not safe by just turning off JS.


Being in a 0.1% bucket is only ~10 bits of information - much less than what can be gathered with JS on.

And of course it's not enough, but the situation is even more hopeless with JS on.
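The "~10 bits" figure falls out of the standard surprisal formula, so as a sanity check:

```python
import math

def surprisal_bits(p):
    """Bits of identifying information revealed by an attribute shared
    by a fraction p of users."""
    return -math.log2(p)

# A 0.1% bucket reveals about -log2(0.001) = ~10 bits; identifying one
# person among ~8 billion requires roughly 33 bits in total.
```

So a rare attribute like "JS disabled" is a big step toward uniqueness, but still far from the full budget a rich JS fingerprint can collect.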


> Not if you disable JS, cause the website then can't see any of these customizations.

That's adorable. I guess you're not old enough to remember when we used to track people with things like invisible pixels. Or today's equivalent: testing CSS parameters.

Neither require JavaScript, and there are a hundred other non-JavaScript methods.


With that method, you won't be able to distinguish between the many different devices using the same browser at the same resolution behind one IP.

In the era of CGNAT, that means you now only know which city I'm from and whether I use Chrome or Firefox. People mostly use browsers maximized, and resolutions are relatively standardized nowadays.

Compared to the data you get from canvas and webgl, that's much less unique.


Hah! I used to embed invisible pixels for our marketing department decades ago.


This should automatically qualify one to lose their internet privileges. Not just the fact that you did it, but your cavalier attitude towards it and the lack of regret for having done it.


I actually agree with you. However, I plead that it was novel at the time, used ineffectively by a tiny marketing department, and not anywhere near this spy level capacity achieved today.


It’s fun to put Easter eggs for people like you. https://once.getswytch.com


I have no idea what you’re talking about. That URL only tries to load one piece of JavaScript, htmx, and all it does is unbreak the mobile navigation.

(Aside: this mobile navigation is, incidentally, the worst implementation I have ever encountered: instead of twiddling some classes or such, which would happen instantly, it makes an HTTP request that responds with the new navbar. For me, this means at least half a second’s latency on clicking the button, more if time has passed so that the HTTP connection is no longer open (1.5–2 seconds). It also fails the no-JS test, as the unintercepted form-submit just serves the page with the closed mobile navbar again, not switching out the navbar as I expected it might, and which would have been enough to avoid an unconditional “worst implementation” award. Sorry if you made this and it hurts your feelings, but… ugh, this is just a baffling misapplication of hx-post and naive Tailwind use, and just unconditionally a bad approach.)

Edit: better link which shows what I suppose you probably meant: https://once.getswytch.com/app


Haha. Yeah, it is pretty terrible and I made it.

It’s mostly a tech demo, so the things it does are intentionally weird/strange.


You are easily tracked without JS. It is much easier than tracking a default settings browser.


> It is much easier than tracking a default settings browser.

Not true. Especially if you mean a default browser with Canvas/WebRTC APIs enabled.

It is much more difficult for fingerprinting companies to get a high entropy fingerprint from a no-JS user.


Good luck with Lynx/Dillo with a fake UA.


It's important to know that the mentioned "resistFingerprinting" breaks a lot of the web.

Examples include the back button breaking, and photo uploads on some websites sending random data instead of the photo, etc.


If it breaks uploading a photo, it’s because the page unnecessarily copies the image into a <canvas> and then tries to upload the data from the <canvas> instead of the original image.


> the page unnecessarily copies the image into a <canvas> and then tries to upload the data from the <canvas> instead of the original image.

Surely there could be valid reasons for doing so?

I imagine for example that:

1. It ensures the selected file is a valid image before uploading it

2. It strips meta data like GPS position from the image before uploading it

3. It could reduce the size of the image, by either scaling it down, or compressing it more, or both, before uploading it


These are valid use-cases I agree. However I don't see why <canvas> should be leaky to support those use-cases.

Browsers should ensure all <canvas> operations produce identical results across platforms and hardware, and anything in the spec that prevents this should be removed from the spec.

Now, I recognize some of that functionality is handy for certain apps. In that case do like Android and put it behind an opt-in API, so the user can deny.

Basically, I think browsers need a "web app" mode and a "surf mode". Just visiting my local news outlet shouldn't require all the fingerprinting stuff.


The real snag comes from putting text into a canvas. Nobody can agree on what fonts they have installed, and of course there are all kinds of subtle variations from one version of the “same” font to the next, and then everyone has different ideas about hinting, kerning, stem widths, etc, etc, etc. You can fingerprint basically everyone just from that information alone.
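A hedged sketch of how such metric differences could be folded into an identifier: the width numbers would come from something like canvas measureText in a real tracker, and are purely illustrative here.

```python
import hashlib

def font_metric_fingerprint(widths):
    """Fold per-font text-width measurements into one identifier.

    `widths` maps font name -> measured pixel width of a fixed test
    string. Even sub-pixel differences in hinting or kerning change
    the hash, which is why font rendering is such a strong signal.
    """
    material = ";".join(
        f"{font}={w:.2f}" for font, w in sorted(widths.items())
    )
    return hashlib.sha256(material.encode()).hexdigest()[:16]
```

Because the hash changes with any single measurement, two machines only collide if every font renders the test string identically, which across font versions, hinting engines, and installed font sets almost never happens.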


Every browser can agree on a specific font if they truly cared about end-user privacy.


Sure fonts and text is hard. But none of that is needed for basic surfing of the web.


In that case you should use Firefox, and turn on “resistFingerprinting”. It’s not perfect, but it’s approaching real privacy.


Or the Arkenfox config (https://github.com/arkenfox/user.js), which enables resistFingerprinting among other essentials. In this kind of game, a community config is exactly what you want.


Either there is a complete and total consensus on every aspect of rendering or there are differences in how <canvas> is rendered.


Ciphers and hashes publish test data so you can ensure conformance. Don't see why, in principle, one couldn't do something similar with a stripped down <canvas>.
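In sketch form, a canvas conformance vector could work exactly like a cipher test vector: publish the digest of a reference rendering and require a bit-exact match (the reference pixel buffer here is invented for illustration):

```python
import hashlib

def digest(pixels):
    """Digest a canvas pixel buffer, like a cipher test vector."""
    return hashlib.sha256(bytes(pixels)).hexdigest()

def conforms(pixels, reference_digest):
    """A conforming renderer reproduces the reference output bit-exactly."""
    return digest(pixels) == reference_digest

# Hypothetical reference vector published alongside the spec:
reference = [0, 0, 0, 255] * 4  # RGBA for a 2x2 opaque black canvas
REFERENCE_DIGEST = digest(reference)
```

The catch, as the surrounding comments note, is that bit-exactness rules out most hardware acceleration and platform font stacks, so the spec would have to mandate a software rasterizer and bundled fonts for the stripped-down mode.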


> I think browsers need a "web app" mode and a "surf mode"

Agree. It will be hard to define a standard for "surf mode", but in addition to privacy benefits there would be security benefits for the browser container as well.


I don't think it would be that hard; start with "no JavaScript" and add a better compatibility method. Ideally, add ways to get the browser to do common stuff like resize images, although even saving that for "app mode" would be a big improvement on the current situation. Making the standard is easy; it is getting anyone to follow it that is difficult. Sites could already work great without JavaScript if they wanted to, but very few do.


"No JavaScript" is a non-starter in my opinion. That's a very simple on/off switch that is already available but has very little buy-in. As you noted, a "JS off" mode requires a shift in what HTML/CSS are capable of on their own.

> Making the standard is easy, it is getting anyone to follow it that is difficult

That's my point: those two parts aren't disconnected. The standard isn't useful (or really a standard) until people follow it, and in this case that's most of the internet-connected world: both people building for a new default subset, and users accepting a default subset with opt-in "web app" bells and whistles.

Without removing JS, in my head it's along the lines of starting with a freeze of a current ECMA version: define the APIs that are stripped out, force low-fidelity timers, remove JIT, limit some cross-origin options. Stop adding shiny new features every 8 weeks. Keep it there for 3-4 years. Or maybe a similar concept with a WASM container, when that gains some browser usefulness. Then there's the HTML and CSS subset too. So, defining that stock subset navigator at the right level is what I see as the "hard" part.


There are some improvements that could be made to HTML/CSS, but it is already possible to do a bunch of fancy stuff with no JavaScript. I don't think it is possible to avoid tracking while allowing JavaScript, unless it's only the most trivial JavaScript, and for that there are likely already HTML/CSS alternatives. The stuff you are talking about is already available if you dig into the settings, although of course picking and choosing your own collection of settings like I do is itself a unique identifier. But there need to be a bunch more restrictions to actually prevent fingerprinting.

I think the lack of buy-in is because the people who would need to buy in are the ones pushing the tracking. Rather than a new standard, something like a directory of sites that work well without javascript (and a search engine covering just those sites), with enough people using it that being listed is an advantage, seems to me more likely to be effective.


> Browsers should ensure all <canvas> operations produce identical results across platforms and hardware, and anything in the spec that prevents this should be removed from the spec.

You would basically have to kill all hardware accelerated features and run everything in an interpreter. Also make sure that turbo button is set to slow, to get consistent behavior across all CPUs.

The only real way to prevent fingerprinting is to lock these features away by default and force websites to beg for every single one of them. Not an "accept all" screen; make the process so painful that 90% of users would rather avoid those abusive sites entirely. Basically the same dark-pattern shit every site pulled with the cookie and GDPR accept popups, just in reverse.


3 sounds incredibly undesirable to me, assuming we’re dealing with a jpeg. Go through 3 or 4 rounds of that and compression starts to get pretty visible.


Most websites will recompress user images. Although you probably don't want to do it client side.

The biggest reason is of course cost saving: store and transfer smaller images. This could be done client side with a server-side check on max size.

Another big reason is metadata stripping. Both to protect the user (can be done client side) and to avoid unintentional data channels being provided.

Another reason is to avoid triggering exploits. If a major browser has a JPEG rendering exploit, Facebook doesn't want you to be able to pwn everyone who sees your post. By using a trusted encoder, it is very likely that the produced image more or less follows the standards and is not likely to trigger any exploits (as exploits usually require invalid files).


I've had to implement this - we have a web app used by engineers in the field where signal is often not great. We got lots of complaints about image uploads as for a typical job there would be potentially 100+ images that needed to be uploaded (multiple assets with 2 before and 2 after photos per asset).

iPhone defaults to uploading a large image which can take ages to upload. We implemented a canvas based solution which sends a base64 string representing a compressed image and reduced the upload file size by about 90%. We don't need high quality original images in the backend.
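For what it's worth, a rough sketch of that kind of canvas resize follows; all function names, the max dimension, and the JPEG quality here are my assumptions, not the actual app's code. The dimension math is a pure function; the canvas part only exists in a browser.

```javascript
// Pure helper: scale (w, h) down so the longest side fits maxDim.
function computeTarget(width, height, maxDim) {
  const scale = Math.min(1, maxDim / Math.max(width, height));
  return { w: Math.round(width * scale), h: Math.round(height * scale) };
}

// Browser-only: draw the full-size photo onto a smaller canvas, then
// export it as a compressed JPEG data URL ("data:image/jpeg;base64,...").
function compressForUpload(img, maxDim = 1600, quality = 0.7) {
  const { w, h } = computeTarget(img.naturalWidth, img.naturalHeight, maxDim);
  const canvas = document.createElement('canvas');
  canvas.width = w;
  canvas.height = h;
  canvas.getContext('2d').drawImage(img, 0, 0, w, h);
  return canvas.toDataURL('image/jpeg', quality);
}
```

Strip the `data:image/jpeg;base64,` prefix before sending if the backend expects the raw base64 payload.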

I may have missed a trick, this has been in place for a few years now but at the time I couldn't find a better solution.


I was under the impression that base64 encoding doesn't reduce file size of an image at all, rather it sometimes increases it. That wasn't the point of using base64 string, right?


> a base64 string representing a compressed image

Parent explained that the base64 encoding held compressed data.


But why base64 and not just… send the bytes? 6 bits per character vs 8
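The size cost of base64 itself is easy to quantify with a quick Node sketch:

```javascript
// Base64 maps every 3 input bytes to 4 output characters, so the payload
// grows by about a third (plus up to 2 padding characters).
const raw = Buffer.alloc(300000); // stand-in for a ~300 KB JPEG
const b64 = raw.toString('base64');
console.log(b64.length);              // 400000
console.log(b64.length / raw.length); // ~1.333
```

In practice people often reach for base64 because it drops straight into a JSON body; sending the bytes via multipart/form-data or a Blob upload avoids the ~33% overhead.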


There are many perfectly valid reasons to do that. It’s a lot more scalable to resize images client side rather than server side and using a canvas is one of the simplest ways to achieve that.


It's necessary if you have filters in the upload flow, like Instagram does (which is why it breaks.)

Or it might not be strictly necessary, but Instagram does it anyway.


> unnecessarily

No, this is how most pre-upload image editors work. Why upload a 5MB avatar photo that you're going to have the user crop and scale on the client side to a few hundred KB first?

Using canvas for this is much more friendly to their bandwidth, no nefarious intent needed.


It also breaks page zoom. The user's preferred zoom level for a domain isn't preserved between new-tab page loads, but resets itself every time.

(I'm guessing it was too much implementation work to separate out this feature: to preserve normal, expected UI behavior client-side, while presenting a fake pagezoom value to scripts. That would degrade only a handful of (poorly-designed, script-layout) websites, rather than the whole accessible browser experience).


Yeah, I enabled the option yesterday after learning about it; today I disabled it again, since NOPE, without site-specific zoom settings retained the web is too inconsistent for me.


I just tried putting it on with the idea of trying it out for one workday to see if it breaks something. It immediately broke favicons on my GitLab tabs (turning them into random vertical stripes of pixels), which is both odd and a pretty bad start.

I really like the idea behind this feature, but it seems the Web API might have become too complex to counteract bad actors like this. It's particularly scary that it can correlate your activity in private mode with your identity in normal mode.


RFP randomizes Canvas data extraction by default, which might have something to do with it. The GitLab favicon seems normal to me when I navigate there (RFP on).
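Something like this toy sketch (not Firefox's actual algorithm) is roughly what RFP-style canvas randomization does to readbacks: perturb the low bit of each pixel channel, so extraction gives a slightly different answer per session while the image still looks normal to a human.

```javascript
// Toy canvas-readback randomizer: randomly flip the low bit of each
// pixel byte. The image is visually unchanged, but hashes of the
// extracted data differ between readbacks.
function randomizeReadback(pixelBytes) {
  return pixelBytes.map(b => (Math.random() < 0.5 ? b ^ 1 : b));
}
```

A favicon rendered through a canvas and then re-read could plausibly get scrambled by this kind of interference, which would explain the GitLab glitch.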


I've been using it for years. I've barely noticed.


It also tells websites that you want a light color scheme (instead of not indicating any preference).


For the photo problem you can give explicit permission for the website to use Canvas and then reupload the photo. It's annoying, but oh well.


This is FUD. As others have said, been using RFP for years and barely noticed.


I'm not saying don't use it, I also have it turned on. I'm saying that it has consequences, and you might not immediately realize it's related to RFP.


There certainly are consequences. However, you said it "breaks a lot of the web" including "the back button".

Maybe this is the case with some very complicated SPA type sites, but personally, I've never seen this.


Yes, all the problems are on very complicated SPA type sites. You know, like Google Docs/Drive, YouTube, Facebook, Instagram.


Another method for web fingerprinting is GPU fingerprinting [0], codenamed 'DrawnApart'. It relies on WebGL to count the number and speed of the execution units in the GPU, measure the time needed to complete vertex renders, handle stall functions, and more.

_______________________

0. https://www.bleepingcomputer.com/news/security/researchers-u...


As the years pass, I keep thinking back and realize that Richard Stallman was right all along:

> For personal reasons, I do not browse the web from my computer. (I also have not net connection much of the time.) To look at page I send mail to a demon which runs wget and mails the page back to me. It is very efficient use of my time, but it is slow in real time.


I think Stallman just shot himself in the foot by even revealing that much. Unless a lot of people do the same thing, it's very easy to conclude that it was Richard Stallman who sent that wget request, granted a few variables. The difficult part is perhaps tracking it back to its actual source, but I don't think Stallman is that hard to find. All this is of course extremely chilling. I'm sure a profile could be built up around wget requests, and then, employing some "likelihood machine" on it, educated guesses could be made as to how likely it is that a given wget request actually came from Richard Stallman. I think we've just stumbled upon a new and "fun" Where's Wally game here!


I actually did exactly that a while ago. Where I worked, we didn't have internet access but we had email access, so as a workaround, I made an email server on my home machine that fetched web pages for me. A coworker took it even further and made a proxy server that automated the process so you could actually browse the web, although very slowly. Just to say that Stallman is not the only one with this idea.

It was in the early 2000s, and smartphones weren't a thing. It also was a time when companies were paranoid about letting employees access the internet, but at the same time had abysmal security. By that I mean viruses ran free on shared folders, undetected because their antivirus software was years out of date. Very different times...


In the early 2000s I was working as an analyst for a VFX studio, and I had a meeting with the CFO first thing in the morning. At some point we needed to look at something on his computer, and he responds "we'll have to wait about 15 more minutes." "Why?" I ask, and he shows me: every day when he turns his computer on, the browser starts producing "pop under" windows at a rate of about 10 per second, and that lasts for about 40 minutes. Shocked, I debate with him for a bit and he says they've tried every antivirus. I shake my head. When they stop popping windows he has a program that mass-closes all of them, and then he goes to work using that same computer - and for studio finances. Blew my mind.


In Germany there is a "WhatsApp" SIM [1], where you have to pay for normal internet use, but WhatsApp texts are free of charge.

With a technique which you described, you could probably abuse a phone with this SIM as a "free" hot spot with infinite data.

[1] https://www.whatsappsim.de/


A lot of paid Wi-Fi hotspots allow DNS traffic through unmolested, that's a similar loophole


Maybe one could use something like this https://github.com/yarrick/iodine


I'm pretty sure there was something, within the last year or so, on HN front page which used [exploit/protocol/hack] to browse wikipedia over [SMS/tweet/etc.], or similar


> Guthabenaufladung mind. 5 € alle 6 Monate zur Verlängerung des Aktivitätszeitfensters.

So it costs at least 83 cents per month. Still might be worth it compared to the insane mobile data charges here if you can get a usable bandwidth. I suspect in practice they will just ban you if you abuse it like that.


I recall one time in 2015 or 2016 when I had only a very weak 2G signal, but wanted to check a couple of pages (at least one of which was several hundred kilobytes). Connections always timed out in browsers, but I got it working by SSHing into my VPS, downloading the page with curl, then copying that down with scp. My recollection is that the file size would increase by 32KB every 15–30 seconds. Fun times!


Mosh works over 2.7 KBPS capped data plans. I connected to a tilde and just use lynx/links/edbrowse for light www/irc/jabber and gopher. It runs much faster than being connected natively to the inet. I can read everything and even answer in fora with Edbrowse.


> It also was a time when companies were paranoid about letting employees access the internet

In the early 2000s I was working at an insurance company. They used some kind of blocker in their outgoing firewall that prevented access to certain sites. At one point the blocklist included SourceForge, which threw a wrench into my team's work, because at the time a lot of the packages we depended on were hosted there. It took a few days to get that removed from the blocklist.

This same insurance company shut down for multiple days when a virus, I think it was ILOVEYOU, infected their email system so badly that nobody could work, and everyone (except the poor IT folks) got a long weekend. And then a while later it happened again, but with a different virus, possibly Nimda. The company was very bad about updating its systems, and even in 2003 most users were stuck on Win95.


Company I work for still blocks SourceForge because... something bad happened 15 years ago (?).


Enterprises.


You gotta be a little more specific.


> It also was a time when companies were paranoid about letting employees access the internet, but at the same time had abysmal security.

I recall we had a crappy firewall that would collapse under the load of NAT for the 100ish employees and so executives got static IPs mapped to their machines. The late 90s and 00s were crazy.


> so executives got static IPs mapped to their machines. The late 90s and 00s were crazy

In my Uni days, all our department's machines had public IPs; no NAT, no firewall(!)

So much simpler to be able to telnet, FTP and/or remote desktop straight from home to the office :)


Same at my University in the mid-90s. I was the CS department network admin and we had an entire /24 to use as we liked.

At least it taught me how to detect attempted hacks early because every machine had to be monitored for attacks.

I just looked and they still have a /16 (65k public addresses). This is for a school that has maybe 15k students, not all of them living on the campus. And I’m sure most of the computing takes place in the cloud now anyway.

I know there are a lot of places who were on the Net early besides the military that have excess address capacity.


I did the reverse: I had an SSH connection to my home machine and ran an IPv4 tunnel back through it so that I could browse the entire corporate intranet from my home network, creating a full VPN essentially. Made me about 10 times as productive as going through the dial-up we had to use while we were on call.


Stallman shot himself in the foot by having a text-only blog that was easily searchable when it came time for the wolves to cancel him. A crappy proprietary blog or thousands of hours of ranting via YouTube videos would, ironically, have slowed down the haters and maybe even caused them to miss things with which to cancel him. It's hilarious in an ironic way. Bonus points if the cancelers were running GNU software. :D


You make it sound like he said something mildly insensitive. He was "cancelled" for making pro-cp comments, and for literal decades of being a creep. https://twitter.com/_sagesharp_/status/1173637138413318144


And how does that invalidate his technical expertise?

More importantly, was he charged and convicted with anything?

I am getting really tired of this public opinion tribunal, where mere accusation is enough to get a person out of their position. This is not how this is supposed to work at all.


No you don't get it. He was a "creep" aka should be cancelled wholesale on everything /s

The only questionable thing he did was try to rationalise pedophilia, which he has since changed his mind about. Given that he's clearly _not all there in the head_ (i.e neurodiverse) and assuming he hasn't tried to access child pornography or similar I couldn't care less. All of the other accusations against him are nonsense[0] and all center around unsubstantiated rumors of him being a "creep." People being anti-Stallman is insane to me considering how much he has contributed and advocates for not only free software but also gender equality.

[0] https://stallmansupport.org/debunking-false-accusations-agai...


> Given that he's clearly _not all there in the head_ (i.e neurodiverse)

Neurodiverse people are "not all there in the head"?

gtfo.


That's right, the guy who sits down, takes off his socks and eats his toenails/toe skin in the middle of a Q&A[0] shouldn't be held to some ambiguous standard of how to navigate society.

It's exactly this eggshell stepping that people are expected to adhere to that has people treating him the way they are.

https://www.youtube.com/watch?v=I25UeVXrEHQ&t=110s


Actually, he was "cancelled" by people lying about him supporting Epstein and saying child rape is good, neither of which were even close to being uttered.

The disingenuous nature of this all is why he's back in his foundations again.


wget can be pretty trivially told to send custom headers.


Try to do that to a site with CF bot protection cranked up... Not happening without a custom build/custom ssl proxy that mimics the SSL fingerprint of Chrome.


CF blocks you hard simply for enabling "do not track" in settings. The discussion about how awful they are needs to be had.


I haven’t seen a custom build of Wget, but for Curl there is curl-impersonate[1].

[1] https://github.com/lwthiker/curl-impersonate


It would be a lot of work to make it mimic a common profile though.


That work was probably done once, years ago. Might need a few string tweaks every few years, which could be automated.


No, because any “RMS” set of headers would only be shared by the small number of nerds who care, fingerprinting us more accurately again.


Just set up a honeypot and use headers from there ;)


Maybe he's setting a false trail and using curl


Maybe the script does:

    wget --user-agent="Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)" ...
Nobody will suspect a thing!


Are we losing the point here? "Does not browse." When interacting with webmail he also does not browse directly, preferring CLI scripts to act as intermediary. Does wget execute .js or .css, or act on anything it reads beyond a URL redirect? Is wget a huge attack surface like a browser?


The GNU/Hurd in the IA didn’t already do it?


I'm pretty sure wget has plenty more users in addition to Stallman


I'm sure too! But there are some very important differences between Stallman's use and "my" use. Personally I use wget all the time to get specific stuff, mainly downloads of binaries, say for some UX system I'm setting up. I'm fairly certain that this is the most common use of wget, so all that can easily be filtered out. This leaves Stallman's use case, and a few other secretive users, who I'm sure can also be divided into separate categories that can then be used to further identify each user's uniqueness. I'm not saying that it's easy, but I'm saying that he's got a higher chance of getting "caught" simply by revealing his rather unique use case.


Not to mention its programmatic use by tools and applications.


There are so many bots sending wget requests that I don't think it's a real issue.


wget can mask the User-Agent and lots of other variables.


Hard to watch Netflix or YouTube this way. Considering I have just learned electronics design from YouTube, this is inconvenient.


For Youtube, try invidious or yt-dlp


Also mpv calls yt-dlp automatically. Add a browser extension to launch mpv for links/the current page and the experience is so much better than in-browser video: native controls, video window can be placed anywhere, full power of ffmpeg.

I am continuously baffled by how most people just accept media companies controlling the video player you are allowed to use, and thereby the UX. Don't let them.


I use DilloNG, an Invidious instance and mpv being called from the page with a right button click.


mpv is perfect for this


A more modern equivalent might be using something like archivebox instead of browsing directly.


& . . . . @

"Mail for you, sir!"


Why is this being fought with technical measures (which are ineffective and cripple the web as a platform) instead of legal consumer law where you can easily fine and punish companies that do the fingerprinting?

EDIT: Note that you can do BOTH - but one without the other is just a game of whack-a-mole.


Because some browser-makers (Firefox at least) believe that the identity of those browsing the web should be protected. Legislators do not believe that. (At least, a majority of legislators do not.)


What kills me is the cookie consent stuff. They should have enforced that Do Not Track is honored, with fees that make sites ensure compliance or be sued for not honoring DNT, which IIRC was sent as an HTTP header. That would have actually been a meaningful solve.


What a legislator believes is irrelevant. Only what the lobbyist is paid to believe is relevant.


Would you consider the entire European Union a minority of the legislators? Because that's what GDPR is designed to do, make identifying customers well controlled and expensive whatever the method.

Granted, the enforcement should be stepped up.


> Would you consider the entire European Union a minority of the legislators?

No, I was referring only to my own legislators (in the US, and not specifically California). Many other places in the world are doing better.


They should enforce that Do Not Track is honored. It's the easiest way, and websites don't need silly cookie consent dialogs if it's set.


DNT is ~useless because it's opt-out, whereas "auxiliary", non-essential tracking is opt-in under GDPR.

Websites don't need cookie consent dialogs if they only use cookies to do things that don't need to be consented to, like providing the service they are offering. Look at Apple's website, they don't have any.


DNT may be opt-out. But it should certainly be treated as "Don't even bother asking for consent to track, because I already told you the answer is no, and you'll be harrassing me by asking."


My argument is current laws did nothing to give teeth to DNT. I'm not worried about what the technological defaults are, but I would argue that without DNT being legitimized, it was dead on arrival. We have had it in browsers for ages, and we've dropped the ball on enforcing it for ages.

My other argument is, if you detect DNT, the cookie consent dialog shouldn't be shown at all.
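Mechanically, the idea would be something like this server-side sketch (illustrative names only; nothing legally requires this behavior today, which is the complaint): if the `DNT` request header is `1`, skip both the tracking and the consent dialog.

```javascript
// Honor DNT: a request with the "DNT: 1" header has already answered the
// consent question, so don't render the dialog (and don't track).
// Node lowercases incoming header names.
function shouldShowConsentDialog(req) {
  return req.headers['dnt'] !== '1';
}
```

A few sites did implement exactly this voluntarily, but without enforcement the header was so widely ignored that browsers have been deprecating it.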


The EU has about 1/18th of the world's population, so certainly that would be a rather small minority.


A law needs a justification and needs to apply equally to everyone. Writing that about fingerprinting would not be trivial. Some site operators can make a believable argument that they use it in ways that are good for society.


"Some site operators can make a believable argument that they use it in ways that are good for society."

Example please


My bank phoned me last summer. I'd authenticated with my usual two factors but a new browser fingerprint, then transferred a large sum to a new recipient. The bank blocked the transfers I did that day, then phoned me to check whether I'd been phished, suffered a keylogger attack or something.


So you were inconvenienced as a result of a false positive derived from tracking. Hardly a great argument.


You can make anything seem poor if you mention the negative effect and not the positive one.


Credit Card Fraud, Spam, etc


Even if this were the case - which I don’t actually believe, but… - it would be straightforward for that law to also constrain these purposes and prevent data sharing with non-worthy operations. At present it’s basically a free for all.


That is literally what GDPR is. Somehow it got reduced to cookie banners in HN psyche, but the whole idea of GDPR is to make sure that the data can be collected and used for well defined purposes that are either necessary to provide a service (preventing CC fraud would qualify), or are explicitly consented to.


The problem with the consent part is that you basically can't take part in the modern world if you don't consent. The theoretical possibility of opting out is undermined by deliberately bad UX.


Not really, there are plenty of plugins that dismiss the popups automatically without consenting to marketing. But generally we should press our legislators to require a universal interface that then can be automated, not try to win a cat-and-mouse game against multibillion conglomerates whose income depends on winning it.


I think the misunderstandings about the GDPR (even many smart people don't get it) prove that designing and writing such a law is difficult and the result has to be complex.

IMO the GDPR is good. But… it is poorly understood by many affected people. IMO if a law is poorly understood by the people it affects, then one should assume the law to be at fault, not the people. IMO it's good but I'm not happy.

4×IMO! Wow.


> IMO the GDPR is good. But… it is poorly understood by many affected people . IMO if a law is poorly understood by the people it affects, then one should assume the law to be at fault, not the people.

You are assuming that it has to be either of them who is at fault. In reality there are third-parties who have been spewing FUD in order to confuse people about the law.


I do indeed assume good intentions. Doing that is one of the principles by which I live.


The short answer which should be obvious... regulatory doesn't work, legal doesn't currently work.

The burden of proof is on the claimant, and with proper information control you can't ever meet that burden of proof. It becomes an ant versus a gorilla instead of David vs. Goliath.

Tell me, how do you differentiate a simple random alphanumeric string from another random string that may have been generated as a fingerprint?

Mathematically, do you think there's any way to actually prove it one way or the other? If not, how does that bias the system if the person is adversarial and lies?

The only way to prevent this is to make sure the information is nonsensical.

Preventing collection would identify you in a way that they can prevent access. Even though websites are public, you see this happening with any captcha service.


Can you provide any proof that "regulatory doesn't work"?

Might be my European outlook, but consumer law has been stupidly effective at curbing abuses from companies here, and was much more effective than playing the technology race the USA is trying to fight. There's always a next side-step, the next abuse a company can invent - and you keep trying to push the responsibility of avoiding it onto users (by adding more and more onerous technology) instead of punishing the abusers.


You don't need proof you just need some sound reasoning about the trends. If it were as effective as you claim, progression in this area would have halted full stop.

Ask yourself how long those consumer laws have been in effect. Has this technology problem progressed during that time (increased or decreased)? Have the fines against the large tech companies actually been collected, and were they sufficient to curb that behavior, or are they still being administrated or adjudicated (decades later)? Have the large tech companies provided all of the information they collect for review (including the intermediates they generate internally from processing, in a way that discloses all the ways they use it), or did they only provide a plausible alternative, or just the base information collected without explanation? Do you have a way to prove it's the former and not the latter?

I'm sure consumer law has been effective at eliminating the provable abuses domestically. If they were effective internationally, why would the problem be progressing to ever more complicated ways of ubiquitous tracking (which are against that law), or even domestically for those multinationals.

It's business as usual, and these people know centralized power structures suffer structurally from corruption and malign influence, and as a market force they exploit that.

There's enough money in people's futures that no fine will actually solve the issue because fraud gets baked into the process. Privacy, communication, and agency are what largely compose people's future.

Due process from corporate sovereignty guarantees they can draw it out as long as they need to while continuing to make money off their actions, both increasing costs to regulatory (as a resource drain), and increasing revenue.

The real cost is borne by either the individual or the public, and corporations have an incentive to lie in ways that are difficult or impossible to prove. A lie of omission is a lie.

In my opinion, for certain critical societal protections, it's necessary to have guilty-by-default for 'people' whose only possible motive is profit incentive. Corporations and firms are considered people in most locales, but they only adjust behavior based on profit or future profit (through monopoly).

Placing the burden of proof on the company to prove they are complying, instead of compliant with good faith protections by default, would eliminate most benefits they might receive from deceit, or lying through omission.


Just so we're clear - the consumer law has mostly not been adjusted to cover data mining yet and you seem to be building your argument on the assumption that it has.

Am I correct?


As far as I was aware, it had. Everything I've seen in the last 5 years points to that. Is that not the case?

Granted, I didn't go directly to the regulatory site, because who can sit down and analyze multiple legalese documents that have thousands of pages with cross-referencing requirements?


Here's a bunch of consumer laws that work:

- living in the UK, I barely ever receive spam calls or messages. I can be reasonably sure that companies don't sell my contacts to third parties, I can withdraw my consent to marketing communications and spam will stop, I did it multiple times. My American friends seem to have way more problems with that, to the extent of buying burner phones to buy insurance. Considering that the tech is exactly the same across the pond, the difference is entirely in the legislation and consumer protection.

- cars became much cleaner and more efficient over the last three decades thanks to the ever ratcheting Euro standards. I only need an old car passing by to be reminded of that, you can just smell the difference.

- my broadband connection has a minimum average speed guaranteed by law, which protects me from the line being oversubscribed. This actually works, and a friend of mine got a sizeable compensation for a period when they didn't get the full speed.

So consumer laws work, and saying that enforcement can't be done is a bit of a post-hoc rationalisation. It is true that GDPR can and should be enforced harsher, but it's just one example in a long and successful history of consumer protections.


I'll keep in mind points 1 and 3.

As for cars, how do we know that's true? There was Dieselgate, but from what I've heard they only got them because of whistleblowers.

Many VOCs which these laws are designed to reduce are odorless. The ones that are visible are larger particles, and generally less of an issue from an environmental perspective by most accounts.


You can literally smell it in the air; older cars don't have cats to burn everything uncombusted down to CO2+H2O. You can smell it with a modern car for the first few minutes while the cat is heating up. You can see it in car shapes: there's a reason why every modern car looks the same, since aerodynamics and pedestrian safety make car shapes converge. You can see it in the ubiquitous cans of AdBlue at petrol stations, which were not a thing just two decades ago (and still aren't in many developing countries).

Finally, you can see it numbers: https://www.asm-autos.co.uk/workspace/images/yearly-co2-emis...

There is no fundamental reason why all those changes had to happen, it wasn't the market driving them. It was the regulation.


> Might be my European outlook

How did the EU cookie laws and GDPR solve this problem? It's as widespread as before, except that now you are annoyed by prompts too.


I think it absolutely does work.

We need better regulation to temper capitalism.


That's very naive, and you need to educate yourself about what capitalism actually is because it certainly isn't what you are saying.

You've misused that term.


No. You're incorrect.

We need limits to prevent capitalism from doing its worst.

It's only fair that we all live and work with the same limits.

This is the type of regulation that is necessary.


Because like the climate crisis, it’s easier to make the individual clean up the mess and make the changes than hold large organisations accountable.


> Because like the climate crisis, it’s easier to blame the individual than clean up the mess and make the changes.

FTFY?


Laws only apply in some countries. The internet is global. Technical measures are faster, more effective, and can be applied in all places.


Laws can be as global as those in power want them to be. See e.g. copyright.


Because bad actors have an easy time on an actually global network. It's disturbingly hard to hold bad actors accountable, particularly if they have zero legal presence (e.g. a corporation's subsidiary) in one's jurisdiction.


Is it really that hard? I haven't seen anyone from the US actually attempt any accountability - zero punishments for spam callers, zero punishments for data collectors, not even a semblance of an attempt to punish data traffickers?


The thing is, there are so many layers upon layers between the end user and the bad actor that it's hard to pin down blame, and even if one succeeds to identify a bad actor, it's a shell company somewhere overseas and the money is long gone, moved off via a dozen other shell companies - and to make it worse, what may be a crime in the US/EU is perfectly legal in wherever these shell companies are set up.

The solution would be dedicated laws that hold the company at the top directly accountable for the actions of all sub-contractor layers, but these laws are rare and often hotly contested (e.g. with a German law mandating responsibility of the top-layer company for wage theft and other labor law violations [1]).

[1] https://www.ihk.de/regensburg/fachthemen/recht/arbeitsrecht/...


But we’re talking here about major corporations who would (largely) follow the law if there was a law with teeth commensurate with the potential rewards form abuse of privacy.


That law already exists. It's called the GDPR. That's what it's for, and what you’re giving permission for when clicking "accept all".


Look, forget about threat models. It's relatively trivial these days to avoid fingerprinting attacks if you want to (as a private, web browsing individual).

I use fingerprinting actively in enterprise apps as a form of silent 3FA. It's a useful backstop. If I have a user who forgot their password but retrieves it via email, I'll usually let them pass if their fingerprint matches one of their priors; otherwise my software shoots off an email to their immediate superior to make that manager validate that the machine the employee is using is one they can vouch for.

I've always viewed browser fingerprinting as something that can be leveraged as a security feature. It's far more useful for that than for some sort of distributed tracking. I'd never want to live in a world (ahem ... China) where submitting to such fingerprinting actively was mandatory, or politically punishable if you didn't. No society should be run like an employer/employee organization with that sort of lack of trust. No sane free person would allow their own browser to transmit a fingerprint. But for employer/employee systems management? It's a great tool in the box.
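The "silent 3FA" flow described above can be sketched roughly like this. This is a hedged illustration, not the commenter's actual system: the attribute names, the `known_fingerprints` store, and the escalation behavior are all invented.

```python
import hashlib

# Rough sketch of fingerprint-as-silent-3FA. All names here
# (fingerprint_hash, allow_silent_reset, known_fingerprints) are illustrative.

def fingerprint_hash(attributes: dict) -> str:
    """Collapse a set of browser attributes into one stable token."""
    canonical = "|".join(f"{k}={attributes[k]}" for k in sorted(attributes))
    return hashlib.sha256(canonical.encode()).hexdigest()

def allow_silent_reset(user: str, attributes: dict, known_fingerprints: dict) -> bool:
    """Let an email-based reset pass only if the device matches a stored prior."""
    return fingerprint_hash(attributes) in known_fingerprints.get(user, set())

# A returning device matches a stored prior; an unknown device would escalate
# (e.g. to the manager-approval email described above).
prior = {"ua": "Firefox/111", "screen": "1920x1080", "tz": "UTC+1"}
store = {"alice": {fingerprint_hash(prior)}}
new_device = {"ua": "Chrome/111", "screen": "2560x1440", "tz": "UTC+1"}
print(allow_silent_reset("alice", prior, store))       # True: silent pass
print(allow_silent_reset("alice", new_device, store))  # False: escalate
```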


"It's relatively trivial these days to avoid fingerprinting attacks". Why should it be on me to avoid them? And more importantly, it's NOT trivial.


really? it takes a minute to set up a VPN and do your web browsing through a virtual machine. I guess it's not "trivial" for the average American, but it definitely is for the average terrorist or child pornographer, so it's easy compared to surmounting most other threat models faced by people intending to evade detection. Therefore, "trivial".

[edit] also, the less trivial it is, the better for corporate security.


The article describes "Fingerprinting as a Service". Some choice quotes:

    It doesn’t matter if you are using a VPN or Private Browsing mode, they can accurately identify you.


    Also note that VPNs does not help with fingerprinting. They only masks IP address.


right. but using a VPN plus a fresh VM running Ubuntu can mostly do the trick. In a pinch, just keep a few different versions of various browsers around when you plan to surf a site that you don't want associated with you. Or change your screen resolution or turn off your fonts.

My point was that fingerprinting is much more practical and useful as a positive form of identity verification than it is as a tracking device, as long as it isn't (and hopefully never will be) mandatory to lock into browsers.


Your point might even be that "fingerprinting is much more practical and useful as a positive form of identity verification" but we all know how fingerprinting tech is and will be used: to track users even more and try sell even more crap to them because that's what almost the entire internet is all about.

And as for this

> using a VPN plus a fresh VM running Ubuntu can mostly do the trick. In a pinch, just keep a few different versions of various browsers around when you plan to surf a site that you don't want associated with you. Or change your screen resolution or turn off your fonts

How do you plan to do all that on your mobile device, for example? Fingerprinting is a problem exactly like invasive tracking is a problem.


mobile devices present a problem when using fingerprinting for 3FA, and require frequent human intervention. This is a good thing.

Fingerprinting is inherently opaque. That's why it's such a good third level security measure. It's a lot harder to spoof and, if someone tries, a lot easier to isolate the attempt.


And you count that as "trivial" for a regular user? 90% of users don't know the difference between a tab and a browser, and you think they would know how to set up a VPN, a VM, and whatever else to avoid getting tracked?


That's sort of their problem, isn't it? It's not as if those people are reading this and concerned about their privacy.

If they don't care about privacy, they don't deserve it.


one point is that I may not have any specific sites I care about disassociating myself from. I just don’t want an aggregate picture to be built and sold freely.

Cliché example: I want to be able to buy a pregnancy test online but don’t want that information shared and remarketed to me. There is plenty of stuff like this. The impact of each privacy violation is small and often boring, but in aggregate it is corrosive to public discourse and individual wellbeing.


Look... to this and other (sib) posts I have total sympathy, but much better tracking can be done with cookies and other forms of client side storage. Which the 90% of people do not notice, clear, or care about.

Fingerprinting is by definition a lot more imprecise and vague. It's always going to be an issue if surveillance networks use it to pick out individual users. Whining about that is useless. It's also a valuable security tool and part of the landscape. Do with it what you can.


"Oh look it's that one dude with that weird Ubuntu device coming from an AWS IP again."


wellllll.... using your own AWS IP would definitely be dumb


I think you should read more about what fingerprinting actually is.


yeah? I use about 24 different parameters and/or their lack of ability to i.d. a machine. Pretty sure I understand how to turn that into a set of tolerances that can be compared with another machine to provide a reasonable projection of whether those match with the people using them. I think I get the concept.
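A minimal sketch of what "a set of tolerances" could mean in practice. The parameter names and the 0.8 threshold are made up; the commenter's real 24-parameter scheme is certainly more elaborate.

```python
# Sketch of tolerance-based matching: compare two attribute sets and
# accept if enough parameters agree. Parameter names and the 0.8
# threshold are invented for illustration.

def match_score(fp_a: dict, fp_b: dict) -> float:
    """Fraction of attributes on which the two fingerprints agree."""
    keys = set(fp_a) | set(fp_b)
    return sum(fp_a.get(k) == fp_b.get(k) for k in keys) / len(keys)

def same_machine(fp_a: dict, fp_b: dict, threshold: float = 0.8) -> bool:
    return match_score(fp_a, fp_b) >= threshold

a = {"ua": "Firefox/111", "screen": "1920x1080", "fonts": 143, "cores": 8, "tz": "UTC+1"}
b = dict(a, ua="Firefox/112")  # browser updated, everything else stable
c = {"ua": "Safari/16", "screen": "1170x2532", "fonts": 61, "cores": 6, "tz": "UTC-5"}
print(same_machine(a, b))  # True: 4 of 5 attributes still agree
print(same_machine(a, c))  # False: a different machine entirely
```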


I'm afraid your view is how the journey to the "world where submitting to such fingerprinting actively was mandatory" starts. Something about frogs in very warm water.


I upvoted this because it's the only smart comment to my post here. This is the ultimate concern.

That said, fingerprinting is only useful as a third security measure because most people don't understand its mechanics. The mechanics of avoiding being tracked are pretty basic. If our country required browsers or computers to transmit their fingerprint, people would find ways around it and it would stop being useful as a security metric.

Put another way, the moment this becomes a feature of an oppressive regime, it's one of the easiest things to work around. The obscurity is what makes it remain somewhat useful.


(1) Users should not receive passwords via e-mail. (2) How very enterprisey of you to even be able to send passwords, which one also should not be able to do. (3) Users can change or modify their browser, either to another browser entirely or through installation of addons. The fingerprint is not guaranteed at all to stay the same or similar.


This is an uncharitable reading of the comment. "Retrieve via email" can just as well be understood as reset using an email flow, as is common on most websites. And the comment does not claim they rely on fingerprints never changing, they say that if you do have a matching fingerprint, you can use that instead of another procedure.


(1) There is nothing wrong with sending a password via email. Even if you send a reset link instead, an email provider could steal that too.

(2) The server gets sent your password every time you log in. You shouldn't rely on a server operator not knowing your password.

(3) You can tune how sensitive the system is in response to changes in the fingerprint. Even if there is a failure to match, that just means authentication will be extra strict.


(1) Best would be a one-time-use link though, so that a user can detect whether the link was stolen from their inbox. I think you also did not get my point: the service should not know the password at all, usually not even initial passwords for any account. It is simply bad practice to ever have knowledge of user passwords, except as a salted hash. So I say you are wrong.

(2) The server gets sent the password via the default communication channel, the browser (TLS, hopefully), not via e-mail, possibly into an inbox that is third-party controlled (say, a Google Mail inbox for example).

(3) That does not make it right. Did they even ask the users for their consent? Did they learn anything from the GDPR or from discussions about consent in general? Or are they just a higher being, allowed to decide for their users what data about them gets tracked?

Many things can be done using technology. The question is always: Should we do them? That is a question about ethics, not technological possibility. We already have far too many businesses not caring about ethics at all, we do not need any additional ones.


(1) How can the user detect it? The service can request a password reset at any time. Most alerts go through emails, which the provider can hide. It's only bad practice because password reuse exists and people trust services not to exploit that fact.

(2) That is how it usually works.

(3) You can collect the information for security purposes just fine under the GDPR.

Providing a better user experience while maintaining a similar amount of security is a net positive.


(1) A tool like for example https://github.com/pglombardo/PasswordPusher self-hosted offers a way for the user to detect, whether their password has been seen before.

(2) Are you missing the point? "That is how it usually works." -- So why then send a password to an e-mail inbox, which, like I said, is a location often controlled by a third party, and often one with no good record of respecting privacy, if you can completely avoid that?

(3) OK, seems like we did not learn about consent. Why don't you ask your users whether they are OK with it first, instead of assuming and basing it on what is legally possible? Is ethics something too far out of reach?

Lastly, a word about what you call security: your so-called security is observed often enough to result in inaccessible accounts. "Extra strict" usually means something along the lines of "oh, now I am going to require your phone number, to send you a message on a second channel to make sure", or "solve these captchas for this untrustworthy third-party provider and I will trust their word about you having solved them correctly" (again being tracked, of course...), or similar things. Again circumventing consent, because now it becomes extortion, extracting more personal data so that the user can access their account. Your so-called security makes for a really shitty user experience and punishes the user for ever switching their browser.

So what does your "extra strict" mode entail? How are you going to be "extra strict" without any extortion? Are you implementing your own captchas by any chance? Or something similar?


You seem to have a conflict of interest here. How can you accept this for employer/employee but at the same time say it's not OK for a person to submit to fingerprinting? Employees are also persons.


Because the software is only allowed to be used on company computers and a few personal devices which have to be approved by upper management. It isn't fingerprinting the person or the public. It's checking that the software is running on a known/approved machine.


This technology could be used outside the company's garden one day. The current employer/employee picture could be seen as a miniature of society; e.g. the Great Firewall is only used on Chinese citizens and is controlled by the government of the People's Republic of China.

But I probably interpret too far here and it seems that in some industries e.g. secret agencies need to use unconventional methods.


Dunning and Kruger agree.


Using the IP address & user agent alone already gives you nearly 100 % accuracy, so the fact that they can re-identify you when these things stay identical isn't surprising at all. I tested that website as well and if you take care to rotate your IP address their re-identification rate becomes abysmal, especially if you're using a privacy-focused browser and extensions like Privacy Badger / uBlock.


And if we ever migrate to ipv6, the IP alone may be all anyone needs to fingerprint you.


Every device implements privacy extensions, which change the address every 24 hours. It's no longer based on the MAC and hasn't been for a long time.


Exactly. IP address identification is the elephant in the room that the article just briefly mentions. Nearly all websites that want to target ads at you use it. It's just so simple to use, and you can't switch it off like you can with cookies, except of course by using a VPN, which almost nobody does.


I often see the narrative on here that consumer VPN providers are almost useless for privacy due to other fingerprinting methods, which I've never really bought.


I just tested at fingerprint.com using mullvad.

Brave browser, no VPN, they recorded one visit, one IP.

Brave browser, no VPN, incognito, they recorded two visits, one IP.

Brave browser, with VPN, incognito, recorded three visits, two IPs.

I'm pretty impressed / surprised. A fresh incognito session, through a VPN, still matched the same fingerprint. Especially surprising since TFA indicates Brave randomizes the fingerprint. I even changed my fingerprint block setting to "strict, may break sites" and it's still recording the same visitor ID from Brave, even with incognito.


Just tried this with VPN + firefox resist fingerprinting. I cleared cookies and session data and reloaded, it recorded 1 visit 1 ip for the first (obviously) and second tries. I did not change my VPN connection between attempts.

Based on this test I'm surprised Brave's fingerprint resister did not work for you. But on firefox the enhanced privacy protection (strict) and the resistfingerprinting option are two different knobs.


I use the usual adblocker UBlock and:

* https://addons.mozilla.org/de/firefox/addon/canvasblocker/

which prevents fingerprinting via Canvas elements, additionally warns you if a site does it. There are more sites out there than you would assume. Some stupid blogs even.

* https://addons.mozilla.org/en-US/firefox/addon/multi-account...

This splits your tabs into different categories, each with their own cookie storage.

The fingerprinting website in the article didn't manage to correlate me visiting the website concurrently from two distinct container tabs.


The fingerprinting website in the article didn't manage to correlate me visiting the website concurrently from two distinct container tabs.

But that's merely because of the canvasblocker (or something else you have), because just separate containers doesn't cut it?


You can try https://www.amiunique.org/fp to get a view of all params can used to track you


It's interesting that they can narrow me down to less than 0.1% with just my language list (en-US,en,fr,ro). My user agent is practically unique as well, since I'm running an unusual configuration. I've never thought of that as a disadvantage when it comes to tracking, hah.


I observed this too, but I cannot really believe it. For me it finds just German on the iPhone. I get 0.88% for it. But if all Apple devices do it the same, I can hardly believe this already provides such selectivity. The problem with such test sites seems to me that only nerds visit them, and therefore the database is small and biased.


I have "prefer English, German as fallback". That alone makes me almost unique as well. Not fully (like your special config :D), but enough that other resist fingerprinting options become meaningless.


They narrowed me down to an order of magnitude less based on just my browser user agent (latest Firefox Android). I'm not sure what that actually means.


I'm guessing it means people don't use Firefox, and people _really_ don't use Firefox android.


Nope their database is 40% Firefox.


Almost no one visits that site so their data set is very small


I like this site for the info it provides on how tracking is done, but the data set it generates uniqueness from is really tiny and differs a lot from real-world browser makeup.

For instance it claims iOS is 4.63% of users and Safari is 3.42%, when all other, more complete statistical sources put those numbers closer to 20%-30%.


Ironically, that site has a cookie banner. This made me confused as to whether I should accept cookies or not.


so in order to stay anonymous, one can clear these parameters; alternatively, one can generate different parameters for every HTTP call.


No, any session-based protocol (HTTPS) would expect certain characteristics to stay the same within a session.

If it changed with every call they'd just block you as a bot.


I wish browsers did more to combat this. There should be ways to randomize or normalize every bit of information they try to gather.


It’s a double edged sword you need to walk the edge of. Almost everything they use to fingerprint you has a fully legitimate use case which is why it was added.

The more you do to prevent fingerprinting the more you hobble the web as a platform. A lot of restrictions that got placed on the canvas tag to help prevent fingerprinting for instance really limited its functionality.

In my opinion a workable solution would be to make more of these things opt-in, with the end user granting the page access to high-accuracy data.


Most of those APIs should be default closed. Incognito should definitely be default closed.


But it's not just a matter of "open"/"close". It's more like signal/noise. Much of the signal is legit: source IP is needed to deliver response, screen resolution, audio/video codec support, transfer protocol, cache headers are all needed to render the page correctly and as quick as possible.

Unfortunately, much of that signal persists across sessions as well as websites and can therefore be aggregated into a hash that works as a "super cookie". The signal is based on the device, the connection, not so much the HTTP/HTML you're looking at.

The best mitigation approach is therefore adding noise: add random gibberish to the User-Agent, tunnel IP through VPN/NAT, lie about codecs or screen resolution.

While that degrades the user experience, it gives no guarantee of actually preventing fingerprinting. So the good news is that fingerprinting is hard too, and doesn't work as well as is usually claimed!


Randomization works if you opt in everyone without their consent. If your addon or minority browser randomizes data you're adding a signal.


Yes, but that's a poor signal. If only two users add "enough" noise to their signal, fingerprinting will only be able to prove that a user added noise, but not which user did so. For a single site doing the fingerprinting, that is.

Compare that to tracking users across multiple sites for proper signal without randomization.


Yeah but if it's opt-in for privacy concerned users there may well be two users in the world with identical basic metadata (browser version, platform, etc) who have this enabled. And telling you it was one of two users but not which is pretty shite anonymization.

Regardless it's still adding an extra bit of information leaked, so you may as well forge a common value rather than make something new up.


Which then pushes a lot of web use-cases into mobile apps locked to a few corporate platforms that make tracking much much easier. Yes, even iOS.


Which is why we have regulations to force these platforms to open up.

Asking for users permission should and is slowly becoming the default on phones as well.


Well, we could fingerprint the fingerprint detection code ...


uBlock Origin in default deny of 3p scripts basically achieves this already.


UBlock (and even AdGuard) is not preventing this website from accurately fingerprinting me.


If Javascript is enabled there’s ultimately very little that can be done to prevent fingerprinting. If you don’t want to be fingerprinted then only allowing JS to run on allowlisted websites is the only way to truly be safe


Well, and stuff like the resistFingerprinting=True option in Firefox. As described in the article. You can make your browser to just lie to the JS API.

There is a price, of course. Lying about screen resolution might mess up how the website looks. Lying about which fonts are installed might make the site a bit uglier.


As someone said already, 'resistFingerprinting' option should be configurable per-domain. Then we could have it enabled (randomized) for most of the web and disable it (allow fingerprinting) for payment processors and similar 'trusted' websites.


EDIT: there actually appears to be a hidden per-domain whitelist privacy.resistFingerprinting.exemptedDomains


If you don't pay attention to it you might be surprised how non dynamic your residential internet last mile DHCP assigned IP really is. It's not uncommon to go many months or a year with having it always renew to the same address. That, combined with all the fingerprinting mentioned in the article...


This would be one of the things about IPv6, we'd have lifetime fixed IP addresses (or address ranges at least). Wouldn't we?


There's "Privacy Extension" for that, from https://labs.ripe.net/author/johanna_ullrich/ipv6-addresses-...

> The IPv6 Privacy Extension is defined in RFC 4941. It is a format defining temporary addresses that change in regular time intervals; successive addresses appear unrelated to each other for outsiders and are a means of protection against address correlation. Their regular change is independent from the network prefix; this way, they protect against tracking of movement as well as against temporal correlation.


Thanks, can you answer a couple of questions:

So carriers (ISPs) would still need to do NAT to provide that? The RFC didn't seem explicit (I skimmed).

Isn't the removal of traffic processing a large part of the sell for IPv6?

Also, surely the ISP can sell IP-to-user correlation lists, as I assume they do now? They can presumably do it anonymously, but with some other party holding the other part of the data that allows deobfuscation of users (e.g. to comply with GDPR)?


The way it works is that the ISP assigns the home user's router a prefix (e.g. 64 bits). Devices on the home network pick a random address within that prefix, and regenerate it periodically, keeping the old address alive for a while too.

Only the router needs an IPv6-to-MAC-address map (it always needed that, this was no different with IPv4). The ISP just has a static route that sends all traffic matching the prefix to the router.

With this you can still easily recognize households by IPv6 prefix, but at least you cannot reliably distinguish devices within that household.
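The scheme above can be illustrated with Python's `ipaddress` module. The /64 size matches the explanation; the prefix itself is just the IPv6 documentation range, used here as an example.

```python
import ipaddress
import secrets

# Illustration: the ISP delegates a /64 prefix to the household; each
# device picks a random 64-bit interface identifier inside it, much like
# RFC 4941 temporary addresses. The prefix is the documentation range.
prefix = ipaddress.IPv6Network("2001:db8:1234:5678::/64")

def temporary_address(net: ipaddress.IPv6Network) -> ipaddress.IPv6Address:
    """Pick a random address inside the delegated prefix."""
    return net[secrets.randbits(128 - net.prefixlen)]

addr = temporary_address(prefix)
print(addr in prefix)  # True: still routable to the household's /64
# Successive addresses look unrelated to outsiders, but they all share
# the household prefix, so the household itself remains recognizable.
```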


Hmm, I feel like I should have known this! Nice clear explanation.


No ISPs don't need to do NAT but you can if you like. You also don't have to do NAT with IPv4 if you only have one device or get a subnet from your ISP. It's just done because we don't have enough v4 addresses.

They can do subnet to customer correlation. The IPv6 is randomly generated by your device if you use SLAAC. But if your ISP is an adversary you have pretty much lost anyway. If they provide you with a router they can see all devices in your network (MAC and hostname) and they could also map certain devices to certain port ranges and sell that too.


I'm sorry, you've misconstrued the question. The context is the privacy extension to IPv6 under RFC 4941. So my question was: would ISPs need to do NAT in order to provide that extension? I only skimmed the RFC, but there was no other obvious way to me for it to be provided that wouldn't fall to an adversarial ISP, because it appears they must do NAT to make that work.

AIUI ISPs provide a fixed prefix to customers. So I'd need to look at how SLAAC would work if it uses a random IPv6 address; surely your ISP is only allowed to use the limited set of numbers allocated to them by IANA or whoever.


No, they don't need NAT; that is simply called routing. The ISP sends every packet that is in your assigned /64 to your router's IP address. It's called prefix delegation [0].

Yes, they get a /32 by default (at least in RIPE); larger allocations need justification. But there are 2^32 /64 subnets in a /32, so every ISP gets a complete IPv4 internet's worth of /64s they can assign to their customers at will. Your device assigns itself a random IP address from the /64 network your ISP gave you via prefix delegation.

[0] https://en.wikipedia.org/wiki/Prefix_delegation


DHCP and NAT are perfectly compatible with IPv6.


Yes, but will ISPs provide it? I thought the removal of these was one of the selling points?


Then IPv6 adoption must be frustrated at all costs, in the name of liberty ...of course.


Yes, the IP and the user agent string appended and CRC’d is a good enough fingerprint for basic web analytics, like for identifying returning customers.

The false positive and negative rates are reasonable, and false positives (new customer seen as returning) could be further reduced by browser feature testing.
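For what it's worth, the "appended and CRC'd" scheme above is about two lines of code. A sketch; the IPs and user agent below are example values.

```python
import zlib

# Sketch of the basic-analytics fingerprint described above: concatenate
# IP and User-Agent, then CRC32 the result into a compact visitor ID.
def visitor_id(ip: str, user_agent: str) -> int:
    return zlib.crc32(f"{ip}|{user_agent}".encode())

ua = "Mozilla/5.0 (X11; Linux x86_64) Firefox/111.0"
a = visitor_id("203.0.113.7", ua)
b = visitor_id("203.0.113.7", ua)  # same visitor returning
c = visitor_id("203.0.113.8", ua)  # neighbor on the next IP
print(a == b)  # True: recognized as a returning customer
print(a == c)  # False: one changed byte already breaks the match
```

CRC32 is fine here precisely because this is analytics, not security: collisions only show up as the false positives mentioned above.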


Indeed. The IP looks like a strong signal in the overall game of fingerprinting.


Apple added iCloud Private Relay for that.


It is interesting that the site can easily fingerprint individual profiles/directories:

For example

chromium-browser --user-data-dir=/tmp/profile_A

chromium-browser --user-data-dir=/tmp/profile_A --incognito

chromium-browser --user-data-dir=/tmp/profile_B

chromium-browser --user-data-dir=/tmp/profile_B --incognito

For each command + its incognito it can detect them as separate profiles.

For ultimate privacy one needs to launch the browser with a new profile every time.


... on a new computer, each time ordered from a different brand and reseller, paid with a unique type of cryptocurrency and delivered each time to a new dead drop in a different country.


I tried a live boot of Ubuntu. Every time it can detect me accurately. Looks like the whole privacy thing is OVER, unless lawmakers do something (i.e. not going to happen!).

At least they could use this to replace reCaptcha, and make passwords disappear!


Ubuntu has a lot of unique information that is readily accessible.

Machine-ID in /etc being one, but there are various other items that can be used the same way, from D-Bus activation and something like 20 other places, plus a large number in snap.


Websites can access machine-id?


I've heard from people I know to be scarily skilled in that area that it's possible through the D-Bus interface. Mind you, this was years ago.


I guess they can't, unless somebody had a great idea in the specification of some web API...


That there are OS-level identifiers is, I think, a different discussion. I wonder why these were cited in the context of the fingerprint.com discussion.


How does the fingerprinting know the payment method you used to pay for the computer, is that stored somewhere in the operating system? How would they know it was a dead drop also? Genuinely curious.


It's just a precaution for when they eventually breach your OS and dig out the machine's serial number from the BIOS. This will allow them to trace you to the reseller you used. But if you ordered to a dead drop in a random country and paid with a different cryptocurrency network each time THEY gain exactly zero information to profile you.

That, or the game against pervasive web tracking is lost.


This is actually very useful information. I think I know how I’m going to buy my next computer, but I wonder which manufacturers support the drop as a shipping option?


It was a joke lmao


Someone not getting a joke is funnier to you than the joke itself.


I was also joking, which makes this thread even more funny.


delicious.


Do these profiles clear their cookies after each request? I assume if the service finds a matching cookie, it will prefer it, or at least use it as an extra identifier.


Technically one can create this and launch a new profile every time. It can still detect the device (with some failures if I change the screen resolution/DPI). Maybe after 3 or 4 times, the server may also detect that a certain IP address is trying the same thing.

TEMP_DIR=$(mktemp -d /tmp/chromium.XXXXXXX) ; /usr/bin/chromium-browser --user-data-dir=$TEMP_DIR

In the end, as others say, they use hardware information + IP + other stuff. It is a lost battle.


But how could it distinguish different profile directories if they use the same settings? I would assume profile IDs, directories, and the like should not be exposed through the browser. I am not used to chromium-browser (is this Chrome? forgive my incompetence), but I wonder what kind of profile-specific static identifiers besides cookies could leak out of the browser?

Maybe these? https://browserleaks.com/webrtc But at least FF in private mode should randomize these IDs on restart.


Is this _only_ fingerprinting then? If the profiles are different, do they manage to extract some UID from the profile (which I would assume is a bug in the browser), or do they store data client-side using persistent storage APIs?


Chrome does give access to localStorage/sessionStorage in Incognito and this can be used to communicate between tabs on the same domain, but just like cookies and cache this data is wiped if you close the Incognito instance.

It's certainly a mystery, because you'd expect any capability fingerprinting (some combo of UA, extensions, CPU/GPU specs, IP etc) to give an identical result between profiles, so it does seem there's some per-profile difference. But I can't think of any browser API that exposes something like an ID...


Then couldn't we get a trace of the properties it uploads to the server by analyzing what is executed in the JavaScript? Surely it has some sort of submit endpoint where it sends the individual values.


POST https://fpa.fingerprint.com/?ci=js/3.8.10&ii=fingerprintjs-p...

It looks like it is using heavy obfuscation.


Scrolling a bit through the mess, it seems it is, for example, trying to detect the ad-blockers in use.

.... adGuardGerman:[u("LmJhbm5lcml0ZW13ZXJidW5nX2hlYWRfMQ==") ....

I see things that look like font fingerprinting, CSS, Apple Pay detection, ..., msPointerEnabled, ..., webkitResolveLocalFileSystemURL, ..., cookie settings, ..., the mathematical library used (sine, cosine, ...), service workers, ..., RTCPeerConnection, hardwareConcurrency,

Maybe we could dissect it and analyze the full list?

In some other place, they documented that you can e.g. get the light/dark theme information out of CSS. It doesn't even need JS to do it.


Did you check amiunique.org as well with these?


Even after discovering it is worse than they thought, it remains far worse than the author thinks.

Public knowledge is far behind the actual capabilities in practice.


This is easy to say, but not always true. Can you elaborate about concrete details of the "capabilities in practice"?


I can but frankly I'm pretty happy about the lack of lawsuits in my life currently.


I don't understand the test on this page. It says we should be worried because a fingerprinting website generates the same hash even after you clear your cache and site-data, and even if you go into a private tab. But I'm not overly concerned by this, provided I share that hash with other people.

The worry would be that the hash is unique to me (i.e. a fingerprint), but I don't see the evidence that it is.


Unfortunately the many, many browser capabilities have given adtech enough entropy to actually create globally unique fingerprints. You can lookup yours with an estimation of uniqueness here: https://amiunique.org/fp
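How those many capabilities add up to a globally unique fingerprint is just entropy arithmetic. A back-of-the-envelope sketch, with invented attribute frequencies (sites like amiunique.org publish real ones):

```python
import math

# Made-up fraction of users sharing each attribute value (illustrative only).
attribute_share = {
    "user_agent": 0.05,        # 1 in 20 users send this exact UA string
    "screen_resolution": 0.10,
    "timezone": 0.20,
    "installed_fonts": 0.01,
    "canvas_hash": 0.001,
}

# Assuming the attributes are independent (optimistic, but illustrative),
# the surprisal in bits simply adds up across attributes.
bits = sum(-math.log2(p) for p in attribute_share.values())
print(f"combined entropy: {bits:.1f} bits")
print(f"narrows you down to roughly one in {2**bits:,.0f} people")
```

About 33 bits suffice to single out one person among 8 billion, and each extra attribute a script probes adds a few more bits.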


The likelihood that you have the same hash as other people is exceedingly small.

So if I fingerprint you on a site which is using my commercial fingerprint service, then I can sell your hash to other places and tell them all about your browsing habits. The more places run my fingerprinting service, the more data I can collect on you.


I understand the principle. I'm saying that the test on this page isn't demonstrating uniqueness, and so isn't demonstrating fingerprinting.

The first time I heard about fingerprinting was with EFF's Panopticlick, which stated how many hashes had been generated from visitors, and how many you shared with them.


I agree with you about uniqueness, but being unique doesn't matter with respect to their claims.

Any educated person with sufficient math knows the mathematical structure of a hash will never be unique. It's a Galois field, or 'finite field', after all.

The core of this issue is the flawed but convincing belief, promoted by an entire industry, that if the probability is sufficiently low, it's unique, and, following these axioms, that if it's unique it's an individual person (eyeball).

Under that assumption, all you need to do is collect fields of information that are variable and group them together such that they yield a sufficiently low probability; I think currently that threshold is about 1 per million. It's a very clever way to defraud advertisers if you think about it. You create an exaggerated market, and charge for each advertisement view.
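A toy uniform model puts that 1-per-million threshold in perspective at web scale (the bucket count and user counts below are round illustrative numbers):

```python
# If a fingerprint has 1-in-a-million discriminating power and fingerprints
# were spread uniformly, how many people would share each "unique" one?
buckets = 1_000_000
for users in (1_000_000, 100_000_000, 5_000_000_000):
    per_bucket = users / buckets
    print(f"{users:>13,} users -> ~{per_bucket:,.0f} people per fingerprint")
```

At a billion-plus users, "1 in a million" is thousands of people per bucket, not an individual eyeball.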

In my opinion it's just flawed thinking, but there are some real fanatics out there who subscribe to this dogmatically.

For example, applied probability is used as part of protocol design when accounting for binary erasure channels in things like cell phones. You shouldn't be able to have communications blocked in only one direction, or altered without it being noticeable, but stingrays may have the ability to do this according to the limited documentation that has been released so far.

Probabilities in general have real problems with validity at a fundamental level. I think the most common approaches today are the axiomatic approach and the frequentist approach; both have significant limitations and often break down when self-reference is indirectly introduced.

So I guess I'll ask, are you a believer?


It's enough to narrow you down to a specific bucket. E.g. "affluent white young male in his 30s in a specific neighbourhood" and serve you ads and news. Collate with a few other sites (even airline checkouts and boarding pages have tracking), and you have a close enough match.

The worst part of this? Trying to hide from fingerprinting makes your fingerprint more unique.


> Trying to hide from fingerprinting makes your fingerprint more unique

Didn't seem so in the experiment in the article.

Sure they'll be able to place you in the bucket "tor user", but is that really more narrow than what you'd get without Tor?


Probably. Tor is not something most people use unless they have something to hide.

No, I don’t care about the one time you downloaded a gentoo iso over tor.


Using Tor might stop them from tracking you in a unique way, but they can for sure put you in the basket of the 0.1% of Tor users.


You can easily do this by looking at the IP address. All exit node IPs are public, you don't need browser fingerprinting for that.


Here is evidence that the hash can be unique, or can narrow you down to a small group of people:

https://coveryourtracks.eff.org/


This is a nice website.


Sure, the advertisement graph only showed recall and not precision. Maybe everyone gets the same hash! That'd explain their excellent results.

However, I doubt that's a problem in practice. I'd assume these fingerprinters know what they're doing. It certainly seems so.

How could one run an experiment collecting lots of these fingerprints and determine the false positive rate?


The site they use gives you a visit counter. So, if that matches what you did, you didn't clash with any other user's hash.


I don't think this is a proper way to test it.

It matters more how unique your fingerprint is than how consistent or reproducible it is. Just testing if you get the same fingerprint back on your second visit doesn't tell you much if you don't know how many people "share" your fingerprint.

As a silly example, if you gave all users the same fingerprint, it would be very consistent but also useless as a tracking method.


On the demo they have previous visits from your fingerprint listed (so you can get an idea of how common the fingerprint is... at least once enough people have tried the demo). Mine had two visits listed which are not mine when using Safari and private relay. On Firefox there were none (and it tracked in normal and private mode).


On my iPhone, I was surprised to see I had made 20+ visits when in reality that was my first visit to fingerprint.com. It makes me feel better that my fingerprint isn't actually unique!


Can we fingerprint fingerprinting code and block it? At first glance, code accessing all kinds of unrelated high-entropy APIs seems like something detectable. But static analysis might be too hard in the face of obfuscation, so it would have to be done with dynamic analysis, which kind of means you let the fingerprinting happen but are now at least aware of it. So how do you prevent the fingerprint from being used? In principle one could mark values from entropy sources as tainted [1] and taint all the variables potentially influenced by those values, preventing them from leaving the browser. Not sure if this would be practical, and I am even more skeptical that this could be easily added to existing browsers.

[1] https://en.wikipedia.org/wiki/Taint_checking
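The taint idea can be sketched in a few lines. This is a toy model, not anything a browser actually implements; the `Tainted` wrapper, `send_to_network`, and the `navigator.hardwareConcurrency` label are all just illustrative:

```python
class Tainted:
    """A value read from a high-entropy source, carrying a taint label."""

    def __init__(self, value, source):
        self.value = value
        self.source = source

    def __add__(self, other):
        # Anything derived from a tainted value stays tainted.
        v = other.value if isinstance(other, Tainted) else other
        return Tainted(self.value + v, self.source)


def send_to_network(payload):
    # A taint-aware browser would refuse to ship tainted data off the machine.
    if isinstance(payload, Tainted):
        raise PermissionError(f"blocked: payload derived from {payload.source}")
    return "sent"


cores = Tainted(8, "navigator.hardwareConcurrency")
derived = cores + 1000              # taint propagates through the computation

print(send_to_network("harmless"))  # ordinary data goes through
try:
    send_to_network(derived)
except PermissionError as e:
    print(e)                        # the derived value is blocked at the edge
```

The hard part in a real engine is exactly what the comment suspects: taint has to survive every operation, including string formatting, DOM round-trips, and timing side channels.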


The short answer is no, you can't block it, because blocking identifies to the site owner that you are blocking it (as a negative match).

You'd have to mask every informational API with a plausible, suitably corrupted alternative.
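A sketch of what such a "plausible corrupted alternative" could look like. The API names and value pools are invented for illustration; the one real design point is seeding per session, so the lie stays self-consistent while a page is open (inconsistent answers are themselves a red flag) but rotates between sessions:

```python
import random

# Illustrative pools of values that real browsers commonly report.
PLAUSIBLE = {
    "hardwareConcurrency": [4, 8, 12, 16],
    "deviceMemory": [4, 8],
    "timezoneOffset": [-480, -300, 0, 60],
}

def spoofed_profile(session_id):
    """Fake but plausible answers, stable within one browsing session."""
    rng = random.Random(session_id)  # same session -> same fake answers
    return {api: rng.choice(values) for api, values in PLAUSIBLE.items()}

print(spoofed_profile("session-1"))
print(spoofed_profile("session-1") == spoofed_profile("session-1"))  # stable
print(spoofed_profile("session-2"))  # new session, likely different answers
```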


This conflates whether it can be blocked with whether blocking would be effective. Every time you do something unique, you of course become identifiable. But the idea is that you are not the only one blocking these scripts, which makes only the group as a whole identifiable. You could, for example, block those scripts with a widely used ad blocker, which would make you stand out no more than any other user of that ad blocker. It would probably not be too effective, as the URIs and file hashes ad blockers match on are relatively easy to change, but in principle you do not have to become uniquely identifiable by blocking fingerprinting scripts if enough people do so.


> This is confusing whether it can be blocked and whether it would be effective.

It shouldn't be confusing, because it's really fairly simple.

The gist is this... so long as determinism holds as a system property, you can leak information through the absence of something when compared to another expected thing. This is how inference works in many respects: you have properties, and from whether those are present you can deduce or infer additional information that is not explicitly given.

Most gifted problem solvers probably couldn't tell you that is what they are doing, because it's unconscious, often a result of years of observation.

Computers fundamentally require certain system properties, such as determinism, to do work in the first place, and you can inject non-determinism into those processes in ways that won't break the underlying subsystems, as long as it stays within an expected range; that can make the result indeterminate. An indeterminate fingerprint is useless.

In the case of outright blocking the code from running, you leak the information that you are blocking it, since they expect a range of values back and a null (the response when nothing gets sent back) is itself a value (or state, if you want to be technical).

The site then only has to test for this semi-unique case (i.e. a null represents the single group of people who are blocking it) and then prevent the site from responding to you. They are the gatekeepers because they control their infrastructure.

Incidentally, this is how almost all the are-you-human tests work. They collect an intrusive fingerprint, and if it's within a range that corresponds to a known spectrum of user values, they allow you to continue to the site.

This is a rough boiled down explanation, it can get quite a bit more abstract and technical when talking about whether determinism is present (i.e. how do you test for it).

Ultimately, if you can understand determinism, you fundamentally understand the limits of computation and how computers work at a barebones level. It also gives you the ability to find whether certain types of problems are impossible (and thus you don't waste your time on them, or unproductive avenues).


I meant that you are conflating whether such scripts can be blocked with whether blocking would be effective. That they cannot be blocked would require some fundamental or practical obstruction, like having to solve the halting problem or whatnot. You said they cannot be blocked because the act of blocking itself makes you identifiable, which defeats the purpose, but that is not true, or only true if just you or a few people block the fingerprinting. When enough people block it, the act of blocking does not make you identifiable, or at least to a much smaller extent than not blocking the scripts would.

In principle you could build a browser that, to a good approximation (you will realistically not be able to prevent timing leaks, for example), does not leak any information about the system it is running on; just isolate it from the host system. With more effort you could let details about the host system leak into the browser but not out onto the network. Well, some information has to flow from the host system through the browser and out over the network, for example keystrokes and mouse movements, otherwise you could not do much with this browser.

So you could still try to fingerprint users by their choice of words, typing patterns and things like that, maybe made a bit harder by filtering out high frequency timing information. But at least one could no longer simply fingerprint the host system. Which would again be a trade-off, your screen resolution is not only good for a few bits of a fingerprint but can also be legitimately used to serve content at a proper resolution.


The practical obstruction is that if you block the browser API access, they test for it and then block you so you can't view their site, and they do that by using the information that you leaked.

As for the effectiveness of blocking what they are doing, you can defeat them by giving them bogus but plausible data. There's a range of accepted values for those interfaces. If it's within that range, they can't tell the difference; their entire assumption is that the result is determinate, when it can never be, because a hash is a finite field.

Their assumption is based on applied probability which has validity issues in an adversarial environment.

Neither requires solving the halting problem.


EFF has an excellent tool to check your browser's fingerprint uniqueness:-

https://coveryourtracks.eff.org/

I use a lot of browser extensions. Unfortunately, this makes my browser easily identifiable.


When I switched off fingerprinting in this browser, the font size here on Hacker News changed. I suppose it just uses the user agent to set a certain font size, or does Hacker News track based on fingerprinting?


The CSS for HN is very terse, and aside from a mobile-specific set of rules it doesn't really do any variation.

Is it possible you had set the zoom level previously, which the browser remembers between sessions, and turning off the tracking reset the zoom to 100%? Do you have any extensions like Greasemonkey or Stylus for per-site customisation?


You mean `privacy.resistFingerprinting` in Firefox? I guess that disables custom font configurations.


This author doesn't seem to know what they're talking about. Just because the service generated the same ID for their Chromium sessions doesn't mean that applies to all users' Chromium sessions. Chromium just exposes more of the machine. My guess is that they, writing a computer-technical blog post, have a particularly unique machine. Even having 16 GB of RAM separates you from the masses and might make you unique depending on the graphics card etc.

The fingerprinting discussion is relatively new. The first research paper’s author is only 35 or so. (Its title is Cookie Monster.) The discussion is also a little amusing on a site like Hacker News. A perfect example of someone who’s easy to fingerprint is someone who built their own computer (likely to be found on HN). On the opposite end of the spectrum, Safari iPhone users with the same model are impossible to distinguish.

There’s a paper out there where the researchers worked with a public entity’s website to get more accurate fingerprinting data. There are very few unique fingerprints in reality and therefore no reason for any company to track them. This tech probably won’t ever identify users uniquely.

There are actually some positive aspects of fingerprinting. Tor leaves a very obvious fingerprint, and it’s easy for banks to detect its use by criminals.


I wonder if https://jshelter.org helps with that. And if it's not too slow.


Mull (an FF fork on Android) and LibreWolf (an FF fork on desktop) have privacy.resistFingerprinting = true by default. Highly recommended!

https://f-droid.org/en/packages/us.spotco.fennec_dos/

https://librewolf.net/


Do they remember site zoom levels?


I have a sort of love-hate relationship with this stuff. On the one hand, yes, tracking me is bad if I am not aware of it; but on the other hand, I work for a company that uses its expert knowledge to help consumers purchase the right tools for them. Ideally we would like the end product to reward us for putting them in touch with the right customer that we've used our name to help land. Much like a hairdresser would recommend a certain brand of hairspray, or a mechanic who carries their preferred oil, there is always a need for a middleman: 'tell Bob I sent ya!'. Obviously this is an exception to the large majority of what tracking is currently in place for, but until we drop the whole 'tracking is bad, we should just shut it all down' and start to think of a fair and reasonable way for users to say 'I am ok with company B knowing that I have a relationship with company A', these increasingly nefarious tracking efforts will happen.


> until we drop the whole 'tracking is bad we should just shut it all down', and start to think of a fair and reasonable way for users to say 'I am ok with company B knowing that I have a relationship with company A' then these increasingly nefarious tracking efforts will happen.

Given how companies have completely and utterly ignored the idea of consent banners, I am deeply disinclined to believe that most companies would ever actually be satisfied with a user controlled choice in the matter. Where we actually are is that companies will relentlessly attempt to stalk everyone all the time no matter what, and in face of that the only sane conclusion is that in practice tracking is bad and we should shut it all down.


Actually valuable educational sales middlemen - making an individual aware of a tool that they didn't know existed but which solved a problem they already had - are such an infinitesimally small part of online advertising that those use cases can be completely ignored.

This isn't a situation where one rotten apple spoils the whole bunch, it's more like one good apple inadvertently was dropped into a toxic cesspool of rotten apples.


I'm just guessing here, but I'm fairly sure they use a model that updates dynamically as the "user" (or victim) changes his or her browsing settings, even when the user tries to hide. It sounds like some kind of Bayesian filtering going on, or some sort of Markov chain or decision tree. That is to say, their model tracks the likelihood that you're the same unique user reloading the page, based on the information it can glean from you.

This makes it exceedingly hard to hide from such a filter, because in communicating with these sites, you are bound to reveal at least some information about yourself. And then the "likelihood-machine" does the rest by connecting the dots, even if you gave them "fewer dots."
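That "likelihood-machine" intuition can be sketched as a tiny surprisal-weighted matcher. Everything below is invented (profiles, attribute values, population frequencies); the point is only that rare matching attributes score high, and hidden attributes just contribute nothing:

```python
import math

# Profiles stored from earlier visits, plus made-up population frequencies.
profiles = {
    "user_a": {"tz": "UTC+1", "lang": "de", "gpu": "M1"},
    "user_b": {"tz": "UTC-5", "lang": "en", "gpu": "RTX3060"},
}
population_freq = {"UTC+1": 0.2, "UTC-5": 0.25, "de": 0.07, "en": 0.5,
                   "M1": 0.1, "RTX3060": 0.05}

def match_score(observed, profile):
    """Sum the surprisal (in bits) of every attribute that agrees."""
    return sum(-math.log2(population_freq.get(v, 0.5))
               for k, v in observed.items() if profile.get(k) == v)

# This visit reveals fewer dots (language hidden), yet the match survives.
observed = {"tz": "UTC+1", "gpu": "M1"}
best = max(profiles, key=lambda name: match_score(observed, profiles[name]))
print(best)  # user_a
```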

It's also quite interesting - or perhaps chilling - to see how fingerprinting through NLP and other language tracking algorithms can also track just about any forum post you do, even if you're using a pseudonym.


Target and the model that found the pregnant girl (bad counter-argument here: https://medium.com/@colin.fraser/target-didnt-figure-out-a-t...).

There are three options:

1. Prevent/Stop it: This ship sailed long ago. Not to be grim about it, but Pandora's box got opened.

2. Fight it: Tool up, change your print, your behavior, your place. Build focused VMs that you use per topic. Simply do a WHOLE lot less. In the grand scheme, it's a lot of work for low return. Note: there are exceptions.

3. Increase Noise: The whole point of most data collection is to sell more to you. Because most people are sheep, a fairly simple model can be surprisingly accurate (over-targeting is an issue). Don't be a sheep: diversify, make more noise in the system, search outside your comfort zone, and change it up often.


Regarding the more noise strategy, Mozilla has this fun tool: https://trackthis.link/


I thought about the noise route, but doesn't that make you more unique? Maybe if many users share the noise, but then that makes it easier to identify what's noise and what's not.


Anything that doesn't impact the signal, or can be separated from the signal, does not qualify as noise.

You want to quickly throw targeting systems off your scent (or get them distracted)? See how sticky high-value sales are for the ads you see online. Start looking for a new car, use the word wedding too much (god help you if you're a woman), or say vacation 3 times near a search engine, and watch how quickly your ad experience changes.

This won't work "long term"

As an example: You get an ID as a 24 year old male, who likes his local sports ball team, drinks canned domestic beer... that's a profile that is perfect to sell you a BBQ grill and a subscription to the meat of the month club. Spend an hour or two a week pursuing sewing, the engine is going to get confused! Maybe you share a device with your wife, or she got on it...

This is the sort of noise you create; it's not random, it's "more", and you do it by going off-type for a while. Have a friend who is into something you aren't (music, art, and so on)? Ask them some questions and go spend a week getting more informed on their hobby, then have a chat with them. Suddenly the systems will see you as MORE...


Well, I'm glad to report that my efforts to fight fingerprinting have paid off.

I use a text based browser, with no js, no cookies, no css, no external requests past the first html page download, no user agent, no etag, I connect through Tor and I've modified the browser to randomize http headers. And of course, it sometimes happens that I want to see something that is refused to me with that configuration (like, seeing anything behind the big internet killer, aka Cloudflare - thanks archive.org for existing), so I have also a classic browser for the occasional lowering of barrier.

At first, I thought fingerprint.com did identify it, giving me the hZ4W5oQ7pJVIHbW2fBXA id. Then I realized it was giving the same id when using curl, with and without Tor. Then I realized, by googling and DDGing that id, that it's also the one reported to search engines. So it's not unique; it's basically a "dunno" reply.


Oh, finally I found a distinguishing element for the iPhone:

The zoom settings in the display/brightness section of the iPhone seem quite relevant to fingerprint.com's algorithm.

Toggling between standard/bigger text toggles the fingerprint value.

This could be because the visible area of the screen changes, as well as some value of the CSS fingerprint.


Fingerprint.com gives me different IDs across different tabs, and also in private mode. I guess the privacy setup still works somewhat. The stack I use:

- Firefox, Enhanced Tracking Protection ON

- Multi-Account Containers + Temporary Containers addon

- Privacy Settings addon, most settings private, but referrers enabled

- uBO with lots enabled, Decentraleyes addon


The EFF has tried to get folks to pay attention to this for years. See https://coveryourtracks.eff.org/ aka Panopticlick

And it probably understates the problem these days, missing some of the more recent techniques.


Fingerprinting is one of those things where there's really been a slippery slope, and we've just slid further and further down it over the last decade. Back when I worked at an ad-tech startup (almost 15 years ago) I ran an experiment myself with our data to see if a simple hash of IP, browser agent, and maybe a couple of other signals we had in our logs (don't recall) would correlate with the cookies we already had through cookie matching from other sources. And the answer was: yes, about 95% of the time. Enough to be reliable enough to do basic retargeting without worrying about excessive false matches.
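The shape of that experiment is easy to reconstruct. Everything below is a hypothetical reconstruction (example IPs, header values, the 16-hex-char truncation), not the original code:

```python
import hashlib

def crude_fingerprint(ip, user_agent, accept_language):
    """Hash a few passive signals into a pseudo-ID (illustrative only)."""
    raw = "|".join((ip, user_agent, accept_language))
    return hashlib.sha256(raw.encode()).hexdigest()[:16]

# Two requests with identical signals collide into the same pseudo-ID...
a = crude_fingerprint("203.0.113.7", "Mozilla/5.0 (X11; Linux)", "en-US")
b = crude_fingerprint("203.0.113.7", "Mozilla/5.0 (X11; Linux)", "en-US")

# ...but the ID breaks as soon as any signal changes (new DHCP lease,
# browser update), which is one reason the match rate was ~95%, not 100%.
c = crude_fingerprint("203.0.113.8", "Mozilla/5.0 (X11; Linux)", "en-US")
print(a == b, a == c)  # True False
```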

But at the time, it was considered to be a big do not touch -- just don't do this. Not so much for ethical reasons, but for optics in the industry. (I wasn't proposing doing it, was just curious)

In the meantime, though, this seems to have just become standard practice, but way more sophisticated with way higher accuracy, as this article touches on.

What was not acceptable a decade ago is now "ok." Not just by sketchy ad startups, but by major players.

But this whole mess ties back to one of the things that worries me the most about the propagation of LLM type ML out into the general industry. It's only a matter of time before ad targeting takes on an extra dimension of creepiness through this (and I'm sure it's already happening in some aspects, inside Google & Meta.)

In the past, in ad tech & search, etc., people could say things like: "Yes, it's highly targeted. Yes, we've correlated an absolutely huge quantity of data to fingerprint you exactly, and retarget you. But it's anonymized. No humans saw your personal data. It's just statistics." Not saying whether or not this argument has merit, just repeating it.

But now, here we are, where "just statistics" is a far more intricate learning model. One which is capable not just of correlating your purchases and browsing activity, but of "understanding" you, and which -- while not an AGI -- is pretty damn smart.

At what point does "a computer scanned your browsing for patterns and recommended this TV set" become ethically the same as "a human read your logs, and would like to talk to you about television sets..."?

Having worked in ad-tech before (and having worked at Google, in ads and other things as well), I do not trust the people in that industry to make the right decisions here.


Seems like a disguised ad for that fingerprinting service. Resist fingerprinting was already set to true in my Firefox. "Worse than I thought" apparently means "I thought there was no fingerprinting but I found out there is fingerprinting."


Try to get a non-unique on iPhone. I’ll admit it’s worse than I thought.


Is anyone trying to tie users to multiple devices, and consequently identify both fingerprints as being from one user? I.e. Let's say I visit HN on both my laptop and on my mobile phone, each will have a very different fingerprint, but not only do I visit the same site on both devices but I am unlikely to do so simultaneously across the two devices, and there are likely to be other factors such as not visiting on either device during sleeping hours, not visiting on either device before some date (i.e. when I got into HN).

Perhaps you could call this something like 'cross-device fingerprint unification', idk.


I think it would have to have some code that tied those two fingerprints together... something like `fingerprint.identifyUser("jefc1111")` which would then store both of those fingerprints against your user id.


Another technique that could be used would be to look for distinct fingerprints that have both visited the same extremely niche web addresses. Someone is surely doing this already.


We did a demo at CES in 2015 which retargeted users on a secondary device.

The demo delivered an ad-unit on mobile after viewing an ad-unit on TV.


If I have a certain phone model with updates applied. Is there something that distinguishes me from other people with the same phone and browser version other than the IP address?


Literally the comment below yours mentions GPU fingerprinting (0). Regardless of whether you won or lost the silicon lottery, you're different enough to be tracked.

(0) https://www.bleepingcomputer.com/news/security/researchers-u...


I, too, would like to know this and find it odd it wasn't mentioned at all. Most web traffic these days is from mobile devices, not desktops/laptops. And Apple at least seems to try doing a decent job of obfuscating trackable info by default on top of massive numbers of people having the same device (probably not true for Android).


I have been trying Apple devices for an hour now. Nothing I can do gets them to pass the eff or linked tracker sites.

Firefox, VPN, privacy extensions, nothing works.

Apple has work to do.


Do you have the same localization settings? The same timezone? The same browser settings? The same screen orientation? The same model of bluetooth headset? These are all likely factors in a client-side fingerprint.


"Given there are companies selling fingerprinting as a service, if you want to really protect yourself from fingerprinting, you should use Tor Browser or Firefox with resistFingerprinting=true."

Fingerprinting services try to figure out your browser settings. Since very few people have this feature enabled, you might be easier to fingerprint by enabling it. A metric that has historically been used for fingerprinting is the "do not track" feature, which is a bit of irony.


How many websites do you need to visit before being unique in the world?

Say I follow AS Monaco football, then look for Lego Castle figurines and finally visit a forum on Alaskan Malamute dogs. The combination of these three websites is pretty close to unique in the world imho.

Surely most people can be uniquely identified after visiting a couple more, unless we change browser and ip-address and GPU and set resistFingerprinting=true and ... and clear cookies after every website we visit.
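A back-of-the-envelope version of that intuition. The audience fractions below are invented; the point is only that the probabilities multiply, so a handful of niche interests is already near-unique:

```python
# Invented fractions of all internet users who visit each niche site.
internet_users = 5_000_000_000
visit_prob = {
    "AS Monaco fan site": 2e-4,
    "Lego Castle figurine shop": 5e-5,
    "Alaskan Malamute forum": 1e-4,
}

# Assuming independence, the chance someone visits all three multiplies out.
p_all_three = 1.0
for p in visit_prob.values():
    p_all_three *= p

expected_others = internet_users * p_all_three
print(f"expected other people with the same trio: {expected_others:.4f}")
```

With these toy numbers the expected number of other people sharing the trio is well below one, i.e. the browsing history alone identifies you.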


The idea of Incognito mode is that the website should be unable to detect that you are using it.

There is a bug in Chrome, which I reported, but they told me they will not fix it: https://bugs.chromium.org/p/chromium/issues/detail?id=120485...


You are not detecting incognito mode but another attribute that correlates with incognito mode.


On iOS I visited fingerprint.com in Safari twice and then opened Brave with its "Block fingerprinting" setting enabled, and it registered it as my third visit! They should label it "resist", as that would be a lot more honest.

And https://www.amiunique.org/ says I’m unique in Brave compared to “nearly” in Safari haha


Same. I tried an iPhone with a VPN change, cleared cache, Firefox on iOS, blocking extensions, and… it gets me every time.


I think Firefox might actually enable this by default for third-party sites, but I'm not 100% sure what this about:config one does:

  privacy.trackingprotection.fingerprinting.enabled
This would make sense since messing with values for the root frame could cause unwanted side effects, but you're not likely to care if some iframe gets your screen resolution or CPU count wrong.


Strange, `privacy.resistFingerprinting = true` did not solve the issue for me; I'm still fingerprinted by https://fingerprint.com/. Even after clearing all caches and restarting Firefox.

Adding the extensions `CanvasBlocker` and `Temporary Containers` did solve the issue though.


Anyone know if there's been any forks of Chrome that enforce more privacy features? I know Chromium is a thing, but I doubt the builds for Chromium (except when tweaked by some Linux distros) do much like Firefox does.

I only use Chrome to test some things, or to create a completely isolated browser session disconnected from my use of Firefox.


Brave, Iridium, and Bromite come to mind.


Iridium sounds like it might be what I want, thanks!


Why do these systems use hash-based fingerprinting? Wouldn't it be "better" to have a "browserspace vector", or "browser embedding"? So that if one fingerprint tactic fails in incognito, you don't completely lose the fingerprint, you just get a slightly different vector?
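Linking by similarity instead of an exact hash is straightforward to sketch. The attribute sets below are hypothetical; the idea is that incognito only perturbs a couple of signals, which breaks a hash but barely moves a "browser vector":

```python
def jaccard(a, b):
    """Set-overlap similarity: 1.0 means identical attribute sets."""
    return len(a & b) / len(a | b)

# Hypothetical attribute sets for the same browser in two modes.
normal = {"ua:Firefox/111", "res:2560x1440", "tz:UTC+1", "gpu:M1", "ext:uBO"}
incognito = {"ua:Firefox/111", "res:2560x1440", "tz:UTC+1", "gpu:M1",
             "quota:small"}

score = jaccard(normal, incognito)
print(f"similarity: {score:.2f}")  # high enough to link the two sessions
```

An exact hash of either set would differ completely, but a tracker thresholding on similarity (here 4 of 6 attributes shared) would still connect them, which may be one reason hash-only demos understate the problem.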


Is anyone here working at Apple?

https://niespodd.github.io/webrtc-local-ip-leak/ still (?) leaks the local IP in mobile Safari. On browserleaks, the local IP check fails, giving a false feeling of safety.


I remember once in college I'd shared my phone's hotspot to connect a TV to the internet after it had abruptly stopped working. And all of a sudden the ads being shown (on YouTube) switched from the local language to mine, both of which are completely different.


The combination of IP, language preference and available fonts is very potent and not obvious to Americans.


Asking the wider audience here: I have uBlock Origin installed on my Chrome browser, and I surf mostly in incognito mode. I know this is nowhere close to an optimal setup, hence asking. What setup do you folks use to prevent being tracked as best you can?


It is even worse than the OP realizes. They should run these EFF tests [1] to see how severe the problem is and that it is practically impossible to combat.

[1] https://coveryourtracks.eff.org/


I tested this myself with LibreWolf and Firefox. LibreWolf, which is supposed to be hardened and has resistFingerprinting on by default, didn't stand a chance: the visitor ID was always the same. In Firefox the visitor ID was always different.


There's a flipside question as well, how many users have the same fingerprint as you?


It depends on your browser. If you have a common iPhone with Safari in your local language, many people have the same fingerprint. Configure your iPhone with Chrome (which is a WebKit view on iOS) and another language, and you are suddenly much rarer. Compile Firefox on your Arch Linux with Nouveau drivers and a 16:10 screen, and you are unique in your area.

You can experiment there: https://coveryourtracks.eff.org


The problem is that the alternative, native applications, is even worse. Let's face it, some level of identification comes with networking; there are ways to anonymize connections, but none is perfect.

Tracking should be limited with legal means.


As another vote against JS: this website is able to accurately detect at least half of the extensions I have installed on Chrome.

https://browserleaks.com/chrome


I just tried to turn on `resistFingerprinting` in Firefox and it meant that my zoom preference for HN got reset every time I opened a new page (I have it set to 120% by default). Anyone know why? Bug?


Not wanting to be fingerprinted means using the default options for everything, which is what Firefox enforces when the setting is on.


Ohhh that makes sense! Meaning there's a way for a website to detect my zoom level? I wonder how? Checking calculated font size via JS or something? Ugh


I guess we could make our browsers lie about the parameters that are collected during fingerprinting; that would be far more convenient than disabling JS, etc.

EDIT: Or block the extraction


If you buy two of the same exact model iPhone and boot/config IDENTICALLY, on the same Wi-Fi network, they would have the same fingerprint, right?


Tested fingerprint.com with Vivaldi Mobile and it didn't correlate me across a normal tab and an incognito one, so it's not foolproof...


From my testing, this doesn't seem to work on Safari. But it actually works on Chrome. Another reason for using Safari instead of Chrome.


I find it scary, coming back to the fingerprint.js site after years, to still be correctly identified and to see the exact dates I visited.


Interesting, despite me not using a VPN, it has me “identified” in the totally wrong location (in fact, multiple wrong locations within minutes).


Maybe your fingerprint is just common? Location probably comes from geoip.


Actually quite surprised to see that this identifies me on Safari in Incognito, after visiting in regular mode first.


And on Chrome - even counts the number of times you visit in incognito


Damn yea I didn't know it was so easy to do in practice (I just heard about theoretical approaches)


It is sad that Brave + uBlock Origin + DDG Privacy Essentials does not seem to be able to fight this.


Wouldn't the browser and extensions make you more unique?


I tested it on Brave mobile (Android) and I got a different fingerprint each time.


The demo got my browser totally wrong. It has me showing up in various places around the country, and I don't use a VPN. On one of the dates I was out of the country and my laptop was at home, turned off.


You are using cellular data, then. Your exit point when using 3G-5G can be in a lot of strange places. Not unusual.


“World's most accurate”: source, FingerprintJS. Sounds legit.


I was able to "trick" the fingerprint.com test by opening it first with firefox, then with tor browser. Gave two different visitor IDs. So as suspected, it largely relies on IP address.


A different browser with the same IP also gives you a different hash.


It would be nice to have also the Safari evaluation.


With resistFingerprinting enabled in FF, it resets the zoom level I have set on each individual site, so it's just annoying.


In hindsight it’s clear. Why did we allow the web advertising mega corp to own the browser we use? Huge conflict of interest. There’s no way our privacy was going to survive.


We use web fingerprinting and adjacent methods to crack down on ID sharing for our SaaS that charges (per person). I make no apologies for this practice.


How does that work out for you? This doesn't strike me as a good use of fingerprinting:

- Since you charge per person, what about people that use multiple machines and browsers (with presumably different fingerprints)?

- On the other hand, unless two people share the same workstation and computer account, how do you expect to use fingerprints to detect license abuse?


We use other signals as well: time of day, IP address, a new cookie logging out the old cookie. At the end of the day we are dealing in probabilities, but we can definitely find the most aggressive sharers.


What is the use case for these fingerprints when adhering to the GDPR? You can't store them in a DB and use them to target a returning anonymous visitor with products relevant to their last visit. You can't send them to a third-party ad service to get more relevant ads. Isn't the whole point of the fingerprint to maintain a pseudonym for your users over some time window? But that requires storing them, which would be against the GDPR.


To prevent spam. If someone is spamming your site, how do you tell whether a request is coming from a legitimate user or from the spammer? Fingerprints are how you tell the two apart.
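As a sketch of that idea: key a rate limit on the fingerprint hash instead of the IP, so a spammer who rotates IPs but reuses the same browser still gets throttled. All names and thresholds here are invented for illustration:

```python
import time
from collections import defaultdict, deque

POST_LIMIT = 5          # max posts per fingerprint per window (illustrative)
WINDOW_SECONDS = 60.0

_recent = defaultdict(deque)  # fingerprint -> timestamps of recent posts

def allow_post(fingerprint, now=None):
    """Sliding-window rate limit keyed on the browser fingerprint."""
    now = time.monotonic() if now is None else now
    q = _recent[fingerprint]
    while q and now - q[0] > WINDOW_SECONDS:
        q.popleft()               # drop posts that fell out of the window
    if len(q) >= POST_LIMIT:
        return False              # over the limit: likely a spammer
    q.append(now)
    return True

# The sixth post inside the window from the same fingerprint is rejected.
results = [allow_post("abc123", now=float(i)) for i in range(6)]
print(results)  # [True, True, True, True, True, False]
```

The weakness, as the rest of the thread points out, is that the same mechanism that catches spammers also works for tracking ordinary users.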


Agreed. Cross-site tracking of devices without consent is going the way of the dodo. With respect to fraud prevention, being able to analyse device signatures along with identity and behavior on a per-site basis is the only reason we are able to enjoy what's left of the 'open web'.


I thought that was most commonly dealt with via a first-party cookie, i.e. show the captcha to anyone who doesn't have the cookie. At least that's how it feels when you browse incognito.


What happens if they solve the captcha? If a captcha-solving service costs $0.02 per 1000 captchas, that means they can post. It costs about $10 to post a spam message every minute for an entire year, even if they get banned after every post. If you want to annoy a site owner, that would be an easy way to do so.
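The arithmetic in that comment checks out; a quick sanity check using the comment's own hypothetical rates:

```python
cost_per_captcha = 0.02 / 1000     # $0.02 per 1000 solves
posts_per_year = 365 * 24 * 60     # one spam post per minute, all year
annual_cost = posts_per_year * cost_per_captcha
print(f"${annual_cost:.2f}")       # $10.51 a year to spam once a minute
```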


Surely if your website collects data using browser fingerprinting this is covered by GDPR and you have to tell your visitors/ask for permission?

https://www.eff.org/deeplinks/2018/06/gdpr-and-browser-finge...


I believe that, despite their claims, this fingerprinting technique actually DOES violate the GDPR.


They claim to be ‘GDPR and CCPA Compliant: Your compliance officer will love us, too’. However, GDPR defines ‘personal data’ as ‘any information relating to an identified or identifiable natural person’, and this includes ‘an identifier such as an online identifier’. Therefore, browser fingerprinting may also fall under the scope of GDPR.


GDPR doesn't really apply outside of Europe, despite what the EU might claim.


The EU does not claim that it applies outside of Europe, just that the law applies to all your customers/visitors that are within the EU.


IIRC they do try to claim it applies outside of Europe; they say their laws apply to any entity processing data of EU citizens, regardless of where the data or website actually lie.


I think it's well within the rights of the EU to legislate in which way the data of its citizens is processed. If your product or service is accessible to EU citizens, in the EU market, then you need to abide by the laws of the EU. It's no different for physical or virtual products.


Many EU websites carry speech which is illegal in other countries.


> If your product or service is accessible to EU citizens, in the EU market, then you need to abide by the laws of the EU

It's not that simple though.

If I offer a website in the US, I can collect the info of anyone that visits it as long as I am not breaking US law.

If the EU doesn't like that, then they can block my site.

They claim though that I am subject to their law if I harness the data of Europeans.


Yes, this is what they claim. As long as your company has no physical offices in the EU you probably don't have to worry about it. If your company grows bigger, you probably should.


That was my only point, that they claim something which simply is not true.


also, one could just roll it up into a wall of fine print or something, no? who reads these things anyway?


> also, one could just roll it up into a wall of fine print or something, no?

That also violates it. Facebook just lost in court in the first instance trying that.


GDPR requires that an opt-out be available and be just as easy as opting in. Fine-print disclaimers are illegal.


What the world needs:

<body onload="javascript.disable()">


GDPR should have been approached at browser level. But there would not have been money to make for those that provide "compliant" banners. I guess the economy needed the stimulus.


Too bad the biggest violator also created the biggest browser.


It's not too late. The EU is breaking Apple's and Google's mobile app store monopolies next year with the Digital Markets Act.

Those same two companies effectively control the browser market. If there's political will in Europe, they can be forced to implement working privacy controls.


> GDPR should have been approached at browser level.

GDPR. isn't. about. browsers.


I totally agree with you! .. and second: the website navigation would be smoother without those banners!


GDPR isn't about cookies, browsers or the web.

> But there would not have been money to make for those that provide "compliant" banners.

Are you serious? Do you think WordPress addon makers lobbied GDPR through the European Parliament?


Note also: As the number of APIs increases, so does the fingerprinting. E.g. MIDI device enumeration (no prompt in Chrome, prompt in FF, not implemented in Safari): https://twitter.com/denschub/status/1582730985778556931?s=20


We need 2 classes of web: one document-based that doesn't require JS to run (secure); and an insecure one, for SPAs and anything that requires JS to see the full content.


The dark web is the document-based web. Sites built for Tor Browser have to assume JavaScript is disabled. So they have to rely on server-side rendering, old-school HTML forms, HTML meta refresh, etc.

Surprisingly, one thing that seems to work just fine in this environment is phpBB (even modern versions). Lots of phpBB dark web forums.

Also surprisingly, this doesn’t preclude polish or some level of app-like stateful interactivity, because CSS still works. You just have to think differently about how you use it.


Back in the day, we had a nice boundary between the document and the "app". Then for some reason we decided that Flash doesn't need to be a thing any more and erased that boundary by building the app functionality into browsers themselves, making the app and the document inseparable. We should have invested that effort into building an open source Flash player instead.

One of the nicest things about Flash was that you could set your browser to only load and run Flash content after you click it.


Java applets were worse though: every time I got a virus of any sort from merely browsing generic sites, it was due to Java in the browser. I finally stopped installing Java for the web and my security problems went away.

Flash had some security nightmares all the time too if I remember correctly, but I don't think it ever screwed me over like Java did.

I think unless we lock down new APIs that aid in fingerprinting to only be accessible to WebAssembly, and let people block or enable WASM, there's not too much else we can do. It would be nice to be able to block web APIs selectively to limit what a JS script can do.


> Flash had some security nightmares all the time too if I remember correctly, but I don't think it ever screwed me over like Java did.

Those incessant RCEs were only due to the sloppy way the Adobe Flash player was written. There is nothing inherently bad, security-wise, about the SWF format itself.

Ruffle is an open source Flash player in Rust, currently under active development. I'm sure it won't have such problems because 1) it's open-source and 2) it's in Rust, and I was told that anything written in Rust can't possibly have any memory-related vulnerabilities; we'll wait and see if this would still hold true if/when they implement JIT compilation for AS3.


> I think unless we lock down new APIs that aid in fingerprinting to only be accessible to WebAssembly, and let people block or enable WASM, there's not too much else we can do.

IMO, it should be enough if incognito mode presents an identical fingerprint on everyone's browser.


It's not that easy to "present a fingerprint" without compromising the user experience. Sure, you could remove all those PWA and pretend-OS APIs and hardly anyone would notice, but what about things like viewport size and font rendering? You can't exactly hide them from a website.


> what about things like viewport size and font rendering? You can't exactly hide them from a website.

Of course you can. Viewport? Just return fake viewport data containing the most statistically common display properties. Website renders incorrectly? They only have themselves to blame, shouldn't have abused that data for hostile purposes. Data is a privilege, we can and should take it away. Fonts? Just force everything to use Noto Sans or Noto Mono. Everything will render correctly. Maybe the designer's vision won't be fully realized but that's not a problem.


> It's not that easy to "present a fingerprint" without compromising the user experience.

And that's exactly what I'm talking about.

> what about things like viewport size and font rendering?

Not much can be done about viewport size, but a browser could easily ship with 2 fonts (one serif and one sans serif) and only allow access to those.


“Font rendering” is a different thing than “what fonts you have.” Font rendering is about how fonts are drawn to the screen. The trick is to draw some words to a <canvas> and then pixel-peep the result. Different OSes and browsers use different font renderers and font hinting logic; fonts will even render differently on a different-DPI screen.


Different browsers are always distinguishable, but a single browser could choose to always use the same font rendering code and settings, at least for private browsing.


Not just the <canvas>. The font rendering of the underlying platform also influences the width of strings. So if you create a <span> with some text, its width will differ by several pixels depending on the host OS.


Don't they already use freetype to parse webfonts?


Is that a standard — that all browsers are forced to use the Freetype library, or to be bug-for-bug compatible with its glyph+hint parsing semantics? I've never heard of anything like that.

But also, even if they did, AFAIK browsers still mostly lean on OS text-drawing APIs for font rendering. Text in Chrome on Windows looks different than text in Chrome on macOS, etc. The same pile of beziers, and the same pile of hints, converts into a different set of hinted pixels (and sub-pixels!) when fed to each OS text-drawing API. Especially when those APIs are configured by user settings around subpixel hinting / "font smoothing", and when those APIs are aware of the device being rendered to and so render subpixels differently for high-DPI vs low-DPI screens, RGB vs BGR displays, etc.


For viewport, you can limit the size presented to the page to a few sizes with different aspect ratios. Browsers can simply rescale the page to the actual window size for display on the screen. That also works for font rendering.

If users decide they want pixel-perfect display, they can either resize the window to one of the allowed sizes or disable this feature for a specific page.
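A sketch of the bucketing idea above: report only the nearest allowed size to the page and let the browser rescale. The whitelist of sizes is invented for illustration:

```python
# Hypothetical whitelist of reported viewport sizes (width, height).
ALLOWED_SIZES = [(1280, 720), (1366, 768), (1920, 1080), (375, 667)]

def reported_viewport(actual_w, actual_h):
    """Snap the real window size to the closest allowed bucket, so
    thousands of distinct real sizes collapse into a handful of values."""
    return min(
        ALLOWED_SIZES,
        key=lambda wh: (wh[0] - actual_w) ** 2 + (wh[1] - actual_h) ** 2,
    )

print(reported_viewport(1337, 742))  # (1366, 768)
```

The trade-off is exactly the one debated here: the fewer buckets, the less identifying the value, but the more the rescaled page deviates from pixel-perfect rendering.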


You are describing the "compromising the user experience" alternative.


The user and JavaScript can simply see different things, like AdNauseam does. The user experience is already shit.


I think we will end up with something like permission grants (including granular JS APIs available to the website, as we do for the location and camera APIs at the moment) per website, and convenient tools built into the browser that allow you to create/re-use patterns so you aren't actually interrupted by this strictness too much.


> per website, and convenient tools built into the browser

Per-website, for dozens (if not hundreds) of APIs and convenient? These are contradictory :)


Yeah, it sounds a bit overwhelming for users, but my point here is that we would need appropriate tooling offered to end users so they don't get lost (quickly, lol).


> We need 2 classes of web: one document-based that doesn't require JS to run (secure).

I've wondered for a long time if a sort of posh gopher based on markdown with extensions would be able to make a comeback. Especially if it allowed for CSS.


But not current web CSS, please. It manages to be simultaneously overcomplicated (including enabling fingerprinting) and really bad at laying out text (e.g. still no baseline grid). A ‘markdown web’ style sheet should be more like document processor's character/paragraph styles. It also needs to be easily overridden for accessibility or alternate presentation, particularly around size and colour (a markdown-like format should already be fine for screen readers without styling). Aside: FFS, web people, if you're setting colours at all, respect @prefers-color-scheme and do not use the inverse for code blocks.

There's also the million-markdowns problem, and markdown's HTML embedding. This being Tuesday, I'd start with djot (without embedding), but Wednesday I might go for asciidoc.


You're sort of describing Gemini.

https://en.wikipedia.org/wiki/Gemini_(protocol)


Except that is too limited even for a document web.


As with most things, this isn't really a technical challenge, it's a social one. The protocols you're describing already exist, more or less. No one uses them.


Why not just good old Web 1.0, or even HTML5 without JavaScript? There are already plenty of pages that conform to that; you just need the client enforcement (also already available via extensions) and marketing/lobbying so that big organizations switch to it.


Yeah, that's never going to happen; JavaScript is a lost cause. There's no way backwards compatibility will lose out to "a bit more privacy", especially when the people in control benefit immensely from this.


Which class you're in will be under the control of the developer, and they'll always choose SPA, even for the presentation of static text.

And to be fair it makes a lot of sense because writing HTML templates feels super jank once you've experienced not doing it. Even for a site with static content I would still prefer to deliver it as a static JS bundle and a data payload.

I really like https://docsify.js.org. It's got to be one of the lowest-touch libs out there. The whole site, from git repo to page, is one single, completely static asset.


MIDI device enumeration is behind a permissions prompt, though? "The user must explicitly grant permission to use the API though a user-agent specific mechanism, or have previously granted permission." https://developer.mozilla.org/en-US/docs/Web/API/Navigator/r...

EDIT: nope, not as implemented in Chrome https://www.jefftk.com/test/webmidi


At the time of the tweet not in Chrome: https://twitter.com/denschub/status/1582730988118867968?s=20


And the tweet is correct, unfortunately: https://www.jefftk.com/test/webmidi

Looks like Chrome is trying to change this, and is slow as usual: https://groups.google.com/a/chromium.org/g/blink-api-owners-...


> is slow as usual:

It's funny because for anything Chrome deems beneficial to Google they are anything but slow, including shipping APIs that no other browser agreed on.


Having been someone at Google working on new browser APIs, that's slow too. But maybe it doesn't look as slow from the outside?


> But maybe it doesn't look as slow from the outside?

Google ships 400 new APIs per year. It readily ships an API within a month after it spits out a half-prepared spec and asks other browsers for input.

Even benign changes like CSS headline balancing was sent to TAG three weeks ago, and will ship a month from now.

From the outside this is breakneck speed with utter disregard for anything. But when user privacy is concerned? Nah, they must take their sweet time to do anything.


> Even benign changes like CSS headline balancing was sent to TAG three weeks ago

The "text-wrap: balance" proposal is not new, though? I see it in the 2019-11-13 draft spec: https://www.w3.org/TR/2019/WD-css-text-4-20191113/#valdef-te...


Perhaps, but it was sent for TAG review only three weeks ago, and already with an intent to ship in a month.


Isn’t that the point of the RFC approach to standardization? “Here’s what we think should be done, with the PoC being what we’re already doing ourselves in production; feel free to try our impl out, in order to better notice the design flaws in practice, so that we can talk out what changes could be made before it becomes a de-facto standard we’re all stuck with”? SPDY → HTTP2 was a great example of Google doing exactly this.

The opposite of the RFC approach is the “airy design document written by standards body in reference to nothing, never implemented by anyone” approach; and I know which of the two I prefer.


> Here’s what we think should be done, with the PoC being what we’re already doing ourselves in production

The problem with having anything in "production" on the web is that you can neither update nor change it, because people will rely on it.

The idea behind web standards is that there should be at least two independent implementations, tested behind a flag, with iterations on design, before it becomes a full standard.

Chrome's approach for the past several years has been: spit out a half-completed spec, "ask" other browsers for input... and ship it in prod a month later.


It looks like that was just because it was very uncontroversial, though? https://github.com/w3ctag/design-reviews/issues/822


Didn't Firefox also require the user to install an extension before enabling MIDI support?

Edit: I think MDN confirms this, with the asterisk next to Firefox: https://developer.mozilla.org/en-US/docs/Web/API/Web_MIDI_AP...

Edit 2: oh, the tweet shows two prompts, one of them to install the extension, so I suppose that is actually the prompt you're referring to.


Is there a Firecracker VM or something similar that comes preconfigured with a browser and VNC/RDP, that can be used like a native browser but runs in a VM that isn't fingerprintable?


For anyone to whom this is news: this is why I always call the "I don't care about cookies" extension an adtech submarine. It deceives you into thinking it's all about cookies, when the permissions you grant automatically are, in many cases, about tracking; using that extension will often have you consent that fingerprinting you and creating a profile based on it is perfectly fine.


To me, the thing is that I can't count on the consent modals to actually do anything. Am I really going to invest time into checking their word? How would I even do that? That's on top of all the time wasted moving sliders or hitting "reject all".

For me, the cookie consent modals are the submarines. Why would I outsource the responsibility not to track me to the people with the incentive to track me? IDCAC, Cookie Autodelete, and strict tracking protection feels like the better alternative for me.

(From today onwards, I'll add resistFingerprinting=true to that list as well.)


Obviously this is only relevant for people who think companies care a bit about trying not to flout the law; I thought that was a given.

There are also proper consent blockers [0], but they are not as big because everyone tells people to use that please track me shit.

[0]: https://github.com/cavi-au/Consent-O-Matic


Implying they actually stop tracking when you press "Reject"


They may not, but if you're in/from the EU and press "Reject" and they still track you, they're breaking the law.


"break the law" means nothing when you're a large corporation that makes more money from breaking it than you spend on fines.

Tech companies routinely get fined for what may seem like massive amounts to us.

https://www.businessinsider.com/the-7-biggest-fines-the-eu-h...

If "breaking the law" meant something they would try to avoid doing it so often.

Microsoft in the 90s was found guilty of abusing their monopoly. What were the consequences? Nothing. This sort of thing used to mean something; see Standard Oil v. United States. But the current world is a world that belongs to megacorporations.

Tracking people against their will is a drop in the ocean of what corporations get away with.

This, in many ways, is like a billionaire getting a parking ticket that doesn't amount to more than a few hundred dollars. The billionaire doesn't care.

Law only has meaning when the punishment is coercive.


[dead]


Your customers perhaps, but google and facebook definitely use it for tracking.


[flagged]


Just because you can doesn't mean you should. Worst ethics ever. I hope you go broke.


I disagree. I hope the guy becomes wildly successful, so that a leak of his methods gets in the news here and we know how to protect against that as well.

What you suggest is to put our heads in the sand instead. No, no and no. I prefer to be exposed to the worst so we learn how to protect ourselves. That's why this is Hacker News and not PutOurHeadInTheSand News.


The main use cases we're tackling are financial fraud, scams, account takeover and more. Over $32 billion is stolen online yearly due to financial fraud, and browser fingerprinting has proven to be one of the most reliable ways to combat sophisticated fraudsters.


Next you tell me this thing will be saving us from child porn or even terrorism.


Wow, that really is a shameless plug.


For a crappy product as well, double shameless..!


What makes you think that this is a worthwhile addition to the world?


We're focused on serving ethical use cases such as combating fraud, account takeover, scams and more.


How many clients have you refused to work with for ethical reasons after they offered you money?


But if I was a fraudster you'd still take my money, right?


"That's how web works."

Nah. I make an HTTP request and I get a response. That's how the web works. Perhaps people can have different opinions on "how the web works".

Web fingerprinting relies on a heap of assumptions. For example, that someone uses a web browser to make HTTP requests, that the web browser sends certain HTTP headers in a certain order, that the web browser runs Javascript, that it processes cookies, recognises HSTS response headers, and so on and so on.

If all the assumptions are true, maybe web fingerprinting is effective. But if the assumptions fail, maybe web fingerprinting does not work so well.

I have only ever read blog posts about web fingerprinting that take all the assumptions as true.

The majority of traffic on the internet is said to be "bots". Not web browsers running Javascript, processing cookies, and so on.

It seems to me that someone should discuss what happens when the assumptions fail.

Do advertisers care about computer users who do not use graphical browsers much. As such a user, IME, the answer is no.

(Interesting to see how defensive replies get. It's obvious the "tech" crowd intent to spy on web users is heavily reliant on certain assumptions to remain true forever. It shows that there is necessary pressure to keep web users using a "preferred" web browser and web ""features" that will subject them to "web fingerprinting". Perhaps the assumptions will always be true, conditions will never change, in the same way that interest rates could never change.)


> "bots". Not web browsers running Javascript, processing cookies, and so on

Even the simplest bots nowadays can run JavaScript and process cookies. What's much harder for a bot (or some other actor that has been doing shady things across many websites) to uniquely fake are things like the graphics card (WebGL vendor & renderer), audio, and other hardware, which get queried during fingerprinting.

Full fingerprinting is relatively expensive, so it originally was used by fintechs to combat fraudulent/automated signups, but with the third-party cookie situation it might be already economical to track regular users for ads/retargeting.
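Mechanically, a fingerprinting script boils down to reading attributes like those and hashing them into a stable visitor ID. A toy Python sketch (the attribute names and values are illustrative, not any real product's):

```python
import hashlib
import json

def visitor_id(attributes):
    """Stable hash over canonicalized attributes: the same browser yields
    the same ID, while changing any one attribute yields a different ID."""
    canonical = json.dumps(attributes, sort_keys=True)
    return hashlib.sha256(canonical.encode()).hexdigest()[:16]

browser = {
    "webgl_renderer": "ANGLE (NVIDIA GeForce RTX 3060)",  # illustrative value
    "audio_hash": "a91f...",                              # placeholder
    "timezone": "Europe/Berlin",
    "languages": ["de-DE", "en-US"],
}

id_a = visitor_id(browser)
id_b = visitor_id({**browser, "timezone": "UTC"})  # one spoofed attribute
print(id_a != id_b)  # True: a single spoofed value breaks the match
```

This also shows why spoofing helps: real products are more robust than a plain hash (fuzzy matching across attribute changes), but any attribute a browser randomizes per session still degrades the match.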


Let's take the CommonCrawl bot, "CCbot", as an example. There are no images, CSS, or JS files in the CommonCrawl archive. Is the CCbot running Javascript. Is it equivalent to a graphical web browser with all the same features.

GPT-3 was trained on a filtered version of CommonCrawl.

IMO, this is text-only web use. No (fingerprint-friendly) graphical web browser needed. Others may have different opinions. Perhaps I am biased as I use the web this way seven days a week.


>Do advertisers care about computer users who do not use graphical browsers much. As such a user, IME, the answer is no.

Almost nobody does this, so obviously not. You're probably in a group that makes up less than 0.0001% of web users. And that might even be generous.


I think you can still be fingerprinted without cookies or Javascript (e.g. with HSTS supercookies). It's obviously not as effective.



