I wonder when the environmentalists will realize what is going on right now, and start to protest AI usage in general. I was already wondering the same when the open source community started to CI every damn PR and commit, but I guess I was too optimistic with that one.
I don't want to sound skeptical, but this is what Crypto people used to say when it was very new.
It was supposedly worth all the power expenditure, because changing the world needed energy. Now we see where we are.
I'm inside this "newfangled AI thing". There are groups which create value, but they create value for everybody. The humans and the nature in general, and they use AI for scientific ends. Medical image processing, ecosystem monitoring, etc. etc.
Letting bots loose on the internet, letting them consume what they say and making them answer "Sauce is a food taste enhancer, and dressing is used to keep wounds clean while allowing them to heal. A standard serving of a dressing is two spoons".
To be fair, newer research is demonstrating that smaller more power efficient models with the same performance are possible, so the hope is that these giant LLMs are just a stepping stone to a less energy hungry place. In contrast, proof of work fundamentally needs more energy then bigger the network gets. It's no guarantee but we can at least see some hope that as energy impact drops and increasing value is found that 'AI' will cross the threshold of being worth the energy.
Edit: although yes I do agree that the 'value' part is tricky. If internet spam can generate more 'value' for some people than doing science, then when intelligence is cheap we are in for a rough time.
To be clear, I'm not against AI or LLM as a technology in general. What I'm against is the unethical way how these LLMs trained and how people are dismissive of the damage they're doing and saying "we're doing something amazing, we need no permission".
Also, I'm very aware that there are many smaller models in production which can run real-time with negligible power and memory requirements (i.e. see human/animal detection models in mirrorless cameras, esp. Sony and Fuji).
However, to be honest I didn't see the same research on LLMs yet. Can you share if you have any, because I'd be glad to read them.
Lastly, I'm aware that AI is not something only covers object detection, NLP, etc. You can create very useful and light AI systems for many problems, but how LLMs pumped with that unstopping hype machine bothers me a lot.
I disagree. Crypto people kept suggesting that crypto was a solution to an X problem while ignoring that a database was a better solution the problem.
I’ve yet to hear any good use cases for crypto, and I’ve been asking for years on here. Meanwhile there are a bunch of AI tools out there that are working and helping.
AI is a gigantic landscape with tons of different applications to different problems, and there are many solutions which work for a given problem.
However, if we narrow what AI is to LLMs, we have a stochastic parrot which needs to be fed the world literally to enable it to create semi-coherent sentences about something being asked. More importantly, what that parrot says doesn't have to be true, it can't be guaranteed to be true, and can't be verified about its accuracy about its slop.
And you spend gigawatts of power just to train this thing which selects and prints words based on probability and some randomness.
I think it's premature to be integrating LLMs into operating systems. That said I think they're very valuable, and the training is fundamental research. I feel like complaining about the resources used to train new models is a bit like complaining about the resources used to build experimental fusion reactors or particle accelerators. The fact that we're seeing direct applications is a bonus, but it's still more like fundamental research than anything.
People are spending all that money training because they are trying to fix the problems you're complaining about, and this includes fixing the power consumption problem. If we can create 3B parameter models that have capabilities on par with today's 405B parameter models, that's worth spending a lot of energy training. But nobody knows what is possible, so they have to try. I feel like you're basically arguing nobody should try because you don't believe they will ever improve, but that seems contradicted by the general trajectory of how things have been working the past decade. More resources spent on training means more efficient and useful models.
You've moved the goalposts from AI to LLM's. Fair enough, we've been doing AI since the 50's, and this is the second AI boom in a decade.
Those "stochastic parrots" have still proven that they are immensely useful. You might not personally find value out of coding assistants, but many many people do (as an example). People are (allegedly) turning to LLM's rather than StackOverflow for help [0]. They work well for boilerplate where you're an SME and able to validate the output - I can review 10x the amount of code I can write for example. They work (remarkably) well for summarising input text. An example - I semi occasionally (3-4x per year) have to deal with a few hundred GB of audio files that need cleanup. The cleanup tasks are "run FFMPEG with parameters", except I can not ever remember the parameters (they're different for different things). I can: read https://ffmpeg.org/ffmpeg.html or I can ask ChatGPT to write a script to clip the silence and add a 0.5 second intro fade to every file in a specified folder, and the entire task is done before I've even thought about it. I get to focus on what I want to, rather than munging data around.
If you expand your definition from LLMs to Transformers, then you get Whisper as a stand out example of something awesome. There's definitely negatives, but things like Diffusion are being used outside of image generation for drug discovery. We're not going to yolo AI generated drugs into human testing, but we can save an awful lot of screwing around to find something viable.
> And you spend gigawatts of power just to train this thing which selects and prints words based on probability and some randomness.
> That doesn't solve any problems.
I disagree, it does solve problems. A very fair question to ask is "is it worth the cost" and I would agree that it's not worth the cost. That doesn't mean it doesn't solve real problems.
> Crypto people kept suggesting that crypto was a solution to an X problem while ignoring that a database was a better solution the problem.
Only if you ignore the fact that a database (in traditional sense) doesn't solve the problem of decentralized peer to peer payments, which is the key differentiator of cryptocurrencies.
> I’ve yet to hear any good use cases for crypto, and I’ve been asking for years on here.
If you've been asking for years, I'm sure that someone, at some point, has told you about crypto's censorship resistance and international payments in places that are poorly served by the banking system for a variety of reasons.
Would you like to hear more or have you already dismissed these as "not good use cases"? It would be nice to differentiate between use cases that don't apply to you personally, and use cases that don't apply to anyone.
> Only if you ignore the fact that a database (in traditional sense) doesn't solve the problem of decentralized peer to peer payments, which is the key differentiator of cryptocurrencies.
And the point that the useful cases cede to make a useful product. The thing that is a _feature_ of a cryptocurrency is why people don't use it. I've had this debate dozens of times on here.
> If you've been asking for years, I'm sure that someone, at some point, has told you about crypto's censorship resistance and international payments in places that are poorly served by the banking system for a variety of reasons.
You know what else solves that? Cash and Western Union. And it has done for a long, long time.
> The thing that is a _feature_ of a cryptocurrency is why people don't use it.
People do, in fact, use them. Is it a popular payment method in western countries? No, but do some people use it? Yes, they do.
For privileged people, decentralization is usually a serious flaw. For others, it's an extremely important characteristic that lets them transact at all. The world isn't black and white, and people have use cases that are different from yours.
You're being self-centered, and that's okay, but perhaps you should factor it into your mental model before making sweeping statements in front of a global audience.
> You know what else solves that? Cash and Western Union. And it has done for a long, long time.
Not nearly as well, or there wouldn't be anyone using cryptocurrencies for that purpose.
You could make identical boring, bad-faith arguments about AI products. I think 99.99% of all "AI" products available today are completely useless - to me - but I don't go around proclaiming that all of AI is completely useless, and that all of its problem areas are better solved by statistics and "if" statements.
Don't mistake your own privilege, ignorance, and lack of imagination with the lack of real-world applications.
> You're being self-centered, and that's okay, but perhaps you should factor it into your mental model before making sweeping statements in front of a global audience.
> but I don't go around proclaiming that all of AI is completely useless, and that all of its problem areas are better solved by statistics and "if" statements.
> Don't mistake your own privilege, ignorance, and lack of imagination with the lack of real-world applications.
If you can't make your point without making sniping attacks about my character, then this isn't a conversation I want to continue having.
Privileged person is anyone living in a western country who hasn't had to deal with censorship. I consider myself to be a privileged person in that regard. That's not an attack on anyone's character.
> You're being self-centered
That's anyone who fails to consider use cases other than their own. I wasn't speaking to your character, It was a description of your reply, not your character, because it contained sweeping statements that only apply to certain groups of people.
> but I don't go around proclaiming that all of AI is completely useless, and that all of its problem areas are better solved by statistics and "if" statements.
That's not an attack on anyone?
> Don't mistake your own privilege, ignorance, and lack of imagination with the lack of real-world applications.
I've explained that privilege isn't an attack on anyone's character. As for the rest, sorry, but which words am I supposed to use when someone denies that a problem is real (which fair enough, I'll elaborate), later admits that there are other services that solve the same problem, but they still want to claim that there are no problems that the obscure product is solving, despite that product having real-world users who are using it for that exact problem?
In terms of generative AI, for general use cases, Open AI reported having 11 million paid subscribers last quarter. People clearing paying for Adobe Firefly and Midjourney access. That's already a ton more people finding it useful in day to day life than Crypto ever had.
It is certainly reasonable to suspect that the scale of investments (in trillions of dollars) don't match the scale of the opportunity. But it's a bit silly to pretend that no one is getting any value out of this.
At the end of the day, data centers are 2% of energy use, according to the IEA. That's trending up, but even in couple of years, data center stuff is mostly going to be typical cloud stuff, then crypto, and then a fraction for AI.
I never heard of any reasonable uses for Crypto or Blockchain in general. A lot of people tried to sell us various things at the time but it was very obvious that it had no real value.
AI is already implemented into businesses in various ways. Even if it’s not done so official you still have loads of employees pouring company secrets into chatGPT and Claude because they work.
Manga Library Z, a manga archiving site that distributed old and out-of-print manga for free has been forced to close down due to all major credit card companies refusing to provide payment services. If some hypothetical widespread decentralized payment system can prevent scenarios like this one from happening, then it would be worth the "enormous waste of resources". These days, you're essentially relegated to a non-person if card companies stop allowing you to use their services.
Windows Update is another big one, the way it makes every Windows computer spin the fans like crazy on a regular basis. Probably room for improvement there.
maybe they could just make the software not be completely shit instead ?
why does Windows take 2 minutes to decide what updates to install, then 45 minutes to install them when Debian on the same machine can do both in under 30 seconds?