More

marricks · 2025-03-19T18:18:57 1742408337

Most people dislike Trump, Musk, and Vance. Musk is by far the least liked of the three.

People in the country very much dislike the status-quo, hence the dislike of the Democratic Party and voters going for ape-shit change over Biden.

I sincerely doubt working people will see their lives improves this term and change their mind on Trump; there wasn't a liberal shift to the right and we'll likely see a major reaction of everyone going to the left when they realize dismantling everything hurt everyone.

marricks · 2025-03-10T18:14:29 1741630469

> Chrome is their project, they should be free to do whatever they want with it.

Google has a long history of "accidentally" breaking gmail on firefox and funneling users to Chrome back in the day. It's beyond stupid to argue they should be able to do whatever they want with their vertically integrated monopoly.

Like, if you want to dig holes in your own driveway sure whatever, but if you own all the roads in Detroit and you want to dig holes in them, then make a killing selling new tires and suspension repair a fair society wouldn't move out of Detroit, they'd fucking run you out of town.

jordanb · 2025-03-10T18:18:21 1741630701

Not even "back in the day". Youtube and Gsuite constantly break on firefox.

sneak · 2025-03-10T18:16:54 1741630614

If people don’t like this, they can stop using gmail. Neither Chrome, nor Gmail is a monopoly.

The more things Google does to make gmail less useful, the better.

It’s no secret that Google is an ad company. Anyone still using gmail deserves what they get.

marricks · 2025-03-10T18:22:21 1741630941

Why be bitter at the people dealing with the shit, why not be angry at the people making the world shit? My company uses gmail so I'm forced to use it.

kazinator · 2025-03-10T18:36:11 1741631771

Ah, but who is really making the world shit? Google and their ilk? Or the millions of sheep who use their stuff?

Would Google be making the world shit if all its cloud services had only a few dozen thousand users?

What's forcing you to interact with Google isn't Google, but Google users.

jay_kyburz · 2025-03-10T19:43:30 1741635810

How is your company forcing you to use gmail any worse than your company forcing you to use outlook? Is it your company that is making the world shit, or google.

sneak · 2025-03-10T18:25:53 1741631153

Everyone dealing with gmail is doing so because they chose to.

Let’s not pretend this was done unto them. Anyone can stop using gmail at any time.

eszed · 2025-03-10T19:04:05 1741633445

Indeed. I'd like to. Except Google also make it nigh impossible for anyone hosting their own email (the original-internet ideal) to get email into gmail reliably enough to be useful. I have my own address on my own domain, but can't rely on it (yes, DKIM and DMARC and SPF are properly set up) not to be marked "spam" for opaque reasons, so gmail remains my "main" address. It's a network-effect problem: once enough people are "captured", then everyone else is forced to join - or else be unable to participate.

It's a collective action problem: you'll have to persuade millions and millions of "normies", who have no idea what's going on, or what internet privacy is, or what's broken about the system, and who don't care to learn, and won't listen to us - or you'll have to impose regulation. Those are the choices. The second seems more possible than the first. Us nerds saying "walk away" is idealistic; we will, and always will, get squished, because the corps have the power and most folks won't (ever) care.

sneak · 2025-03-11T16:38:38 1741711118

This used to be true, but isn’t now. I self-host and can deliver to gmail just fine without being part of the deliverability cartel.

eszed · 2025-03-11T21:12:57 1741727577

OK, good to know. It's been a couple of years since the last time I made a serious effort. I may give it another try.

Who's your host, just in case that's the difference?

sneak · 2025-03-12T18:23:57 1741803837

Hetzner.

kazinator · 2025-03-10T18:38:00 1741631880

No, not for all types of "dealing with".

If you're dealing with spam originating from Gmail, without any helpful action from Google, that's not really your choice.

If you're dealing with difficulties sending mail to Gmail users, without help from Google, that's also not really your choice.

If vast numbers of other people stopped using Gmail, those problems would mostly go away.

DrillShopper · 2025-03-10T18:43:46 1741632226

GP Post: > My company uses gmail so I'm forced to use it.

Your post: > Everyone dealing with gmail is doing so because they chose to.

No, it's clear that not everyone dealing with Gmail is doing so because they chose to. Repeating your incorrect statement does not make it correct.

Further, everyone has to deal with its impacts on the email ecosystem as it's practically impossible for somebody who works a 9-5 to run their own mail server that Gmail will deign to not only accept mail from but also successfully deliver it to its intended recipient.

So even if I never use Gmail I still have to deal with replies going to / coming from it.

ziddoap · 2025-03-10T18:49:26 1741632566

>Anyone can stop using gmail at any time.

Just going to copy/paste this part of the comment you replied to, because it seems like you may have missed it?

>My company uses gmail so I'm forced to use it.

jpc0 · 2025-03-11T08:21:39 1741681299

GSuite/Workspace and consumer GMail is not the same thing in the slightest. They may use the same mail servers but that is about where the similarity ends.

I would recommend Google Workspace to any company because it gives them a ton of business productivity tools.

I would probably not recommend gmail as a users default personal email because frankly it's not that good.

The reality is most users have a Google account ans just use their Gmail account which is bundled.

Most of my circle which cares effectively use their Gmail account for sites that insist on it and never open that e-mail if they can get away with it.

hn_acc1 · 2025-03-10T20:31:14 1741638674

Not me - it's work mandated.

Not my wife - her school board mandates it.

znkynz · 2025-03-11T00:50:39 1741654239

i think you underestimate the effort for change of the average user with a @gmail.com address.

throwaway7679 · 2025-03-10T19:35:42 1741635342

> Anyone can stop using gmail at any time

True, and applies to many other things as well. Anyone claiming otherwise is shirking responsibility for their own actions. Every single sibling comment here suffers from this.

Arguments in the form of "other people do it, so I must also" are unpersuasive and pathetic.

klardotsh · 2025-03-10T18:40:40 1741632040

Except for anyone whose employer requires them to use Google services, since Google Apps (or whatever they call it these days) is a hugely popular offering for central company email/contacts/calendar/office suite. And frankly, it's better than dealing with Outlook and its unrelenting AI slop machine advertising.

carlosjobim · 2025-03-10T20:39:53 1741639193

You're behind with the times, words have new meanings

Organizations I don't like = Monopoly!

Organizations I like = ...

throwaway48476 · 2025-03-10T18:26:32 1741631192

The only thing that can stop a monopoly is a bigger monopoly, the government.

kazinator · 2025-03-10T18:34:15 1741631655

You don't own the roads in Detroit; the government owns most of them.

Gmail is not a government service. Google is free to make that work with only one browser, if they want.

You can't assert that Google must make Gmail work with any browser whatsoever, because that means supporting someone using Windows 95 with Internet Explorer 5.5.

marricks · 2025-03-10T18:52:50 1741632770

I'm not going to waste my time explaining to you what a metaphor is, but I will say this Firefox was the dominant player in the 00's 2010's when they did this, not the 2% market share it is now.

marricks · 2025-03-07T22:57:07 1741388227

Which is funny because the CEO level one is the easiest to automate

JKCalhoun · 2025-03-07T23:41:49 1741390909

Steve Jobs said something to the effect that he made maybe three CEO decisions a year. I mean, I think these are decisions like, "We're going to open our own line of Apple retail stores", but, still.

cj · 2025-03-07T23:57:42 1741391862

Being a CEO isn’t all that different from being a parent of a child from the POV of impactful decisions.

How many critical “parental decisions” have you made in the past week? Probably very few (if any), but surely you did a lot of reinforcement of prior decisions that had already been made, enforcing rules that were already set, making sure things that were scheduled were completed, etc.

Important jobs don’t always mean constantly making important decisions. Following through and executing on things after they’re decided is the hard part.

See also: diet and exercise

wkat4242 · 2025-03-07T23:25:14 1741389914

Playing golf while bantering with your old boys network is going to be hard to automate :)

reverius42 · 2025-03-08T00:09:07 1741392547

The banter is actually quite easy to automate. You can hire a human to play golf for a small fraction of what the CEOs get paid, and then it's best of both worlds.

ttepasse · 2025-03-08T21:41:33 1741470093

With preexisting knowledge of military artillery arithmetic a a golf robot should not be impossible.

aleph_minus_one · 2025-03-08T01:53:38 1741398818

The basic role of a CEO is to be the face of the company and market it to the varioua stakeholders.

This is hard to automatize.

t_mann · 2025-03-08T09:24:11 1741425851

Is it? Take a look at the bot accounts filling up social media (the non-obvious ones). It wouldn't seem to hard to make one that makes 2am posts about '[next product] feels like real AGI' or tells stock analysts that their questions are boring on an earnings call, which is apparently what rockstar CEOs do.

Sneers aside, I think one common mis-assumption is that the difficulty of automating a task depends on how difficult it feels to humans. My hinge is that it mostly depends on the availability of training data. That would mean that all the public-facing aspects of being a CEO should by definition be easy to automate, while all the non-public stuff (also a pretty important part of being a CEO, I'd assume) should be hard.

croes · 2025-03-08T13:18:54 1741439934

Sounds like those AI created influencers

marricks · 2025-03-06T17:15:41 1741281341

Could be chrome vs safari or ff

marricks · 2025-03-01T14:47:33 1740840453

> What’s the plan here?

There's only ever been one plan and it hasn't changed:

1) cut funding to a public service

2) said service goes to shit because it lacks support

3) use chaos as an excuse to privatize it so someone makes a buck

then move on to the next service

pchew · 2025-03-01T15:01:37 1740841297

That's the plan for every other federal service. For public land in particular there's an extra fun bonus step of selling the land to be exploited fully. Look at the Secretary of Interior's record in North Dakota.

mrtesthah · 2025-03-01T15:03:16 1740841396

who needs public lands in the first place?

https://time.com/7261219/patagonia-ceo-trump-shouldnt-sell-p...

tym0 · 2025-03-01T15:48:48 1740844128

It "worked" for the UK, now the cost of water keeps raising and the level of service keeps going down.

marricks · 2025-02-19T23:44:09 1740008649

One difficult I remember reading is: they can't always do things an obvious way someone else did because... patents.

atq2119 · 2025-02-20T05:00:53 1740027653

Which is ironic, considering that patents are supposed to be non-obvious.

fennecfoxy · 2025-02-20T13:47:27 1740059247

We're talking about a company that took another one to court because both of their products had rounded corners.

joquarky · 2025-02-20T05:29:04 1740029344

And similarly, copyright was supposed to "promote the Progress of Science and useful Arts", but here we are

marricks · 2025-02-18T17:13:51 1739898831

You end the project with less fingers than when you started?

marricks · 2025-02-06T06:46:15 1738824375

Perhaps aspirational in using AI to help the military :)

https://www.cnn.com/2025/02/04/business/google-ai-weapons-su...

marricks · 2025-01-29T17:49:26 1738172966

I'm not sure how that relates to original comment. Do you mean you want everything that is or could be better than American technology banned/destroyed so we stay the best...?

Like, any global hegemony will be increasingly corrupt given the power that gives, IMO.

marricks · 2025-01-29T15:13:29 1738163609

Also, DeepSeek is allegedly... better? So saying they just copied ClosedAI isn't really sufficient of an answer. Seems to be just bluster because the US Govt would probably accept any excuse to ban it, see TikTok.

throwup238 · 2025-01-29T15:29:04 1738164544

It’s not better. In most of my tests (C++/QT code) it just runs out of context before it can really do anything. And the output is very bad - it mashes together the header and cpp file. The reasoning output is fun to look at and occasionally useful though.

The max token output is only 8K (32K thinking tokens). O1 is 128k, which is far more useful, and it doesn’t get stuck like R1 does.

The hype around the DeepSeek release is insane and I’m starting to really doubt their numbers.

sho_hn · 2025-01-29T15:43:19 1738165399

Is this a local run of one of the smaller models and/or other-models-distilled-with-r1, or are you using their Chat interface?

I've also compared o1 and (online-hosted) r1 on Qt/C++ code, being a KDE Plasma dev, and my impression so far was that the output is roughly on par. I've given both models some tricky tasks about dark corners of the meta-object system in crafting classes etc. and they came up with generally the same sort of suggestions and implementations.

I do appreciate that "asking about gotchas with few definitive solutions, even if they require some perspective" and "rote day-to-day coding ops" are very different benchmarks due to how things are represented in the training data corpus, though.

throwup238 · 2025-01-29T15:50:41 1738165841

I use it through Kagi Assistant which has the proper R1 model through Together.ai/Fireworks.ai

My standard test is to ask the model to write a QSyntaxHighlighter subclass that uses TreeSitter to implement syntax highlighting. O1 can do it after a few iterations, but R1’s output has been a mess. That said, its thought process revealed a few issues that I then fixed in my canonical implementation.

nialv7 · 2025-01-29T16:09:22 1738166962

Tried this on chat.deepseek.com, it seems to be able to do it.

throwup238 · 2025-01-29T16:13:47 1738167227

Does it compile? Put the full chat in Pastebin and let’s check it out!

I haven’t used their official chat interface or API for privacy reasons.

CamperBob2 · 2025-01-29T16:53:44 1738169624

Some have said (for what little that's worth) that Kagi's version is not the real thing, but one of the distillations.

sho_hn · 2025-01-29T15:58:33 1738166313

Thanks for adding detail! My prompts have been very in-the-bubble-of-Qt I'd say, less so about mashing together Qt and something else, which I agree is a good real-world test case.

throwup238 · 2025-01-29T16:12:10 1738167130

I haven’t had the chance to try it out with R1 yet but if you implement a debugger class that screenshots the widget/QML element, dumps its metadata like GammaRay, and includes the source, you can feed that context into Sonnet and o1. They are scarily good at identifying bugs and making modifications if you include all that context (although you have to be selective with what metadata you include. I usually just dump a few things like properties, bindings, signals, etc).

gliptic · 2025-01-29T15:34:32 1738164872

R1 is trained for a context length of 128K. Where are you getting 8K/32K? The model doesn't distinguish "thinking" tokens and "output" tokens, so this must be some specific API limitations.

throwup238 · 2025-01-29T15:35:28 1738164928

> max_tokens：The maximum length of the final response after the CoT output is completed, defaulting to 4K, with a maximum of 8K. Note that the CoT output can reach up to 32K tokens, and the parameter to control the CoT length (reasoning_effort) will be available soon. [1]

[1] https://api-docs.deepseek.com/guides/reasoning_model

gliptic · 2025-01-29T15:39:09 1738165149

So yes, it's a limitation of their own API at the moment, not a model limitation.

throwup238 · 2025-01-29T15:45:13 1738165513

I’m using it through Kagi which doesn’t use Deepseek’s official API [1]. That limitation from the docs seems to be everywhere.

In practice I don’t think anyone can economically host the whole model plus the kv cache for the entire context size of 128k (and I’m skeptical of Deepseek’s claims now anyway).

Edit: a Kagi team member just said on Discord that they’ll be increasing max tokens next release

[1] https://help.kagi.com/kagi/ai/llms-privacy.html

coliveira · 2025-01-29T15:48:18 1738165698

He's just repeating a lot of disinformation that has been released about deepseek in the last few days. People who took the time to test DeepSeek models know that the results have the same or better quality for coding tasks.

goosejuice · 2025-01-29T16:01:37 1738166497

Benchmarks are great to have but individual/org experiences on specific codebases still matter tremendously.

If an org consistently finds one model performs worse on their corpus than another, they aren't going to keep using it because it ranks higher in some set of benchmarks.

hn_throwaway_99 · 2025-01-29T17:16:25 1738170985

But you should also be very wary of these kind of anecdotes, and this thread highlights exactly why. That commenter says in another comment (https://news.ycombinator.com/item?id=42866350) that the token limitation that he is complaining about has actually nothing to do with DeepSeek's model or their API, but is a consequence of an artificial limit that Kagi imposes. In other words, his conclusion about DeepSeek is completely unwarranted.

throwup238 · 2025-01-29T17:26:49 1738171609

It mashed the header and C++ file together, which is egregiously bad in the context of QT. This isn’t a new library, it’s been around for almost thirty years. Max token sizes have nothing to do with that.

I invite anyone to post a chat transcript showing a successful run of R1 against this prompt (and please tell me which API/service it came from so I can go use it too!)

goosejuice · 2025-01-30T15:18:07 1738250287

I wasn't suggesting using the anecdotes of others to make a decision.

I'm talking about individuals and organizations making a decision on whether or not to use a model based on their own testing. That's what ultimately matters here.

sheepdestroyer · 2025-01-29T15:45:52 1738165552

There are R1 providers on openrouter with bigger input/output token limitations than what DeepSeek's API access currently offers.

For instance Fireworks offers R1 with 164K/164K. They are far more expensive than DeepSeek though

api · 2025-01-29T16:28:31 1738168111

It's not great at super-complex tasks due to limited context, but it's quite a good "junior intern that has memorized the Internet." Local deepseek-r1 on my laptop (M1 w/64GiB RAM) can answer about any question I can throw at it... as long as it's not something on China's censored list. :)

azinman2 · 2025-01-29T20:09:31 1738181371

How are you running r1 on 64mb of ram? I’m guessing you’re running a distill which is not r1

api · 2025-01-29T23:01:48 1738191708

The 70b distill at 4bit quantize fits, so yes, and performance and quality seem pretty good. I can't run the gigantic one.

azinman2 · 2025-01-31T06:13:13 1738303993

Ok but that’s not deepseek-r1. Lots of people keep saying this for distills and it’s getting very confusing.

adamnemecek · 2025-01-29T15:31:09 1738164669

Thanks for saying this, I thought I was insane, DeepSeek is kinda bad. I guess it’s impressive all things considered but in absolute terms it’s not great.

coliveira · 2025-01-29T15:51:41 1738165901

I have run personal tests and the results are at least as good as I get from OpenAI. Smarter people have also reached the same conclusion. Of course you can find contrary datapoints, but it doesn't change the big picture.

sebzim4500 · 2025-01-29T15:58:51 1738166331

To be fair, it's amazing by the standards of six months ago. The only models that beat it are o1, the latest gemini models and (for some things) sonnet 3.6

cdelsolar · 2025-01-29T20:56:48 1738184208

false. It seems better than o1 to me.

marricks · 2025-01-29T15:35:20 1738164920

> it just runs out of context before it can really do anything

I mean, couldn't that be because they're just overwhelmed by users at the moment?

> And the output is very bad - it mashes together the header and cpp file

That sounds way worse, and like, not something caused by being hugged to death though.

Aider recently stated DeepSeek is placed a the top of their benchmark though[1] so I'm inclined to believe it isn't all hype.

[1] https://aider.chat/docs/llms/deepseek.html

throwup238 · 2025-01-29T15:41:09 1738165269

It’s definitely not all hype, it really is a breakthrough for open source reasoning models. I don’t mean to diminish their contribution, especially since being able to read the reasoning output is a very interesting new modality (for lack of a better word) for me as a developer.

It’s just not as impressive as people make it out to be. It might be better than o1 on Python or Javascript thats all over the training data, but o1 is overwhelmingly better at anything outside the happy path.

beAbU · 2025-01-29T15:20:47 1738164047

How can they ban something thats open source that you can just run on your own hardware?

fabianhjr · 2025-01-29T15:24:59 1738164299

There are illegal numbers in the USA land of the "free".

https://en.wikipedia.org/wiki/Illegal_number

> An AACS encryption key (09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0) that came to prominence in May 2007 is an example of a number claimed to be a secret, and whose publication or inappropriate possession is claimed to be illegal in the United States.

JumpCrisscross · 2025-01-29T15:34:18 1738164858

> illegal numbers in the USA land of the "free"

This is a silly take for anyone in tech. Any binary sequence is a number. Any information can be, for practical purposes, rendered in binary [1].

Getting worked up about restrictions on numbers works as a meme, for the masses, because it sounds silly, but is tantamount to technically arguing against privacy, confidentiality, the concept of national secrets, IP as a whole, et cetera.

[1] https://en.m.wikipedia.org/wiki/Shannon%27s_source_coding_th...

fabianhjr · 2025-01-29T15:59:58 1738166398

Good thing that is part of the wikipedia entry:

> Any piece of digital information is representable as a number; consequently, if communicating a specific set of information is illegal in some way, then the number may be illegal as well.

sheepdestroyer · 2025-01-29T15:59:37 1738166377

All those things are not self-evident and thus debatable

JumpCrisscross · 2025-01-29T16:17:36 1738167456

> not self-evident and thus debatable

Totally agree. But prompting debate or even further thought isn’t the point of the meme.

sheepdestroyer · 2025-01-29T16:26:23 1738167983

I'd argue that, as satire, it's the main point ;)

JumpCrisscross · 2025-01-29T16:34:29 1738168469

> as satire, it's the main point

There is thought-stopping satire and thought-provoking satire. Much of it depends on the context. I’m not getting the latter from a “USA land of the ‘free’” comment.

suraci · 2025-01-29T16:20:56 1738167656

> is collecting rain water illegal?

> It depends on where you live. In many places, collecting rainwater is completely legal and even encouraged, but some regions have regulations or restrictions.

United States: Most states allow rainwater collection, but some have restrictions on how much you can collect or how it can be used. For example, Colorado has limits on the amount of rainwater homeowners can store. Australia: Generally legal and encouraged, with many homes using rainwater tanks. UK & Canada: Legal with few restrictions. India & Many Other Countries: Often encouraged due to water scarcity.

bloopernova · 2025-01-29T15:36:48 1738165008

That takes me back! Fark.com would delete any comment that contained random hexadecimal.

KPGv2 · 2025-01-29T15:40:40 1738165240

It was the beginning of the end for Digg, too, IIRC. Started a lot of people leaving for Reddit, right?

bloopernova · 2025-01-29T15:47:41 1738165661

I think so; I joined Reddit when it was in tech news as people left Digg after the big redesign. I'm not sure when the exodus started. I left Fark over the hd-dvd mess.

KPGv2 · 2025-01-29T15:40:08 1738165208

> whose publication or inappropriate possession is claimed to be illegal in the United States.

That's not the same thing as a number being illegal at all. Here, watch this:

> I claim breathing is illegal in the United States

There, now breathing is claimed to be illegal in the United States.

I-M-S · 2025-01-29T15:47:38 1738165658

In both cases, legality depends entirely on repercussions, i.e. if there's someone to enforce the ban. I suspect that in the "illegal numbers" case there might be.

vluft · 2025-01-29T15:54:33 1738166073

man that's very concerning for wikipedia who is publishing it right there on the page linked above.

dylan604 · 2025-01-29T16:14:58 1738167298

Only concerning if they are a US based company hosting their data in US data centers. oops

shafyy · 2025-01-29T15:24:56 1738164296

It's not open source. The provide the model and the weights, but not the source code and, crucially, the training data. As long as LLM makers don't provide the training data (and they never will, because then they will be admitting to stealing), LLMs are never going to be open source.

sho_hn · 2025-01-29T15:29:23 1738164563

Thanks for reminding people of this.

Open source means two things in spirit:

(a) You have everything you need to be able to re-create something, and at any step of the process change it.

(b) You have broad permissions how to put the result to use.

The "open source" models from both Meta so far fail either both or one of these checks (Meta's fails both). We should resist the dilution of the term open source to the point where it means nothing useful.

jprete · 2025-01-29T15:36:08 1738164968

I think people are looking for the term "freeware" although the connotations don't match.

sho_hn · 2025-01-29T16:10:27 1738167027

Agreed, but the "connotations don't match" is mostly because the folks who chose to call it open source wanted the marketing benefits of doing so. Otherwise it'd match pretty well.

KPGv2 · 2025-01-29T15:43:18 1738165398

At the risk of being called rms, no, that's not what open source means. Open source just means you have access to the source code. Which you do. Code that is open source but restrictively licensed is still open source.

That's why terms like "libre" were born to describe certain kinds of software. And that's what you're describing.

This is a debate that started, like, twenty years ago or something when we started getting big code projects that were open source but encumbered by patents so that they couldn't be redistributed, but could still be read and modified for internal use.

jefftk · 2025-01-29T15:57:14 1738166234

> Open source just means you have access to the source code.

That's https://en.wikipedia.org/wiki/Source-available_software , not 'open source'. The latter was specifically coined [1] as a way to talk about "free software" (with its freedom connotations) without the price connotations:

The argument was as follows: those new to the term "free software" assume it is referring to the price. Oldtimers must then launch into an explanation, usually given as follows: "We mean free as in freedom, not free as in beer." At this point, a discussion on software has turned into one about the price of an alcoholic beverage. The problem was not that explaining the meaning is impossible—the problem was that the name for an important idea should not be so confusing to newcomers. A clearer term was needed. No political issues were raised regarding the free software term; the issue was its lack of clarity to those new to the concept.

[1] https://opensource.com/article/18/2/coining-term-open-source...

HDThoreaun · 2025-01-29T16:01:51 1738166511

You dont get to redefine what "open" means.

jefftk · 2025-01-29T16:17:24 1738167444

It's common for terms to have a more specific meaning when combined with other terms. "Open source" has had a specific meaning now for decades, which goes beyond "you can see the source" to, among other things, "you're allowed to it without restriction".

RobotToaster · 2025-01-29T16:59:23 1738169963

So Swedish meatballs are any ball of meat made in Sweden?

And French fries are anything that was fried in France?

davidcbc · 2025-01-29T17:20:03 1738171203

Tell that to Sam Altman

esafak · 2025-01-29T21:54:54 1738187694

He did not succeed, did he?

dTal · 2025-01-29T16:30:19 1738168219

I don't know why you've been downvoted. This is a 100% correct history. "Open source" was specifically coined as a synonym to "free software", and has always been used that way.

sho_hn · 2025-01-29T15:47:42 1738165662

> Open source just means you have access to the source code. Which you do.

No, they also fail even that test. Neither Meta nor DeepSeek have released the source code of their training pipeline or anything like that. There's very little literal "source code" in any of these releases at all.

What you can get from them is the model weights, which for the purpose of this discussion, is very similar to compiler binary executable output you cannot easily reverse, which is what open source seeks to address. In the case of Meta, this comes with additional usage limitations on how you may put them to use.

As a sibling comment said, this is basically "freeware" (with asterisks) but has nothing to do with open source, either according to RMS or OSI.

> This is a debate that started, like, twenty years ago

For the record, I do appreciate the distinction. This isn't meant as an argument from authority at all, but I've been an active open source (and free software) developer for close to those 20 years, am on the board of one of the larger FOSS orgs, and most households have a few copies of FOSS code I've written running. It's also why I care! :-)

nuancebydefault · 2025-01-29T18:28:28 1738175308

The weights, which are part of the source, are open. Now you are arguing it not being open source because they don't provide the source for that part of the source. If you follow that reasoning you can ad infinitum claim the absence of sources since every source originates from something.

Kerbonut · 2025-01-30T05:53:59 1738216439

The source is the training data and the code used to turn the training data _into_ the weights. Thus GP is correct, the weights are more akin to a binary from a traditional compiler.

nuancebydefault · 2025-01-30T19:11:11 1738264271

To me this 'source' requirement does not make sense. It is not that you bring training data and the application together and press a train button, there's much more actions involved.

Also the training data is of a massive amount.

Additionally, what about human in the loop training, do you deliver humans as part of the source?

JumpCrisscross · 2025-01-29T15:56:26 1738166186

> they also fail even that test. Neither Meta nor DeepSeek have released the source code of the

This debate is over and makes the open source community look silly. Open model and weights is, practically speaking, open source for LLMs.

I have tremendous respect for FOSS and those who build and maintain it. But arguing for open training data means only toy models can practically exist. As a result, the practical definition will prevail. And if the only people putting forward a practical definition are Meta et al, this is what you get: source available.

sho_hn · 2025-01-29T16:02:43 1738166563

I'm not arguing for open training data BTW, and the problem is exactly this sort of myopic focus on the concerns of the AI community and the benefits of open-washing marketing.

Completely, fully breaking the meaning of the term "open source" is causing collateral damage outside the AI topic, that's where it really hurts. The open source principle is still useful and necessary, and we need words to communicate about it and raise correct expectations and apply correct standards. As a dev you very likely don't want to live in a tech environment where we regress on this.

It's not "source available" either. There's no source. It's freeware.

"I can download it and run it" isn't open source.

I'm actually not too worried that people won't eventually re-discover the same needs that open source originally discovered, but it's pretty lame if we lose a whole bunch of time and effort to re-learn some lessons yet again.

JumpCrisscross · 2025-01-29T16:12:59 1738167179

> it's pretty lame if we lose a whole bunch of time and effort to re-learn some lessons yet again

We need to relearn because we need a different definition for LLMs. One that works in practice, not just at the peripheries.

Maybe we can have FOSS LLMs vs open-source ones, like we do with software licenses. The former refers to the hardcore definition. The latter the practical (and widely used) one.

sho_hn · 2025-01-29T16:14:25 1738167265

Sure, I don't disagree. I fully understand the open-weights folks looking for a word to communicate their approach and its benefits, and I support them in doing so. It's just a shame they picked this one in - and that's giving folks a lot of benefit of the doubt - a snap judgement.

> Maybe we can have FOSS LLMs vs open-source ones, like we do with software licenses.

Why not just call them freeware LLMs, which would be much more accurate?

There's nothing "hardcore" or "zealot" about not calling these open source LLMs because there's just ... absolutely nothing there that you call open source in any way. We don't call any other freeware "open source" for being a free download with a limited use license.

This is just "we chose a word to communicate we are different from the other guys". In games, they chose to call it "free to play (f2p)" when addressing a similar issue (but it's also not a great fit since f2p games usually have a server dependency).

JumpCrisscross · 2025-01-29T16:32:00 1738168320

> Why not just call them freeware LLMs, which would be much more accurate?

Most of the public is unfamiliar with the term. And with some of the FOSS community arguing for open training data, it was easy to overrule them and take the term.

sho_hn · 2025-01-29T16:50:50 1738169450

Most of the public is also unfamiliar with the term open source, and I'm not sure they did themselves any favors by picking one that invites far more questions and needs for explanation. In that sense, it may have accomplished little but its harmful effects.

I get your overall take is "this is just how things go in language", but you can escalate that non-caring perspective all the way to entropy and the heat death of the universe, and I guess I prefer being an element that creates some structure in things, however fleeting.

JumpCrisscross · 2025-01-29T18:01:45 1738173705

> Most of the public is also unfamiliar with the term open source

I’d argue otherwise. (Familiar with, not know.) Particularly in policy circles.

> picking one that invites far more questions and needs for explanation

There wasn't ever a debate. And now, not even the OSI demands training data. (It couldn’t. It, too, would be ignored.)

Flimm · 2025-01-29T16:54:47 1738169687

The only practical and widely used definition of open source is the one known as the Open Source Definition published by the OSI.

The set of free/libre licenses (as defined by the FSF) is almost identical to the set of open sources licenses (as defined by the OSI).

The debate within FOSS communities has been between copyleft licenses like the GPL, and permissive licenses like the MIT licence. Both copyleft and permissive licenses are considered free/libre by the FSF, and both of them are considered open source by the OSI.

HDThoreaun · 2025-01-29T15:39:49 1738165189

Open source means the source code is freely available. It’s in the name.

idle_zealot · 2025-01-29T15:53:06 1738165986

The source being available means the code is "source available." Open implies more rights.

coliveira · 2025-01-29T15:56:17 1738166177

People say this, but when it comes to AI models, the training data is not owned by these companies/groups, so it cannot be "open sourced" in any sense. And the training code is basically accessing that training data that cannot be open sourced, therefore it also cannot be shared. So the full open source model you wish to have can only provide subpar results.

sheepdestroyer · 2025-01-29T16:01:29 1738166489

They could easily list the data used though. These datasets are mostly known and floating around. When they are constructed, instructions for replication could be provided too

coliveira · 2025-01-29T16:03:41 1738166621

They could, but even if they give this list the detractors will still say it is not open source.

rvnx · 2025-01-29T16:24:17 1738167857

yes and as a bonus they may get sued, which in the long-term, makes free / offline models to not be viable

It would be so much better if all models were trained with LibGen.

Timon3 · 2025-01-29T17:57:38 1738173458

Isn't this the same situation that any codebase faces when one thinks about open sourcing it? I can't legally open source the code I don't own.

beAbU · 2025-01-29T15:33:30 1738164810

Thanks, I was not aware of this distinction.

But I think my argument still stands though? Users can run Deepseek locally, so unless the US Gov't wants to reach for book burning levels or idiocy, there is not really a feasible way to ban the American public of running DeepSeek, no?

shafyy · 2025-01-29T21:52:08 1738187528

Yes, your argument still stands. But I think it's important to stand firm that the term "open source" is not a good label for what these "freeware" LLMs are.

beAbU · 2025-01-29T22:47:53 1738190873

Fair point, agreed.

superkuh · 2025-01-29T15:32:08 1738164728

There was an executive order passed by the previous administration that make using anything with more than 10 billion parameters illegal and punishable by government force if done without authorization. Of course like most government regulations (even though this is not a regulation, it is an executive action) the point is not to stop the behavior but instead to create a system where everyone breaks the regulation constantly so that if anyone rocks the boat they can be indicted/charged and dealt with.

https://www.federalregister.gov/documents/2023/11/01/2023-24...

>(k) The term “dual-use foundation model” means an AI model that is trained on broad data; generally uses self-supervision; contains at least tens of billions of parameters; is applicable across a wide range of contexts; and that exhibits, or could be easily modified to exhibit, high levels of performance at tasks that pose a serious risk to security, national economic security, national public health or safety, or any combination of those matters, such as by: ...

ceejayoz · 2025-01-29T15:41:23 1738165283

That order does not "make using anything with more than 10 billion parameters illegal and punishable by government force if done without authorization".

It orders the Secretary of Commerce to "solicit input from the private sector, academia, civil society, and other stakeholders through a public consultation process on potential risks, benefits, other implications, and appropriate policy and regulatory approaches related to dual-use foundation models for which the model weights are widely available".

derektank · 2025-01-29T15:50:51 1738165851

Many regulations are created by executive action, without input from Congress. The Council on Environmental Quality, created by the National Environmental Policy Act, has the power to issue it's own regulations. Executive Orders can function similarly and the executive can order rulemaking bodies to create and remove regulations, though there is a judicial effort to restrict this kind of policymaking and return regulatory power back to Congress.

Spooky23 · 2025-01-30T03:37:55 1738208275

There’s an effort to restrict certain regulatory rule-making where it’s ideologically convenient, but it isn’t “returning” regulatory power. That rulemaking authority isn’t derived by some bullshit executive order, but by Federal law, as implemented by congress.

Congress has never ceded power to anyone. They wield legislative authority and power of the purse, and wield it as they see fit. The special interests campaigning about this are extreme reactionaries whose stated purpose is to make government ineffective.

bilekas · 2025-01-29T15:32:53 1738164773

If I'm no wrong wasn't PGP encryption once illegal to export ? Not quite the same but the government has a nice habit of feeling like they can bad the export of research.

https://en.wikipedia.org/wiki/Export_of_cryptography_from_th...

Prbeek · 2025-01-29T15:45:34 1738165534

Add PS1 too. The US government banned sale of PlayStation to China because the PLA would apparently have access to cutting edge chips for their missiles

beAbU · 2025-01-29T15:34:25 1738164865

You are right, but I cannot find a single example of such a ban actually being effective though. Information wants to be free and all that.

KPGv2 · 2025-01-29T15:45:32 1738165532

Because you haven't heard of the proprietary software that wasn't ever sold internationally because of these bans.

Of course Joe Sixpack can throw their code up anywhere, but Joe Corporation gets wrecked if they try to sell it.

https://developer.apple.com/documentation/security/complying...

For example, this is enforced by Apple Store.

coliveira · 2025-01-29T15:59:29 1738166369

But that's not the goal, the goal is to protect the "intelectual property" only to American companies. Countries not in the "friends list" cannot sell products in that area without suffering repercussions. That's how the US has maintained technological dominance in some areas by restricting what other countries can do.

calgoo · 2025-01-29T16:04:02 1738166642

If i remember correctly, if you changed the dropdown on the webpage to USA you could download the full version of PGP anyway.

michaelt · 2025-01-29T15:50:15 1738165815

Make commercial hosting illegal, and make the hardware to run it locally cost $6000+

Drakim · 2025-01-29T15:24:05 1738164245

They banned certain branches of math during the cold war, it can be done.

jerry80 · 2025-01-29T16:07:29 1738166849

Such as?

Drakim · 2025-01-31T21:45:37 1738359937

All non-trivial encryption algorithms.

https://en.wikipedia.org/wiki/Crypto_Wars

semking · 2025-01-29T15:15:48 1738163748

I never said they are just a clone! There's an actual tech breakthrough!

Read the two following sections of my blog post:

1. "Distilled language models"

2. "DeepSeek: Less supervision"