treesciencebot · 2025-04-25T23:11:09 1745622669

treesciencebot · 2025-03-05T23:38:04 1741217884

GH200 is nowhere near the $343,000 number. You can get a single-server order for around 45k (with the Inception discount). If you are buying in bulk, it goes down to sub-30k-ish. This comes with an H100's performance and an insane amount of high-bandwidth memory.

wmf · 2025-03-05T23:45:39 1741218339

They probably meant 8xH200 for $343,000 which is in the ballpark.

zitterbewegung · 2025-03-06T00:43:23 1741221803

Yes, this is what I meant, since 8 would cover 512GB of RAM.

treesciencebot · 2025-01-22T23:33:28 1737588808

At a relatively new high-rise in Rincon Hill, AT&T still charges $80-90 for 1 gig symmetrical (same with Webpass/Xfinity).

treesciencebot · 2025-01-19T01:52:09 1737251529

For traditional LLMs this might be true (especially large MoEs at bs=1), but I highly disagree with the "multi-modal models" phrase, since most models that output in other modalities are generally compute bound. That means fewer FLOPs will make the experience so much worse (imagine waiting a couple of minutes for an image and hours for videos).

treesciencebot · 2024-12-17T23:54:17 1734479657

For anyone who wants to test the original (non-distilled) HunyuanVideo (which is an amazing model), we have a 580p version taking under a minute and a 720p version taking around 2.5-3 minutes in our playground: https://fal.ai/models/fal-ai/hunyuan-video (it requires GitHub login and is pay-per-use, but new accounts get some free credits).

echelon · 2024-12-18T03:07:09 1734491229

Open source video models are going to beat closed source. Ecosystem and tools matter.

Midjourney has name recognition, but nobody talks about Dall-E anymore. The same will happen to Sora. Flux and Stable Diffusion won images, and Hunyuan and similar will win video.

Hunyuan, LTX-1, Mochi-1, and all the other open models from non-leading foundation model companies will eventually leapfrog Sora and Veo. Because you can program against them and run them locally or in your own cloud. You can fine tune them to do whatever you want. You can build audio reactive models, controllable models, interactive art walls, you name it.

Sora and Veo just aren't interesting. They're at one end of the quality spectrum, and open models will quickly close that gap and then some.

dragonwriter · 2024-12-18T03:44:21 1734493461

> Open source video models are going to beat closed source. Ecosystem and tools matter. Midjourney has name recognition, but nobody talks about Dall-E anymore. The same will happen to Sora. Flux and Stable Diffusion won images, and Hunyuan and similar will win video.

Neither Flux (except the distilled Flux Schnell model) nor Stable Diffusion has open-licensed weights. Stable Diffusion and Flux Dev are weights-available with limited, non-open licenses; Flux Pro is hosted-only.

echelon · 2024-12-18T04:08:10 1734494890

Just because the OSI doesn't like Open RAIL doesn't make it not open source unless you're strictly talking about the OSD. The OSI can't even figure out where the boundaries of open models lie - data, training code, weights, etc.

The RAIL licenses do have usage restrictions (eg. against harming minors, use in defamation, etc.), but they're completely unenforced.

Flux Schnell is Apache. LTX-1 is Apache.

dragonwriter · 2024-12-18T05:43:16 1734500596

> Just because the OSI doesn’t like Open RAIL doesn’t make it not open source unless you’re strictly talking about the OSD.

If you aren’t talking about the OSD, you end up reducing “open source” to a semantically-null buzzword. But, in any case, I intentionally didn’t mention “open source”. The weights are under a use-restrictive license, not an open license, even leaving out the debates over what “source” is. And that’s just SD1.x, SD2.x, and SDXL, which have the CreativeML OpenRAIL-M license (SD1.x) or CreativeML OpenRAIL++M licenses (SD2.x/SDXL). SD3.x has a far more restrictive license, as does Flux Dev.

> Flux Schnell is Apache.

Huh. It’s almost like I should have explicitly excepted Flux Schnell from the other Stable Diffusion and Flux models when I said they didn’t have open licenses.

Oh, I did.

> LTX-1 is Apache.

Yes, it is. LTX-1 is “neither Flux (except the distilled Flux Schnell model) nor Stable Diffusion”. AuraFlow (an image model) is also Apache, and while it’s behind Flux – Dev or Schnell – or SDXL in current mindshare, it got picked – largely for licensing reasons – as the basis for the next version of Pony Diffusion, a popular (largely, though not exclusively, for NSFW capabilities) community model series whose previous versions were based on SD1.5 and SDXL, which gives it a good chance of becoming a major player.

fc417fc802 · 2024-12-18T08:07:16 1734509236

> Just because the OSI doesn't like ...

Statements that begin like this are nearly always rhetorical attempts to subvert the standard usage of the terminology.

> but they're completely unenforced

Utterly irrelevant from a legal perspective. Also entirely circumstantial in that it depends entirely on the license holder and can easily vary between end users.

I'm also rather confused how RAIL entered into this to begin with. Unless I've missed something significant, most variants (or at least high end variants) of Stable Diffusion [0] and Flux [1] are under non-commercial licenses.

Not that I take issue with that. I've no delusion that a company is going to spend hundreds of thousands of dollars on compute and then open the floor to competitors who literally clone their data.

[0] https://huggingface.co/stabilityai/stable-diffusion-3.5-larg...

[1] https://github.com/black-forest-labs/flux/blob/main/model_li...

creato · 2024-12-18T03:15:50 1734491750

I'm curious what your take on GIMP vs. Photoshop would be?

raxxorraxor · 2024-12-18T15:20:38 1734535238

Easily Gimp, and Krita for painting (you can buy the latter on Steam, if you want to support open source).

Photoshop is a round and mature product, but since I don't do any print, I can do everything with Gimp (perhaps you can do print too, no experience here).

Creative Cloud, or whatever it is called today, is a non-starter for me. Also, I can integrate Gimp in image pipelines more easily. I also use Blender for modelling.

Maybe I am not entirely up to date, but today you can use these tools to make things that were just not possible a few years ago. In a quality that is competitive with high quality media products.

For me it is a hobby and I get the advantages in a professional environment to use the same tools that fit long and complicated pipelines. But if you just want to create high quality art, the tooling is readily available.

whywhywhywhy · 2024-12-18T14:41:32 1734532892

It’s not comparable because GIMP has never had the effort put into it to compete with Photoshop’s most basic features. 15-20 years ago they were arguing that adjustment layers were not needed, and they only managed to ship some form of it this year.

Blender vs commercial 3D software is a better example.

echelon · 2024-12-18T03:27:35 1734492455

Nobody is itching to put GIMP into their product, but everyone can think of ways to build upon Llama and Flux and provide new value.

treesciencebot · 2024-12-09T19:37:52 1733773072

Hunyuan at other providers like fal.ai is cheaper than SORA for the same resolution (720p 5 seconds gets you ~15 videos for $20 vs almost 50 videos at fal). It is slower than SORA (~3 minutes for a 720p video) but faster than replicate's hunyuan (by 6-7x for the same settings).

https://fal.ai/models/fal-ai/hunyuan-video
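
For a rough per-video comparison, the figures quoted above can be divided out. This is only a sketch of that arithmetic: it treats "almost 50" as exactly 50 and "~15" as exactly 15, both of which are rounding assumptions, and it ignores any plan details beyond the $20 figure.

```python
# Figures taken from the comment above; the exact counts are assumptions
# ("~15" is treated as 15 and "almost 50" as 50).
BUDGET_USD = 20.0
SORA_VIDEOS = 15
FAL_VIDEOS = 50


def cost_per_video(budget_usd: float, videos: int) -> float:
    """Return the average cost of one video for a given budget."""
    return budget_usd / videos


sora_cost = cost_per_video(BUDGET_USD, SORA_VIDEOS)
fal_cost = cost_per_video(BUDGET_USD, FAL_VIDEOS)
ratio = sora_cost / fal_cost

print(f"Sora: ${sora_cost:.2f} per 720p 5-second video")
print(f"fal:  ${fal_cost:.2f} per 720p 5-second video")
print(f"Sora costs about {ratio:.1f}x as much per video")
```

Under those assumptions that works out to roughly $1.33 versus $0.40 per video.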

treesciencebot · 2024-11-14T01:36:18 1731548178

I think the main problem is with 2) since 1 is entirely up to the content creator's discretion.

yen223 · 2024-11-14T01:48:21 1731548901

I will not be surprised if YouTube figures out a way to make content creators pay a portion of their sponsored content.

LordKeren · 2024-11-14T02:38:28 1731551908

They’re already going after sponsor reads.

On iOS with YouTube Premium, I get a button prompt asking if I want to skip a commonly skipped section.

9/10 it’s the sponsor read. The remaining 1/10 is Patreon.

I haven’t seen much about it, so I’m not sure if I’m in an A/B test.

kalleboo · 2024-11-14T02:06:54 1731550014

Didn't Twitch try to do this?

wongarsu · 2024-11-14T02:16:41 1731550601

The backlash from Twitch's attempt may well be why Youtube isn't doing it

treesciencebot · 2024-11-13T20:00:00 1731528000

I don't think anyone has done real-time multi-speaker dialog generation before.

treesciencebot · 2024-11-04T22:39:03 1730759943

Liability. Till we solve this, we can't really give AI any real responsibilities.

digging · 2024-11-04T22:43:36 1730760216

Human CEOs generally aren't held liable for their actions, so why would AI need to be? Once again, I think we're just smoothing out a wrinkle here.

SauntSolaire · 2024-11-04T23:46:38 1730763998

I know this is more of a throwaway cynical quip, but this is a biased line of thinking. CEOs are, for obvious reasons, more likely to do things they wouldn't be held liable for than things that would likely see them punished. So executives might, for example, get away with things by successfully skirting the line of legality.

Say an AI CEO blatantly crosses this line, now who is liable?

from-nibly · 2024-11-05T00:38:31 1730767111

It's a cheese touch situation. Last human to make a decision.

disqard · 2024-11-04T23:15:43 1730762143

I think as long as the LLM can "take full responsibility", there should be no objections from the shareholders.

Imagine saving an extra $50m per year? Yes, please!

A4ET8a8uTh0 · 2024-11-04T22:44:53 1730760293

It is already used widely across industries where one would think people should be more conservative (healthcare transcription services come to mind, but it is hardly the only example of this). As always in America, only lawsuits will show us how the dust has settled.

treesciencebot · 2024-10-30T20:46:54 1730321214

It particularly excels at text and scene composition, as well as being able to generate vector graphics. You can use it through their website or through fal.ai: https://fal.ai/models/fal-ai/recraft-v3/playground