We already know the upper bound of the performance improvement: existing performance * number of cores. In practice it will be worse than that, since all the GILectomy plans make single-threaded performance worse.
So if you're expecting something better than that, you will be disappointed.
IIRC, all the GILectomy plans also include single-threaded performance improvements to offset those costs. So while single-threaded performance may be worse than a GIL build of the same Python version, it should still be ahead of where it is today (assuming everything goes according to plan). That's also why multi-threaded performance can end up better than existing perf * number of cores relative to today's baseline, even if not relative to what removing the GIL alone provides.
I could be misremembering, but I thought the MSFT team proposed those performance improvements specifically to offset concerns about single-threaded performance degradation from removing the GIL. So even if the development is happening in parallel by independent teams (which I thought it wasn't; I thought it was all one team doing this work), it was predicated on nogil being accepted in the first place. If the GIL were to remain in Python, this performance work wouldn't be happening.
Maybe the work wouldn't be happening without the noGIL effort, but once it exists it isn't tied to the GIL: you could pick up those improvements and continue with a GIL-only Python.
This post is literally about step 1: adding this behind an unsupported, experimental flag to get more insights. Step 2, in the mid term, is making it a supported option based on readiness (within another 2 years). Step 3 is making it the default and then removing the GIL [1]. Steps 2 and 3 may not happen if some major unsolvable obstacle appears, but I doubt this direction will be easy to reverse. Given that MSFT is driving all of this right now, it's hard to imagine much appetite to break their trust; MSFT is more likely to cut funding before completion (which would create some chaos) than the steering committee is to violate an agreement around funding. (MSFT has made specific long-term commitments it intends to keep, but IIRC those commitments only cover a few years.)
MSFT is keen to invest in popular development tools such as Python, JavaScript, and Git. They hired Guido; they bought npm; they bought GitHub.
I don't know what the plan is, but outside the corporate world (with C#) and the functional-programming world (with F#), they haven't really succeeded in making modern tooling and languages that people want to use.
If the hill to making inroads into a desired community is too steep, and there's enough money, just buy the thing that brings that community together.
The GIL causes a huge performance hit in data processing/ML by forcing the use of multiple processes, which leads to a lot of unnecessary copying of memory between them unless you put in significant effort to share memory explicitly. So in some cases the savings will be gigantic, from no longer needlessly copying huge dataframes between processes.
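As a concrete sketch of the workaround the GIL pushes you toward today: put the large array in multiprocessing.shared_memory so worker processes can touch it without each receiving a pickled copy. The array contents and the "fork" start method are illustrative only; real code should use the default start method with an `if __name__ == "__main__"` guard.

```python
# Share one big array between processes without copying it per worker.
import multiprocessing
import numpy as np
from multiprocessing import shared_memory

def double_inplace(shm_name, shape, dtype):
    # Attach to the existing block; no copy of the data is made.
    shm = shared_memory.SharedMemory(name=shm_name)
    view = np.ndarray(shape, dtype=dtype, buffer=shm.buf)
    view *= 2  # in-place update, visible to the parent
    shm.close()

src = np.arange(8, dtype=np.float64)
shm = shared_memory.SharedMemory(create=True, size=src.nbytes)
arr = np.ndarray(src.shape, dtype=src.dtype, buffer=shm.buf)
arr[:] = src  # one copy in, instead of one pickled copy per worker

ctx = multiprocessing.get_context("fork")  # "fork" keeps the sketch guard-free
p = ctx.Process(target=double_inplace, args=(shm.name, arr.shape, arr.dtype))
p.start()
p.join()

result = arr.tolist()  # the parent sees the child's update directly
shm.close()
shm.unlink()
```

The bookkeeping (names, shapes, explicit unlink) is exactly the "bunch of effort" that goes away once threads can simply share the object.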
But usually, in spaces where you need speed, Python is just an orchestrator or glue between pipelines; the actual calculations are done by a database or some C/C++/Fortran library.
Yes, pandas/numpy call C++ to do calculations efficiently, but the "glue" can still introduce significant slowdown relative to that when it's unnecessarily copying tens of gigabytes of dataframe between processes. Of course that slow part could itself be moved to C++, but that's much more effort than just parallel mapping over the dataset in Python with no copying/multiprocessing, as will be possible with no-gil.
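To sketch what that looks like: with free threading, a plain thread pool can map pure-Python work over chunks of a shared dataset with zero copying. This runs correctly on a GIL build too; it just won't use multiple cores for the Python-level work there. The chunking scheme and the `normalize` function are illustrative, not from any library.

```python
# Parallel map over chunks of an in-memory dataset using threads;
# every worker reads the shared list directly, nothing is pickled.
from concurrent.futures import ThreadPoolExecutor

def normalize(chunk):
    # CPU-bound Python work; only pays off on a free-threaded build.
    total = sum(chunk)
    return [x / total for x in chunk]

data = list(range(1, 101))
chunks = [data[i:i + 25] for i in range(0, len(data), 25)]

with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(normalize, chunks))

flat = [x for chunk in results for x in chunk]
```

Compare this with the multiprocessing version, where each chunk is serialized into the worker and each result serialized back out.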
Bad code/quick hacks will always be slow (but can be great for prototypes), and sometimes it's worth planning how you're going to process something rather than piling on multiprocessing. Once you reach the point of multigigabyte IPC, it's worth spending the time doing it right.
GIL-less Python isn't magic pixie dust; the same users who write slow, poorly structured code will, at best, run into deadlocks. GIL-less Python can be used by well-designed libraries to achieve speedups, but that's not code written by the aforementioned pandas users. Speaking from experience, there's a lot more room for order-of-magnitude speedups in fixing quick hacks than in running things in parallel, and fixing them is usually a lot easier than managing multithreaded code.
I would be shocked if pandas weren't already using multithreading where it could. Naturally, free-threaded Python (to use the actual name it's being given) offers libraries like pandas more options, which I think is a good thing, even if things won't be as smooth as people would like. But there's only so much pandas can do for badly written code. This would be like PostgreSQL moving from multiple processes to multiple threads: sure, there may be speedups for some users, but for users who haven't added any indices, there's a lot of performance left on the table.
What libraries? If you're writing some pandas code and want to parallelise part of your data pipeline, as far as I'm aware pandas doesn't have much support for that; you need to manually use multiprocessing to process different parts of the dataframe in different processes. Yes, there are pandas alternatives that claim to be drop-in replacements with better parallelism support, but the more pandas features you use, the more likely you are to depend on something they don't support, meaning you need to rewrite some code to switch to them.
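For concreteness, the manual pattern looks something like this. Plain lists stand in for dataframe partitions (with pandas you would typically np.array_split the frame), the chunk count is arbitrary, and the "fork" start method is used only to keep the sketch guard-free; real code should use the default start method with an `if __name__ == "__main__"` guard.

```python
# Manually splitting a dataset and farming chunks out to a process pool,
# since pandas has no built-in parallel map.
import multiprocessing

def summarize(chunk):
    # Each worker receives a *pickled copy* of its chunk -- this is the
    # copying overhead a thread-based approach would avoid.
    return sum(chunk)

ctx = multiprocessing.get_context("fork")
data = list(range(1_000))
n = 4
chunks = [data[i::n] for i in range(n)]  # 4 striped partitions

with ctx.Pool(processes=n) as pool:
    partials = pool.map(summarize, chunks)

total = sum(partials)
```

With a real dataframe, the per-chunk pickling is where the "tens of gigabytes copied between processes" cost from earlier in the thread comes from.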
I would use Python much more if every version didn't have this many breaking changes, especially with the removal of the GIL. Shame they didn't learn from 2 to 3.