I'd appreciate knowing what's wrong with MyPy that this project will fix. Check ...

IshKebab · 2025-01-29T22:25:44 1738189544

I found performance not that different to Pyright. The major difference is quality and correctness. Mypy is full of weird bugs and insane typing holes.

I found cases where it would even treat `foo: SomeType` and `foo # type: SomeType` differently!
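
For anyone unfamiliar with the two spellings, here is a minimal sketch. `Settings`, `names`, and `scores` are made-up names for illustration, not taken from the mypy bug in question. A checker is expected to treat both declarations the same way, even though only the first one is visible to the interpreter at runtime.

```python
from typing import List


class Settings:
    # PEP 526 variable annotation: stored in __annotations__ at runtime.
    names: List[str] = []

    # PEP 484 type comment: only a type checker reads it; to the
    # interpreter this is an ordinary comment.
    scores = []  # type: List[int]


# Only the annotated attribute shows up here.
print(Settings.__annotations__)
```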

I tried to fix that one but looking into the code lowered my impression of it even further. It's a complete mess. It's not at all a surprise that it gets so many things wrong.

Overall Mypy is kind of like a type checker written by people who've only ever seen linters before. It checks some types but it's kind of woolly and heuristic and optional.

Pyright is a type checker written by someone who knows what they are doing. It is mostly sound, doesn't just say "eh we won't check that" half the time, and has barely any bugs.

Seriously, check the closed/open issues on GitHub - there's a touch of "I disagree so I'm closing that" but only a touch. It mostly has so few open issues because the main author is a machine.

The only real problems with it are performance (it's ok but definitely could be better), and the slightly annoying dependence on Node.

akoboldfrying · 2025-01-30T01:30:58 1738200658

I upvoted as your comment sounds reasonable, but in a sibling comment I see:

>By comparison [to pyright], mypy uses a more traditional multi-pass architecture where semantic analysis is performed multiple times on a module from the top to the bottom until all types converge

That makes it sound like mypy does in fact do The Right Thing (albeit the slower, less-suitable-for-LSP thing), rather than a messy pile of heuristic/optional hacks.

Redoubts · 2025-01-30T19:21:42 1738264902

> It is mostly sound, doesn't just say "eh we won't check that" half the time

No, but they did kinda say that about `attrs`, which is a big deal for me and my stuff at least. Hopefully this new project can deal with all the crazy dynamic stuff python allows you to get away with.

IshKebab · 2025-01-31T14:49:48 1738334988

> Hopefully this new project can deal with all the crazy dynamic stuff python allows you to get away with.

Unlikely. It's best not to write code like that in the first place.

SOLAR_FIELDS · 2025-01-30T04:57:19 1738213039

Not really knowing fully about how type checkers work, one thing I struggled with a fair amount working with Mypy is how the stubs made available from various libraries were both all over the place in quality and not straightforward as a beginner to get working. Is this approach of using stubs a requirement for basically any static type checker that Python uses, or is this a specific way that Mypy chose to implement this concept?

Yoric · 2025-01-30T13:49:53 1738244993

Yes, I think that is actually the greatest weakness of mypy: the ecosystem is hostile to static typing.

harrall · 2025-01-30T00:12:33 1738195953

This is a good summary: https://github.com/microsoft/pyright/blob/main/docs/mypy-com...

But without that, I always felt like I was actively fighting mypy. It seemed like it was written for a totally different language than Python.

Compare that to a more modern type system like TypeScript: sometimes you don't explicitly type something, and yet TypeScript usually does exactly what you expect.

pushfoo · 2025-01-31T19:40:21 1738352421

Lacking named tuple support is a deal-breaker for some projects. This is directly from their GitHub issues comments[1]:

> Duplicate of #5613, still low priority, better use dataclasses, as suggested above.

1. https://github.com/python/mypy/issues/5944#issuecomment-4412...

arthur-st · 2025-01-29T21:16:02 1738185362

MyPy's rules are reference-grade, being as close to an official spec as we get until the Typing Council is done establishing their moat.

To understand shortcomings of MyPy, I strongly suggest reading pyright's documentation for how they compare: https://github.com/microsoft/pyright/blob/main/docs/mypy-com...

Quoting the pertinent part:

> Pyright was designed with performance in mind. It is not unusual for pyright to be 3x to 5x faster than mypy when type checking large code bases. Some of its design decisions were motivated by this goal.

> Pyright was also designed to be used as the foundation for a Python language server. Language servers provide interactive programming features such as completion suggestions, function signature help, type information on hover, semantic-aware search, semantic-aware renaming, semantic token coloring, refactoring tools, etc. For a good user experience, these features require highly responsive type evaluation performance during interactive code modification. They also require type evaluation to work on code that is incomplete and contains syntax errors.

> To achieve these design goals, pyright is implemented as a “lazy” or “just-in-time” type evaluator. Rather than analyzing all code in a module from top to bottom, it is able to evaluate the type of an arbitrary identifier anywhere within a module. If the type of that identifier depends on the types of other expressions or symbols, pyright recursively evaluates those in turn until it has enough information to determine the type of the target identifier. By comparison, mypy uses a more traditional multi-pass architecture where semantic analysis is performed multiple times on a module from the top to the bottom until all types converge.

> Pyright implements its own parser, which recovers gracefully from syntax errors and continues parsing the remainder of the source file. By comparison, mypy uses the parser built in to the Python interpreter, and it does not support recovery after a syntax error. This also means that when you run mypy on an older version of Python, it cannot support newer language features that require grammar changes.

Astral's type checker seems to be an exercise in speeding up Pyright's approach to designing a type checker, and removing the Node dependency from it.
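
The lazy versus multi-pass distinction in the quoted passage can be illustrated with a toy model. This is only a sketch under heavy assumptions: the "module" is a hypothetical dict of symbols that are either literals or aliases of other symbols, and neither function reflects how pyright or mypy is actually implemented. Cycle handling is left out.

```python
from typing import Dict, Optional, Tuple

# Toy "module": each symbol is ("lit", type_name) or ("ref", other_symbol).
Expr = Tuple[str, str]

MODULE: Dict[str, Expr] = {
    "c": ("ref", "b"),
    "b": ("ref", "a"),
    "a": ("lit", "int"),
    "unrelated": ("lit", "str"),
}


def lazy_type_of(name: str, module: Dict[str, Expr], cache: Dict[str, str]) -> str:
    """Lazy style: evaluate only what the requested symbol depends on."""
    if name in cache:
        return cache[name]
    kind, value = module[name]
    # Follow references recursively; a reference cycle would recurse forever.
    result = value if kind == "lit" else lazy_type_of(value, module, cache)
    cache[name] = result
    return result


def multi_pass_types(module: Dict[str, Expr]) -> Tuple[Dict[str, str], int]:
    """Multi-pass style: sweep top to bottom until no new types appear."""
    types: Dict[str, str] = {}
    passes = 0
    changed = True
    while changed:
        changed = False
        passes += 1
        for name, (kind, value) in module.items():
            if name in types:
                continue
            resolved: Optional[str] = value if kind == "lit" else types.get(value)
            if resolved is not None:
                types[name] = resolved
                changed = True
    return types, passes


cache: Dict[str, str] = {}
print(lazy_type_of("c", MODULE, cache), sorted(cache))
print(multi_pass_types(MODULE))
```

In this toy, asking for `c` lazily never touches `unrelated`, while the multi-pass version resolves every symbol and needs several sweeps because the definitions appear after their uses.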

zelphirkalt · 2025-01-29T22:02:27 1738188147

I haven't had any issues from MyPy regarding speed. So performance issues did not exist whenever I used MyPy. Also not sure why I need incremental anything. I save a file and then I want it to be checked.

If I am not implementing a LS, then how is it of any importance whether the type checker was designed with a LS in mind? How does that benefit me in my normal projects?

If there are no semantic improvements that allow more type inference than MyPy allows, I don't see much going for Pyright. Sounds like a "ours is blazingly faster than the other" kind of sales pitch.

E_Bfx · 2025-01-29T22:26:06 1738189566

In a medium-size codebase (~100 Python modules of 200 lines each), mypy takes 5 minutes to type check. This can be a problem for CI.

wk_end · 2025-01-30T00:55:53 1738198553

Just to throw my anecdote in: I used to work at the mypy shop - our client code base was on the order of millions of lines of very thorny Python code. This was several years ago, but to the best of my recollection, even at that scale, mypy was nowhere near that slow.

Like I said, this was many years ago - mypy might've gotten slower, but computers have also gotten faster, so who knows. My hunch is still that you have an issue with misconfiguration, or perhaps you're hitting a bug.

wavemode · 2025-01-30T04:33:06 1738211586

My current company is a Python shop, 1M+ LOC. My CI run earlier today completed mypy typechecking in 9 minutes 5 seconds. Take from that what you will.

Redoubts · 2025-01-30T19:28:24 1738265304

Ditto, same order of magnitude experience; at least for --no-incremental runs.

Part of the problem for me is how easily caches get invalidated. A type error somewhere will invalidate the cache of the file and anything in its dependency tree, which blows a huge hole in the runtime.

Checking 1 file in a big repo can take 10 seconds, or more than a minute as a result.

E_Bfx · 2025-02-01T14:33:09 1738420389

I guess there is something with the cache that we aren't doing right. Thanks for your feedback.

zelphirkalt · 2025-01-29T22:51:03 1738191063

Never happened for me. Similarly sized code base, done in seconds, if not 1s. Guess we all have our anecdotes.

sgarland · 2025-01-30T00:08:33 1738195713

I think you have something misconfigured, or are timing incorrectly. I'm working on a project right now with ~10K LOC. I haven't timed it, but it's easily <= 2 seconds. Even if I nuke MyPy's cache, it's at most 5 seconds. This is on an M3 MBP FWIW.

imron · 2025-01-30T00:34:01 1738197241

And with dmypy (included with mypy) it’s even faster

Redoubts · 2025-01-30T19:32:29 1738265549

I've found dmypy very underbaked. It's very easy to get it to regularly crash or pin a CPU indefinitely in my codebase.

imron · 2025-01-30T22:02:31 1738274551

Yeah it’s far from perfect, but speed is usually not its biggest fault.

I’ll still be switching to the astral offering as soon as it’s production ready.

arthur-st · 2025-01-30T12:42:23 1738240943

Pyright has semantic improvements (and also some differences) over MyPy. As for using the type checker as a language server, it's difficult to go back to “it's compiling” after you've had one stop you from typing bugs out in-flight.

maxloh · 2025-02-07T08:42:17 1738917737

Pyright infers return types for me correctly in most cases, while mypy couldn't even properly infer the return type of void functions (you have to specify `-> None` explicitly).
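
A minimal sketch of the difference, with hypothetical function names. As far as I know, mypy treats a missing return annotation as an implicit `Any` instead of inferring `None`, whereas pyright infers the return type from the body.

```python
from typing import get_type_hints


def greet(name: str):
    # No return annotation: mypy does not infer None here.
    print(f"hello {name}")


def greet_annotated(name: str) -> None:
    # Explicit annotation: both checkers agree the result is None.
    print(f"hello {name}")


# At runtime only the second function carries a "return" hint.
print(get_type_hints(greet))
print(get_type_hints(greet_annotated))
```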

tcdent · 2025-01-29T21:09:52 1738184992

yeah, 'awful' is just the OP being dramatic, though I am looking forward to Astral's contribution

theLiminator · 2025-01-29T21:50:07 1738187407

I think that https://github.com/microsoft/pyright/blob/main/docs/mypy-com... actually covers the majority of my gripes with mypy. The main issues I encounter with mypy are due to its lack of precision: it produces a lot of false positives for well-typed code that pyright can handle. Also, the lack of type inference and the lack of type checking for unannotated code (by default) are kinda painful. In general, mypy makes certain pythonic patterns fail to typecheck, whilst pyright is a lot less painful in that regard.

I've personally found that dealing with mypy has been more noisy and painful than using pyright and leads to a lot of users/other devs just ignoring typing altogether.

I think that type checking has to closely match the semantics of the language and if there's a gap, it will often push users to do the easy thing, which is just ignoring checks.
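
To make the "unannotated code (by default)" point concrete, here is a small sketch with a made-up function. With default settings mypy skips the body of a function that has no annotations at all (the `--check-untyped-defs` flag turns that checking on), so the bug below only surfaces at runtime. To my understanding, pyright analyzes the body regardless of annotations.

```python
def total_length(items):
    # No annotations anywhere, so mypy's defaults leave this body unchecked.
    return len(items) + "1"  # int + str: fails when executed


try:
    total_length(["a", "b"])
except TypeError as exc:
    print(f"runtime error: {exc}")
```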