> I wouldn't be surprised if "parse and compile 8MB of integer literals" isn't a very well optimized code path in the compiler, because nobody is parsing and compiling 8MB of integer literals outside of artificial exercises like this
Actually, no. There is no portable "incbin" in C/C++, so one way to bake assets into the .rodata section is to convert a binary file into a huge "const unsigned char data[8192567] = { 0xFF, 0xD8, 0xFF, 0xE0, ... }" string and stash it into a .h or .c file. And yes, people do that often enough that gcc and clang actually had to optimize for this case specifically.
C23 gets #embed (https://thephd.dev/finally-embed-in-c23), and in practice later C++ compilers will presumably just do #embed as well rather than pretend they aren't the same code anyway.
However, even though there's a clear need, as that post explains, the compilers varied between "Bad at this" and "Completely awful at this", which is part of what was so frustrating for JeanHeyd over years of getting this through committee.
In principle #embed just shoves the bytes in as comma-separated values, but in practice, thanks to the "as if" rule in C and C++, the compiler won't literally do that, because doing it that way would be stupid.
Ha, cute, I think I began examining details of include_bytes! only this year so it's possible I happened to first look at this soon after the big improvements landed in the release I was testing against.
Also, a reminder that JS engines do regularly have to contend with enormous JSON and do not generally display pathological behavior for large literals.
I'm not sure who posted that comment, but if they are on the Swift team and blaming users, that should stop. Code like that is out there, and it's the job of tooling to handle it gracefully.
> JS engines do regularly have to contend with enormous JSON and do not generally display pathological behavior for large literals
Actually it’s common advice to replace very large literals with `JSON.parse("…")` because it’s faster according to Chrome engineers [1]. At Notion we did so for our emoji Unicode tables for a noticeable time-to-interactive improvement over the large literal.
It wasn’t a Swift team member. The Swift community has a quick response for these cases because the issue has been around forever and is just unusual enough of a “simple” case that it ends up, e.g., at #1 on HN.
My understanding of the Swift team’s thinking, though a few years out of date, is that it’s very, very hard to get the Swift type inference engine to handle a huge blob without type hinting, or with literals that could be multiple types.
Three years ago I tried putting constants for a matrix (I think it was 1024x80 or something like that) into a strongly typed array of floats. It still took the compiler 30 minutes to compile. In the end I split the matrix up into rows, with each row being one variable, and a final variable to concatenate the rows. No type hint whatsoever helped.
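For anyone hitting the same wall, here's a minimal sketch of that workaround (the names and values are made up; the real matrix obviously had far more columns):

    // Each row gets its own explicitly typed constant, so the type checker
    // only ever sees one row-sized literal at a time.
    let row0: [Float] = [0.12, 0.98, 0.44]
    let row1: [Float] = [0.07, 0.31, 0.65]

    // Concatenating the rows is cheap for the checker, because every operand
    // already has a known type.
    let matrix: [Float] = row0 + row1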
Yeah this one's real strange to me. I feel like it used to be as simple as the [int] definition people suggest in the thread.
I don't miss Xcode too much. IIRC I built some custom script using arcana dropped by Apple employees to output per-function build times. It was great! I knew exactly which functions were slow. But now I am old and lazy lol.
Right, but unlike Swift, the type system in JS is not doing computationally expensive type checking for each element. This is the equivalent of creating 1M DOM trees in JS and expecting it to be responsive.
The uncomfortable part about Swift for control freaks is that, rather than trying to force it to create fixed-dimension arrays, you should just make the array dynamic and trust that the compiler is smart.
It's not blaming the user, but the compiler team has to balance functionality ("Why can't the type system give me a better error?") and speed. There's no way they can handle every pathological edge case gracefully.
So, an hour's worth of compilation time = 3600 seconds, divided by a million elements, that's 3.6 ms, or about 10 million instructions per element. Just what in the heck is this compiler doing with 10 million instructions on an integer literal? Like, it's never seen integer literals before? And they're all integers. Every element. It's not a "pathological edge case". The whole thread is absurd. People are speaking up and blaming programmers, the language, the type system, when this is so obviously the compiler's fault.
I have no idea what's going on under the hood, but I'm sure it's not a linear time process, and all it takes is one of those elements to be "109.1" to flip 1M elements from Ints to Doubles. Yes, they could special-case this (and arrays of Doubles, and Strings), but where do you draw the line? That's now extra code to maintain in the compiler just to support a behavior that should be discouraged. If you need to read in 1M integers, put them in a file and read them in.
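To make that concrete, a tiny hedged illustration (nowhere near the original million elements, obviously):

    let a = [1, 2, 3]            // inferred as [Int]
    let b = [1, 2, 109.1]        // one element flips the whole literal to [Double]
    let c: [Double] = [1, 2, 3]  // an explicit annotation removes the guessing entirely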
> should be discouraged. If you need to read in 1M integers, put them in a file and read them in.
This is exactly the attitude that needs to be addressed. It's hostile to argue with users who are doing something completely legal and tell them they shouldn't do that, they should do it another way, that that behavior needs to be discouraged, etc.
We went through this all the time with V8, trying to tell JS developers they just shouldn't do that because V8 couldn't run that code fast. Or worse, that it had a particular pathology that made certain code absurdly slow. It just doesn't fly. It's V8's job to not go off the rails for user inputs; it should provide good default performance all the time, not get stuck in deopt loops, use absurd amounts of memory, etc. Yeah, and that's hard work.
I hear you, ideally the compiler should be able to manage any arbitrary input in a reasonable time, and catch itself if necessary. Usually it does—sometimes with complex ungrouped arithmetic operations with ambiguous types, Swift will error out and tell you to break up your expression, rather than get stuck.
I would love for the Swift compiler to be dramatically faster, but I understand the challenge, with a powerful type inference engine that supports Generics. It's a resource scarcity problem. If the Swift team spends 10 hours to handle long strings of integer literals, that's 10 hours they haven't put toward features that would benefit a larger audience.
Your take sounds more reasonable. I would think you would get diminishing returns if you try to solve all pathological cases.
Sometimes it is the fault of the language design too. Maybe the language spec needs to be changed. Imagine all the wasted effort to optimize V8 when you could have put static typing into Javascript itself.
Indeed—Chris Lattner has mentioned recently that in retrospect, he regrets certain Swift design decisions which have made the compiler so complex and relatively slow.
For a compiled language it seems reasonable to assume that "put them in a file and read them in" means at compile time, like the C23 pre-processor feature #embed and the Rust macro include_bytes!
Now, #embed and include_bytes! always give you bytes (in Rust these are definitely u8, an unsigned 8-bit integer, I don't know what is promised in C but in practice I expect you get the same) because that's what modern files are - whereas maybe you want, say, 32-bit big endian signed integers in this Swift example. But, Swift is a higher level language, it's OK if it has some more nuance here and maybe pays a small performance penalty for it. 10% slower wouldn't be objectionable if we can specify the type of data for example.
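For what it's worth, the "put them in a file" route isn't much code in Swift either. A hedged sketch, assuming a file of packed 32-bit big-endian signed integers whose length is a multiple of 4 (the file name is made up):

    import Foundation

    // Load the raw bytes at runtime instead of baking a literal into the source.
    let raw = try! Data(contentsOf: URL(fileURLWithPath: "values.bin"))

    // Decode each 4-byte chunk as a big-endian Int32.
    let values: [Int32] = stride(from: 0, to: raw.count, by: 4).map { offset in
        let word = raw[offset..<offset + 4].reduce(UInt32(0)) { ($0 << 8) | UInt32($1) }
        return Int32(bitPattern: word)
    }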
I once had good reason (or at least I thought so!) to code-generate nested structures into .rodata by way of thousands of static arrays (each with its own variable), referenced by other static arrays, referenced by static structs…
The application was a highly optimized encoding of Aho-Corasick state machines for megabytes of static dictionary strings. The code generation (not in C) was trivial, and along with .rodata came all the benefits of shared pages and lazy loading.
Across a number of compilers I only ran into one bug in 32-bit gcc, which was worked around easily by disabling the (unhelpful) optimization pass that was getting snarled.
You're right of course, but experienced developers know to write those as strings instead of arrays, especially for very large content. Otherwise both the compiler and IDE become painfully slow. I.e. your example would be written "\xFF\xD8\xFF\xE0..."
Don't try that in MSVC. It deliberately chokes on large strings. If you work hard you might put a few thousand integers in a string, but this example has a million integers. Most ways you can attempt that in MSVC time out, abort compilation or emit an error.
For an example of C using this in real life open a .xpm file in a text editor. There are many implementations of a C tool that converts a binary file into a .h.
And yes, it is dumb that we have to do tricks like this in 2023.
> Actually, no. There is no portable "incbin" in C/C++
I kinda hate this kind of argument. I mean, it's true, as .incbin is a GNU assembler directive (FWIW: binutils/llvm objcopy is a better mechanism still for this sort of thing in most contexts, as it doesn't involve source compilation of any kind).
But it leads to ridiculous design decisions, like "I'm going to write 8MB of source code instead of doing the portability work on my own to turn that data into a linkable symbol".
There's a huge gulf in the space between "non-portable" and "impossible". In this case, the problem is trivially solved on every platform ever. Trivial problems should employ trivial solutions, even where they have to involve some per-platform engineering.
> But it leads to ridiculous design decisions, like "I'm going to write 8MB of source code instead of doing the portability work on my own to turn that data into a linkable symbol".
Why is that ridiculous? It strikes me as not necessarily the best, but the most obvious approach, the most portable, and possibly the quickest to implement.
Your toolchain might have a special way to import binary blobs, but a) you’ll have to dig through the docs to find it, b) you’ll probably need to solve the problem again when porting to a different platform, and c) who knows if it actually works, or if there are hidden gotchas?
Sure, if there’s a known tool or option that does the job, you should go ahead and use it. But in general, writing a little script to generate a bunch of boilerplate code is perfectly workable.
> Your toolchain might have a special way to import binary blobs, but a) you’ll have to dig through the docs to find it
This is a corollary: "I don't want to learn my tools, so I'll learn the language standard instead" is fundamentally exactly the problem I'm talking about.
Straight up: C linkage is a 1970s paradigm full of tools that had to run on a PDP-11, and it's vastly simpler than learning C++ or Rust. It's just not "modern" and no one taught it to you, so it looks weird and mysterious. That's the problem!
I go back and forth on this argument when it comes to codegen. Like, you could make the same argument that protobuf shouldn't output C code. It should output an object file that you can link into whatever compiled language you want. C, fortran, C++, rust, who cares. As you say, the linking model is simple and works well.
Why do we generate big C/C++ strings instead, and compile those? Because object files have lots of compiler/platform/architecture specific stuff in them. Outputting C (or C++) then compiling it is a much more convenient way to generate those object files, because it works on every OS, compiler and architecture. Even systems that you don't know about, or that don't exist yet.
I hear what you're saying and I'm torn about it. I mean, aren't binary blobs just a simpler version of the problem protobuf faces? C code is already the most portable way to make object files. Why wouldn't we use it?
> Like, you could make the same argument that protobuf shouldn't output C code.
The case at hand is an 8MB static array of integers. Obviously yes, of course, absolutely: you choose the correct/simple/obvious/trivialest implementation strategy for the problem. That's exactly what I'm saying!
In the case of protobuf (static generation of an otherwise arbitrarily complicated data structure with reasonably bounded size), code generation makes a ton of sense.
And the simple/obvious/trivialest solution is to write an array literal with 4 million integers in it, while fighting with Microsoft's link.exe is anything but. Even using rc.exe and loading that data from the resource section is a non-trivial amount of additional work.
Come on. Both binutils and nasm can generate perfectly working PE object files. I don't know the answer off the top of my head, but I bet anything even pure MSVC has a simple answer here. Dealing with the nonsense in the linked article is something you do for a quick hack or to test compiler performance, but (as demonstrated!) it scales poorly. It's terrible engineering, period. Use the right tools, even if they aren't ISO-specified. And if you can't or won't, please don't get into fights on the internet justifying the resulting hackery.
> I bet anything even pure MSVC has a simple answer here
You lose your bet, because it doesn't: neither its inline assembler nor the actual MASM shipped with Visual Studio supports any "incbin"-like directive. You can generate an .asm with a huge db/dd, I guess, if you don't like a large literal array in .c files, but that's it.
> Come on. Both binutils and nasm can generate perfectly working PE object files. I don't know the answer off the top of my head, but I bet anything even pure MSVC has a simple answer here.
Right, meaning you have to implement N solutions instead of just one. It's a common enough and useful enough feature for the language to support it. I think it would be a different story if linkers were covered by the language specification.
I remain shocked at how controversial this is. Yes. Yes, implementing N trivial and easily maintained solutions is clearly better than one portable hack.
Clearly many people disagree. And given that C now has #embed, I don't even think I'd consider it to be a hack.
> I remain shocked at how controversial this is.
I am a bit shocked that you think the right solution to making data statically available to the rest of your program is somehow outside the scope of the programming language.
The comparison wasn't to #embed[1], but to an 8MB static array. You're winning an argument against a strawman, not me. For the record, I think #embed (given tooling that supports it) would be an excellent choice! That's not a defense of the technique under discussion though.
[1] Which FWIW is much less portable as an issue of practical engineering than assembler or binutils tooling!
It's "I don't want to learn and debug _everybody else who may possible want to build this otherwise portable C program_'s tools."
When those tools change how they do this unportable thing every couple of years in subtle and incompatible ways, which require #ifdef's to handle the different ways those linked against symbols can be accessed, multiplied by dozens of different platforms, then yes, I'm going to compile an 8MB literal.
I am pretty certain I've seen linkers routinely writing 0 instead of symbols' actual sizes so getting the actual size of the embedded binary blob is not very pretty.
> It's just not "modern" and no one taught it to you, so it looks weird and mysterious.
I don’t know what to say except that I’ve worked with C linkers for a long time, since before I learned C++ and before Rust even existed, and I still don’t like ‘em.
> But it leads to ridiculous design decisions, like "I'm going to write 8MB of source code instead of doing the portability work on my own to turn that data into a linkable symbol".
People use scripts to do this kind of thing and never think about it again, and it's still more portable than writing a custom build step.
> FWIW: binutils/llvm objcopy is a better mechanism still for this sort of thing in most contexts, as it doesn't involve source compilation of any kind
I used to agree with that, but honestly a simple `xxd` to get a C array avoids so many issues with `objcopy` that I'd rather just use that now. With `objcopy` even just getting the names of the produced symbols to be consistent is a pain, and you have to specify the output target and architecture which is just another thing you have to update for more platforms (and if someone's using a cross-compiler, they have to override that setting too).
In contrast if you just produce a C array then it compiles like normal C code and links like normal C code, problem solved and all the complexity is gone.
Lots of users in that thread are suggesting this is some sort of design constraint or trade-off, but I'm really not sure I agree.
I understand this might not be a high priority, because it's a somewhat contrived use case that rarely matters in "the real world", but I strongly suspect this must be due to bug(s) in the compiler. I cannot think of a single reason it _has_ to be this way per the design of the language.
I recently started a macOS app project to learn Swift and SwiftUI. A week or two into the project, I managed to hit an honest-to-god compiler bug. A particular combination of syntaxes would reliably cause the compiler to crash (not yield a compile error; crash). I tried restarting Xcode, restarting my computer, updating everything. Whenever I brought that syntax into the editor, Xcode's Swift compiler integration would crash.
It wasn't anything exotic; I was making a straightforward app, and wasn't trying any Swift language funny-business (I didn't know the language well enough to even try)
It was bizarre to so easily find a bug in the headlining compiler made by the world's most valuable company; I can only think it's the result of this being a language that (effectively) only targets one company's systems, and the effect that has on the size and involvement of the community
I've also hit these kinds of issues with Swift/SwiftUI. It's always surprising for a production compiler to die, throw up its hands, and say "you hit a compiler bug!"
Type checking in Swift is notoriously slow and picky. Things have improved over the years (considerably!), but the odd complicated generics use case will either take an extra minute to compile, or segfault.
It says "not all", which I interpret as "some of it is not spent in the type checker". I'm also not sure the times Xcode displays in the screenshot are bound to be accurate.
All profiling techniques have their drawbacks, but for a quick "where is the time being spent" analysis at multiple-second granularity, you're solidly in the territory where sampling profilers work really well.
There are different algorithms for type checking that impose different requirements on users.
The Java approach is roughly: write types everywhere, keep them simple (and limited in scope), it's easy for the compiler to infer the types.
The Haskell/ML approach is roughly: know a lot about strong type systems, be purely functional, infer most of the types, produce bad error messages when you can't.
The Swift approach attempts to be the best of both worlds, lots of inference, strong capabilities in expressing complex types, but the trade-off is that it can't use the inference algorithms of either of the above examples. I believe Swift necessarily has poor time complexity in type checking – it can't be better with the design it has and requirements it puts on authors.
By contrast, Rust's type system moves more of that effort onto the developer – there's a reason why Rust has a much steeper learning curve than Swift.
The trade-off is a somewhat rare sharp edge for Swift and a much easier onboarding, vs. Rust's noticeably harder onboarding and more predictable sharp edges.
Is there anyone on the compiler team that also plays a role in llvm development and optimizations? Or is there anyone on the llvm core team that cares about optimizations that would directly benefit Swift? If there's no one available that either can or is willing to optimize llvm, this feels like a worst case scenario. I'm curious what steps need to be taken to see actual compiler performance gains as it relates to llvm.
Interesting that the two highest functions in that trace relate to Bitcode serialization and GlobalISel. Why is Swift serializing Bitcode if it is also running ISEL in the same compile?
Set these to ~100ms and add a few explicit types whenever they log, and compile time will be significantly improved.
Swift is a slow compiler in exceptional cases, maybe in the 99th percentile, but when you know where this is happening it's fairly trivial to avoid it and keep the compiler nimble and stay productive as an engineer.
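For reference, I believe the settings in question are the Swift frontend warning thresholds, passed via OTHER_SWIFT_FLAGS in your build settings (values in milliseconds; treat the exact numbers as a starting point, not gospel):

    -Xfrontend -warn-long-function-bodies=100
    -Xfrontend -warn-long-expression-type-checking=100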
In this particular case, adding an explicit type didn’t help at all.
I’ve worked in a large Swift project with those timeout warnings enabled, and didn’t find them too helpful. They showed up a lot and it was rarely obvious how to quickly fix them. Possibly it would have been useful if those warnings were enabled from the start and developers had always fixed them proactively. I’m skeptical that that could work in a big fast-moving team project, though.
I think it's reasonable to have a warnings-are-errors approach even in medium sized teams, or if not quite that approach perhaps a "warnings go in the bug tracker" approach.
I'm far from a Swift expert, but I usually found I could reduce the compilation time with ~10 mins of work. When I turned these on for a small codebase (60k lines, ~50% of an engineer split across 3 people), it only took a few hours to solve all the instances of this, and they were mostly obvious cases where a 100 line function could be split into 3 and solve the issue.
Warnings-as-errors is a great policy in general, but these are non-deterministic warnings based on compile time wall timing. Coworker working on a slower computer than you? He can run into hundreds of errors after a pull that you introduced but couldn't see, what now? Running something intensive in the background? You can't compile successfully any more. Not ideal.
That's true, but a setting of 100ms is a few orders of magnitude higher than expressions should be so it's quite possible to make this fairly reliable.
The problem is that if this ever catches anything, then there are things that it almost catches. And you have no way of telling if you have some expression that takes 99 ms on your machine and so will take 101 ms on your coworker's. The appropriate way to deal with this in general is to have separate warning and error thresholds, where the difference between the two is greater than the variance between machines, e.g. 50 ms and 100 ms; at least in this case, when your coworker checks out a broken build, he can see that warnings were introduced in your commit in a CI build log and track it down to the source.
It's easy to have one setting for local development and one for CI. Xcode supports different profiles natively so you can just put 50ms or whatever in your user settings and leave 100ms or even 1s in your standard build settings.
Also, this sort of works the other way too. Just because something failed at the 100ms threshold doesn't mean it would complete in 101ms, it might take 500ms, 10 minutes, or be undecidable, and it's important to fail CI in those sorts of cases to protect other developers from issues, so having a limit is useful even if it's relatively high for the default case.
I guess I don't see what the actual workflow is supposed to be for a new team member who joins (potentially with a different computer than everyone else), checks out the code, and has dozens of errors when they first try to build.
I'm a huge fan of error'ing on warnings, but I really don't see it as appropriate for nondeterministic cases.
I see what you mean, but in this case it's that the Swift compiler is keeping you productive by doing a ton of work for you, and asking for a little clarity every now and again so that it doesn't have to do exhaustive searches of what you mean.
Reminds me a little bit of the time in an intermediate programming course where we had to write a variety of programs and the input was an enormous string containing the entire novel "Jane Eyre."
I discovered very quickly that building a string by concatenating the input one word at a time in Java, my preferred language at that time, was a huge mistake, because of the way string concatenation works in Java (strings are immutable, so each concatenation copies). StringBuilder to the rescue!
At first I was astounded at how slow it was, but when I learned how the underlying language semantics worked, it made sense.
Wow, that's a fantastic exercise for a computer science class. Most classes only work on programs so small that students don't really get a feel for how things scale.
It was one of my more memorable ones. It had ~50 exercises that I recall, and the core of the program was supposed to compute a word frequency list on the words in the novel (with a blacklist of words like common adjectives etc). The twist was you had to do it with a particular programming paradigm in mind each time - functional, object oriented, MVC, REST API, anything you could think of. It was very eye opening solving the exact same problem in so many different ways.
In this case, Swift is being, let's say, suboptimal when compiling an 8MB array of integers.
It's worth mentioning another, more common, cause. There can be computational cost explosion when Swift resolves the types for function calls (and operators). You notice it when you get the "too complex" error, but you may be tolerating slow compiles which are just slightly less than too complex. A few explicit type annotations in these situations can do wonders for speeding your development cycle.
Edit: redact
I unfortunately can't tell you how to find these. Maybe a "give it a go with reduced complexity limit" compiler flag would help find them.
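A hedged illustration of the shape of the fix (the expression itself is made up and may not actually be slow; it just shows the kind of annotation that helps):

    // Each literal below can be adopted by any integer- or float-literal
    // convertible type, so the solver weighs combinations of numeric
    // overloads for +, * and / before settling on an answer.
    let x = 2 * 0.5 + 1 / 3 + 0.25

    // Pinning the result type makes every literal unambiguously a Double.
    let y: Double = 2 * 0.5 + 1 / 3 + 0.25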
My most frustrating error with swiftc is the switch on enums "not being able to complete in acceptable time" (or something like that). Basically, when you want an exhaustive check on a combination of enums, you're often reaching the limits of the compiler.
Something like this:

    enum State { case A, … }

    switch (fromState, toState) { case (.A, .B): … }

in which you want to make sure you didn't forget a combination.
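A small self-contained version of that pattern, in case it helps to see where the combinatorics come from (three cases already mean nine tuple combinations to cover):

    enum State { case a, b, c }

    func isValidTransition(from: State, to: State) -> Bool {
        switch (from, to) {
        case (.a, .b), (.b, .c):
            return true
        case (.a, .a), (.a, .c),
             (.b, .a), (.b, .b),
             (.c, .a), (.c, .b), (.c, .c):
            return false
        }
    }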
I implemented a prototype version of the algorithm in that paper when exploring exhaustiveness checking for pattern matching in Dart. I found it pretty easy to understand, but also really easy to get it to generate huge combinatorially large spaces. Some careful memoization and deduplication helped, but even so I never got the performance to a state I felt comfortable with.
Instead, I went with Luc Maranget's classic approach and figured out a way to adapt it to a language with subtyping (with a ton of work from Johnni Winther to figure out all of the hard complex cases around generics):
The performance (in the prototype!) was dramatically better. You can always make pattern matching go combinatorial, but I haven't seen any real-world switches get particularly slow with our approach yet, and we have some fairly large tests of matching on tuples of enums.
I've had some bad experiences with C++ here. Yes, part of that was losing the flow, which is probably why way back then scripting languages were easier to sell than these days (where you can throw more money at the issue, with umpteen-core dev machines).
But the other part was the reaction among developers to this. Layers of abstraction introduced just to please the compiler gods. C++ was particularly famous for this, all for the sake of partial compilation still working -- sometimes this had the benefit of decoupling, but often it was a few cake layers on top of that.
So you end up with convenient languages, but have to forsake some of that convenience to actually be productive. You still have to learn the convenient, unused parts and know the difference, even if you can almost never afford to use them (if it's just about runtime performance, you often don't need them; but you don't want to double the project's compilation or test phase just because that functional-combinatoric template metaprogramming approach is so much neater).
Oh man, if only we had Wirth-dows, not Windows ;)
I’ve been hearing about Jai for, what, maybe close to a decade already?! Is it actually gonna happen? I feel like it’s one of those languages that falls under the category of vaporware.
I don't think calling it vaporware is accurate, since it's one person's project for their own other projects. There isn't (nor should there be) any expectation of general availability of it.
One thing I hate about LLVM is that once you get into the ecosystem, getting to "another alternative" or "abandon LLVM" is hell: you get all these nice, cool features, and then implementing a custom compiler for debugging can be much, much harder than that.
Huge literals do occur in programmatically generated source code. I'm working on a project now where I generate C source code with large, static tables. Perhaps this is not a common scenario for Swift's target use cases.
Rust compiled in about a minute and the runtime was 2 secs
That seems really slow as well! Since reading the same thing in JSON only takes a fraction of a second.
I think it would be well worthwhile speeding up this kind of thing. Obviously a million-element static array isn’t common usage, but if you can find and fix the bottlenecks there, it will help make everyday usage faster too in lots of small ways.
In Rust you can include_bytes! a file if the situation is that you just want a whole bunch of data, as is often the case with practical systems. (The result of the include_bytes! macro is a &'static [u8; N], i.e. an immutable reference to an array of N unsigned bytes which lives forever.) So, for example, you can write code which cares how long firmware.bin is, without needing to ensure the code is updated appropriately, and yet doesn't bake firmware.bin inside your finished binary, by writing something like const FW_LENGTH: usize = include_bytes!("firmware.bin").len(); the compiler can see we don't actually want the array at runtime, just its length, so the data evaporates from the output software.
Parsing a programming language is just always going to be slower than just reading a file into memory, which is why Rust had include_bytes! from the outset and it's silly that C didn't do likewise even in 1989, let alone C++ in 1998.
This particular exercise wants a machine-word size integer, so in Rust that's isize, but you could isize::from_le_bytes() or whatever with chunked conversions, which will happen at runtime but ought to go very much faster than parsing text.
Okay, but the question I’m asking is, why does the basic version take 60 seconds to compile? Why not 6 seconds, or 60 milliseconds?
If there’s a good and insurmountable reason why it must be a minute and no less, okay. But I’d be amazed if there aren’t some easy wins to be made that could speed it up by a decent amount.
That’s not worth doing just because of the silly million-element array case, of course; but if you can make that silly case faster, lots of more important use cases will get faster too.
I mean, if you feel it's worth figuring this out, you totally can. Rust's source code really does successfully download and just build, so you can tinker with it. I patched it some weeks back to improve the diagnostics you get for type errors where e.g. you wrote 'X' (a 21-bit integer representing the Unicode Latin capital X) but you ought to have written b'X' (an 8-bit unsigned byte representing the ASCII code for X) - yes, numerically those values are identical, but Rust correctly does not consider that to mean they're the same type - and it was like an hour's work to build Rust and first figure out where to attempt my changes.
You can submit a PR and after a robot puts it in a pile to be looked at, actual humans will ask you about your proposed change, they can ask other robots to check whether it works OK on the huge piles of real world Rust out there, and so on.
I used to see the compiler run away, consuming all available RAM and CPU, back in the early days of Mac OS X. So this is not necessarily something unique to Swift. Then there was the ill-fated garbage collection system that got bolted onto Objective-C. The point after the compiler got fixed, ARC was added, and the GC was deprecated, in 2012, is when Cocoa development felt solid (to me).
I don’t know how you’d escape this. Other platforms have their own problems. As far as I can tell, they’re all science projects, because our understanding of language design and library design keeps changing.
> The point after the compiler got fixed, ARC was added, and the GC was deprecated, in 2012, is when Cocoa development felt solid (to me).
Seconded -- Obj-C was really nice to work with at that time.
I do like a lot of the new stuff in Swift, but in many ways it’s a return to the bad old days in terms of the overall dev experience.
> I do like a lot of the new stuff in Swift, but in many ways it’s a return to the bad old days in terms of the overall dev experience.
It depends on the angle you’re coming from IMO.
For me Swift has been a major improvement overall for a couple reasons: it has a ton of quality of life features that can only be had in Objective-C with a laundry list of CocoaPods, and its type system allows me to do fairly major refactors on a regular basis that I wouldn’t dream of trying with Obj-C. Not having to maintain header files is also a bigger deal than I thought it’d be…
I think a certain amount of ambition is necessary if you’re making a language for general use. You get important feedback from real-world use, and that means you need a population of developers willing to take risks on new languages.
If we go by the Objective C timeline, Swift will be pretty good in the year 2042.
I mean, I love Swift, but its use case is limited. While there are projects underway (and have been for years) to make it a server-side language, for example, I'd never use it for anything but native iOS / Mac code.
For OP's specific case, wouldn't it be better to just special-case the compiler to detect that it is a large vector, and pick the same data type for all elements of the vector? Suddenly an exponential search becomes linear.
You would think the part of the compiler that parses integer literals would assign them the type integer, making this close to a no op for the type checker.
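Though to be fair to the type checker, an integer literal in Swift doesn't inherently have the type Int; any type conforming to ExpressibleByIntegerLiteral can adopt it, and Int is only the default when nothing else constrains it. A hedged sketch:

    // A user-defined type can adopt integer literals too, so the checker
    // can't simply hard-code "integer literal means Int".
    struct Celsius: ExpressibleByIntegerLiteral {
        let degrees: Int
        init(integerLiteral value: Int) { degrees = value }
    }

    let readings: [Celsius] = [18, 21, 23]  // the same literals, now Celsius values
    let defaults = [18, 21, 23]             // unconstrained, so [Int]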
Tangent to this, but Swift is slow in surprising ways: JSON decoding and decoding strings into date objects are insanely slow. Both are common operations for an iOS app (e.g. a client/server app that decodes a JSON-encoded payload) and I have no idea why this wasn't solved years ago.
Another surprise is that Apple’s protobuf generator is able to produce idiomatic Swift; the same is not true for Kotlin :)
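To make the JSON/date point concrete, a hedged sketch of the common pattern (the type and field names are made up; the date-decoding strategy is usually where the time goes):

    import Foundation

    struct Event: Decodable {
        let id: Int
        let createdAt: Date
    }

    let json = Data(#"[{"id": 1, "createdAt": "2023-01-01T12:00:00Z"}]"#.utf8)

    let decoder = JSONDecoder()
    decoder.dateDecodingStrategy = .iso8601  // each Date field still goes through a formatter

    let events = try! decoder.decode([Event].self, from: json)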
I ran into a similar problem in a Fortran code, where a huge array was being assigned a bunch of literals one by one. The optimizing compiler slowed down substantially (though not to this extent), who knows what it hoped to find…