It's fascinating how differently languages approach the string formatting design...

umanwizard · 2025-04-11T00:46:27 1744332387

> Go developers seem to have taken no more than 5 minutes considering the problem, then thoughtlessly discarded it: [2]. A position born from pure ignorance as far as I'm concerned

There are a million things in go that could be described this way.

unscaled · 2025-04-11T02:40:21 1744339221

Looking at the various conversations involving string interpolation, this characterization is extremely unkind. They've clearly spent a lot more than 5 minutes thinking about this, including writing their own mini-proposals[1].

Are they wrong about this issue? I think they are. There is a big difference in ergonomics between String interpolation and something like fmt.Sprintf, and the performance cost of fmt.Sprintf is non-trivial as well. But I can't say they didn't put any thought into this.

As we've seen multiple times with Go generics and error handling before, their slow progress on correcting serious usability issues with the language stem from the same basic reasons we see with recent Java features: they are just being quite perfectionist about it. And unlike Java, the Go team would not even release an experimental feature unless they feel quite good about it.

[1] https://github.com/golang/go/issues/57616

mananaysiempre · 2025-04-11T19:05:52 1744398352

> There is a big difference in ergonomics between String interpolation and something like fmt.Sprintf

On the other hand, there’s a difference in localizability as well: the latter is localizable, the former isn’t. (It also worries me that I see no substantive discussion of localization in PEP 750.)

lou1306 · 2025-04-19T18:32:08 1745087528

T-strings should also help localizability, you can now just retrieve them from a locale -> t-string mapping and they should Just Work. Or am I missing something?

mananaysiempre · 2025-04-19T19:39:21 1745091561

There are two sides to this:

First—and more importantly—a professional translator’s interface is list of strings and some supplementary materials in, list of strings out. (Protip: given baseline i18n competence like not concatenating sencences out of parts, the quality of the translation you get is largely determined by the supplementary materials; screenshot every part of your UI and your translator will be willing to kiss you.) Both in communicating with clients and on the fast path of their own work (in CAT software like Across, Trados, etc.).

They do not need to see the code that fills in any placeholders, nor do they want to spend time and attention preserving it exactly as it’s been written—they just want to reorder the placeholders as the syntax of the language dictates. Arguably even %s vs %d is too much information. The ideal is {name} and {count}, and {1} and {2} are acceptable.

(By contrast, a very common issue is that like half of the sentence may need to change depending on the numeric value filled in for {count}, and two variants in English may map to anywhere between one and four after localization[1]. Template strings help with this not at all.)

Second—though this may be easier to fix—the two major ways to retrieve localized messages at runtime are integers in, strings out (most commercial tools out there) and strings in, strings out (GNU gettext), where the output strings are retrieved from some sort of data file that’s intentionally incapable of containing executable code (resource-only “MUI” DLLs, hashtables in GNU “MO”s, plain old text in Java “properties”, various kinds of XML, etc.).

Operating on pure data isn’t a strict necessity. It’s is largely a concession to the development process, which may not permit the string tables to go out to the localization contractors until principal QA is finished. The last thing you want is to insert engineering into the already-slow loop between the translators, the editors (or a translation agency employing both), and localization QA (hopefully in-house, using the actual live software to check the translations in context).

[1] https://www.gnu.org/software/gettext/manual/html_node/Plural...

Mawr · 2025-04-12T01:55:32 1744422932

I just expect better from professional language designers. To me, the blindingly obvious follow up to the thought "We understand that people familiar with other languages would like to see string interpolation in Go." [1] is to research how said other languages have gone about implementing this and to present a brief summary of their findings. This is table stakes stuff.

Then there's "You can [get] a similar effect using fmt.Sprint, with custom functions for non-default formatting." [2]:

- Just the fact that "you can already do this" needs to be said should give the designers pause. Clearly you can't already do this if people are requesting a new feature. Indeed, this situation exactly mimics the story of Go's generics - after all, they do not let you do anything you couldn't do before, and yet they got added to Go. It's as if ergonomics matter, huh.

Another way to look at this: if fmt.Sprint is so good it should be used way more than fmt.Sprintf right? Should be easy to prove :)

- The argument crumbles under the load-bearing "similar effect". I already scratched the surface of why this is wrong in a sibling post: [3].

I suspect the reason for this shallow dismissal is the designers didn't go as far as to A/B test their proposal themselves, so their arguments are based on their gut feel instead of experience. That's the only way I can see someone would come up with the idea that fmt.Sprint and f-strings are similar enough. They actually are if all you do is imagine yourself writing the simplest case possible:

    fmt.Sprint("This house is ", measurements(2.5), " tall")

    f"This house is {measurements(2.5)} tall"

Similar enough, so long as you're willing to handwave away the need to match quotation marks and insert commas and don't spend time coding using both approaches. If you did, you'd find that writing brand new string formatting statements is much rarer than modifying existing ones. And that's where the meat of the differences is buried. Modifying f-strings is trivial, but making any changes to existing fmt.Sprint calls is painful.

P.S. Proposing syntax as noisy as:

    fmt.Println("This house is \(measurements(2.5)) tall")

is just another sign the designers don't get it. The entire point is to reduce the amount of typing and visual noise.

[1]: https://github.com/golang/go/issues/57616#issuecomment-14509...

[2]: https://github.com/golang/go/issues/34174#issuecomment-14509...

[3]: https://news.ycombinator.com/item?id=43651419

pansa2 · 2025-04-12T02:48:45 1744426125

> Proposing syntax as noisy as […] is just another sign the designers don't get it.

Are you objecting to the use of `\(…)` here instead of `{…}`? Because of the extra character or because of the need to nest parentheses?

nu11ptr · 2025-04-11T00:04:50 1744329890

Value types anyone? I have zero doubt it is tough to add and get right, esp. to retrofit, but it has been so many years that I have learned/discarded several new languages since Java... and they STILL aren't launched yet.

mjevans · 2025-04-11T00:28:42 1744331322

Go(lang)'s rejection makes sense.

A format function that arbitrarily executes code from within a format string sounds like a complete nightmare. Log4j as an example.

The rejection's example shows how that arbitrary code within the string could instead be fixed functions outside of a string. Safer, easier for compilers and programmers; unless an 'eval' for strings is what was desired. (Offhand I've only seen eval in /scripted/ languages; go makes binaries.)

paulddraper · 2025-04-11T01:31:46 1744335106

No, the format function doesn't "arbitrarily execute code."

An f/t string is syntax not runtime.

Instead of

    "Hello " + subject + "!"

you write

    f"Hello {subject}!"

That subject is simple an normal code expression, but one that occurs after the opening quote of the literal and before the ending quote of the literal.

And instead of

    query(["SELECT * FROM account WHERE id = ", " AND active"], [id])

you write

    query(t"SELECT * FROM account WHERE id = {id} AND active")

It's a way of writing string literals that if anything makes injection less likely.

mjevans · 2025-04-11T02:08:29 1744337309

Please read the context of my reply again.

The Rejected Golang proposal cited by the post I'm replying to. NOT Python's present PEP or any other string that might resolve magic variables (just not literally eval / exec functions!).

zahlman · 2025-04-11T02:23:27 1744338207

As far as I can tell from the linked proposal, it wouldn't have involved such evaluation either. It seems like it was intended to work fundamentally the same way as it currently does in Python: by analyzing the string literal ahead of time and translating into equivalent explicit formatting code, as syntactic sugar. There seem to have been many misunderstandings in the GitHub discussion.

mjevans · 2025-04-11T03:00:52 1744340452

In that case, I might have misunderstood the intent of those examples.

However the difficulty of understanding also illustrates the increased maintenance burden and language complexity.

eviks · 2025-04-11T06:26:32 1744352792

Unless workarounds to a missing feature have a higher maintenance burden like in this case, and you can't avoid it via learning

mjevans · 2025-04-11T09:39:54 1744364394

Go's preferred way would probably be something like compute the aliased operations on the line(s) before, then reference the final values.

E.G. Adapting https://github.com/golang/go/issues/34174

    f := 123.45
    fmt.Fprintln("value=%08.3f{f}") // value=0123.450
    fmt.Fprintln("value=%08.3f", f) // value=0123.450
    s := "value"
    fmt.Fprintln("value='%50s{s}'") // value='<45 spaces>value'
    fmt.Fprintln("value='%50s'", s) // value='<45 spaces>value'

The inline {variable} reference suffix format would be less confusing for situations that involve _many_ variables. Though I'm a bit more partial to this syntax with an immediately trailing %{variable} packet since my gut feeling is that special case would be cleaner in a parser.

    fmt.Fprintln("value=%08.3f%{f}") // value=0123.450
    fmt.Fprintln("value='%50s%{s}'") // value='<45 spaces>value'

paulddraper · 2025-04-11T02:34:39 1744338879

The proposal cited Swift, Kotlin, and C# which have similar syntax sugar.

The proposal was for the same.

chrome111 · 2025-04-11T11:44:20 1744371860

Thanks for this example - it makes it clear it can be a mechanism for something like sqlc/typed sql (my go-to with python too, don't like orms) without a transpilation step or arguably awkward language API wrappers to the SQL. We'll need linters to prevent accidentally using `f` instead of `t` but I guess we needed that already anyways. Great to be able to see the actual cost in the DB without having to actually find the query for something like `typeddb.SelectActiveAccount(I'd)`. Good stuff.

WorldMaker · 2025-04-11T15:05:58 1744383958

The PEP says these return a new type `Template`, so you should be able to both type and/or duck type for these specifically and reject non-Template inputs.

paulddraper · 2025-04-11T16:47:55 1744390075

It is a different type.

You can verify that either via static typechecking, or at runtime.

miki123211 · 2025-04-11T01:20:31 1744334431

In many languages, f-strings (or f-string like constructs) are only supported for string literals, not user-supplied strings.

When compiling, those can be lowered to simple string concatenation, just like any for loop can be lowered to and represented as a while.

zahlman · 2025-04-11T02:18:29 1744337909

In case there was confusion: Python's f-string functionality in particular is specific to string literals. The f prefix doesn't create a different data type; instead, the contents of the literal are parsed at compile time and the entire thing is rewritten into equivalent string concatenation code (although IIRC it uses dedicated bytecodes, in at least some versions).

The t-string proposal involves using new data types to abstract the concatenation and formatting process, but it's still a compile-time process - and the parts between the braces still involve code that executes first - and there's still no separate type for the overall t-string literal, and no way to end up eval'ing code from user-supplied data except by explicitly requesting to do so.

the_clarence · 2025-04-11T03:40:18 1744342818

There is no compile time in python

zahlman · 2025-04-11T03:54:18 1744343658

Yes, there is.

Python source code is translated into bytecode for a VM just like in Java or C#, and by default it's cached in .pyc files. It's only different in that you can ask to execute a source code file and the compilation happens automatically before the bytecode-interpretation.

`SyntaxError` is fundamentally different from other exceptions because it can occur during compilation, and only occurs at run-time if explicitly raised (or via explicit invocation of another code compilation, such as with `exec`/`eval`, or importing a module). This is also why you can't catch a `SyntaxError` caused by the invalid syntax of your own code, but only from such an explicit `raise` or a request to compile a source code string (see https://stackoverflow.com/questions/1856408 ).

pansa2 · 2025-04-11T03:43:38 1744343018

Yes there is, when it compiles source code to bytecode.

mjevans · 2025-04-11T02:05:48 1744337148

My reply was to the parent post's SPECIFIC example of Golang's rejected feature request. Please go read that proposal.

It is NOT about the possibility of referencing existing / future (lazy / deferred evaluation) string literals from within the string, but about a format string that would literally evaluate arbitrary functions within a string.

unscaled · 2025-04-11T02:25:38 1744338338

The proposal doesn't say anything about executing code in user-supplied strings. It only talks about a string literal that is processed by the compiler (at which point no user-supplied string can be available).

On the other hand, the current solution offered by Go (fmt.Sprintf) is the one who supports a user-supplied format String. Admittedly, there is a limited amount of damage that could be done this well, but you can at the very least cause a program to panic.

The reason for declining this feature[1] has nothing to do with what you stated. Ian Lance Taylor simply said: "This doesn't seem to have a big advantage over calling fmt.Sprintf" and "You can a similar effect using fmt.Sprint". He conceded that there are performance advantages to string interpolation, but he doesn't believe there are any gains in usability over fmt.Sprintf/fmt.Sprint and as is usual with Go (compared to other languages), they're loathe to add new features to the compiler[2].

[1] https://github.com/golang/go/issues/34174#issuecomment-14509...

[2] https://github.com/golang/go/issues/34174#issuecomment-53013...

NoTeslaThrow · 2025-04-11T01:22:25 1744334545

What's the risk of user supplied strings? Surely you know their size. What else is there to worry about?

NoTeslaThrow · 2025-04-11T01:13:55 1744334035

> A format function that arbitrarily executes code from within a format string

So, a template? I certainly ain't gonna be using go for its mustache support.

bcoates · 2025-04-11T02:53:38 1744340018

No, it's exactly the opposite--f-strings are, roughly, eval (that is, unsanitary string concatenation that is presumptively an error in any nontrivial use) to t-strings which are just an alternative expression syntax, and do not even dereference their arguments.

rowanG077 · 2025-04-12T19:29:27 1744486167

f-strings are not eval. It's not dynamic. It's simply an expression that is ran just like every other expression.

bcoates · 2025-04-13T20:44:24 1744577064

Right, and then if you do literally anything with the output other than print() to a tty, it’s an escaping/injection attack.

any_func(f"{attacker_provided}") <=> eval(attacker_provided), from a security/correctness perspective

rowanG077 · 2025-04-23T20:53:46 1745441626

Shooting any unsanitized input into your application is bad. template strings don't make this worse. any_func(attacker_provided) is even worse then any_func(t"{attacker_provided}") since in the later case you actually have reduced the attack surface to just strings.

saagarjha · 2025-04-15T05:33:27 1744695207

How is this any different from any_func(attacker_provided)

thayne · 2025-04-11T04:01:50 1744344110

That issue has a link to another Issue with more discussion: https://github.com/golang/go/issues/57616.

But as is all too common in the go community, there seems to be a lot of confusion about what is proposed, and resistance to any change.

cherry_tree · 2025-04-11T03:22:05 1744341725

>Go developers seem to have taken no more than 5 minutes considering the problem, then thoughtlessly discarded it

The issue you linked was opened in 2019 and closed with no new comments in 2023, with active discussion through 2022.

cortesoft · 2025-04-11T00:41:46 1744332106

Then there is Ruby, which just has beautiful string formatting without strange decorators.

BiteCode_dev · 2025-04-21T07:07:20 1745219240

t-string are lazy, which is the point (escaping HTML, translating strings when you get preferred language headers, preparing SQL statements...).

Does Ruby strings already allow lazy processing ?

I'm not talking about wrapping them in a block and passing the block (all languages can do that with a lambdas) but a having literally that eventually resolves to something when you use it.

psychoslave · 2025-04-21T11:11:06 1745233866

That's seems like the wrong pattern, maybe I'm missing something.

Ruby has lazy evaluation with a generic lazy enumeration facility, whether to produce string or any kind of object.

That is, I don't know what is the actual behavior of the default string interpolation in Ruby, but if profiling a codebase some string generation would gain lazy evaluation, there is a path to do so. But in the general case, does it really matter? Chances are good that a string construction is not a big bottleneck.

Does Python miss such a feature of generic lazy enumeration, or is it so painful to use that some syntactic sugar felt like a must have? Genuine question here, I don't have any strong opinion on this t-string feature.

BiteCode_dev · 2025-04-21T12:11:30 1745237490

It has lazy enumeration with generators, but:

    - string construction is a hot path, you don't want them to always be lazy, especially since any access is slow in python.

    - having it using a string syntax is just very clean and easy to read. It's explicit and can be supported by good editor highlighting.

    - it's easy to grep, analyse for, substitute, etc.

    - you get one single unified API instead of thousands of variations. Translations API, log API and escaping API can all look the same, arguments are in the same shapes.

psychoslave · 2025-04-21T15:35:05 1745249705

Thanks for the detailed answer.

I understand that string generation can be a hotpath, though I wouldn't take it as a general certain fact.

From what I understand here the benefit in term of performance is mainly due to partial application automatically handled by the interpreter. It's hard to me to jauge actual pro/con compared to Ruby which can also leverage on freezed string, lambda, miscellaneous lazy evaluation facilities for example. I'm not aware of anything close in PHP, to stay in the realm of popular interpreted languages. I didn't make any Lua for a long time, so no idea how it evolved on that matter.

bshacklett · 2025-04-11T01:14:18 1744334058

That tracks. Ruby followed in the footsteps of Perl, which had string manipulation as a main priority for the language.

BiteCode_dev · 2025-04-21T07:07:42 1745219262

Does perl have lazy string processing? And I'm not talking about a coderefs hack.

1980phipsi · 2025-04-11T01:04:00 1744333440

D had a big blow up over string interpolation. Walter wanted something simple and the community wanted something more like these template ones from Python (at least from scanning the first little bit of the PEP). Walter eventually went with what the community wanted.

gthompson512 · 2025-04-11T12:17:33 1744373853

This led to the OpenD language fork (https://opendlang.org/index.html) which is led by some contributors who had other more general gripes with D. The fork is trying to merge in useful stuff from main D, while advancing the language. They have a Discord which unfortunately is the main source of info.

lynndotpy · 2025-04-11T13:42:11 1744378931

For all its other problems, f-strings make Python such a pleasure to work with. C# has something similar IIRC.

throwaway2037 · 2025-04-11T04:15:28 1744344928

I promise, no trolling from me in this comment. I never understood the advantage of Python f-strings over printf-style format strings. I tried to Google for pros and cons and didn't find anything very satisfying. Can someone provide a brief list of pros and cons? To be clear, I can always do what I need to do with both, but I don't know f-strings nearly as well as printf-style, because of my experience with C programming.

Mawr · 2025-04-11T07:49:00 1744357740

Sure, here are the two Go/C-style formatting options:

    fmt.Sprintf("This house is %s tall", measurements(2.5))

    fmt.Sprint("This house is ", measurements(2.5), " tall")

And the Python f-string equivalent:

    f"This house is {measurements(2.5)} tall"

The Sprintf version sucks because for every formatting argument, like "%s", we need to stop reading the string and look for the corresponding argument to the function. Not so bad for one argument but gets linearly worse.

Sprint is better in that regard, we can read from left to right without interruptions, but is a pain to write due to all the punctuation, nevermind refactor. For example, try adding a new variable between "This" and "house". With the f-string you just type {var} before "house" and you're done. With Sprint, you're now juggling quotation marks and commas. And that's just a simple addition of a new variable. Moving variables or substrings around is even worse.

Summing up, f-strings are substantially more ergonomic to use and since string formatting is so commonly done, this adds up quickly.

throwaway2037 · 2025-04-11T11:44:12 1744371852

    > Not so bad for one argument but gets linearly worse.

This is a powerful "pro". Thanks.

theptip · 2025-04-11T04:37:20 1744346240

    _log(f”My variable is {x + y}”)

Reads to me a lot more fluently to me than

    _log(“My variable is {}”.format(x+y))

or

    _log(“My variable is {z}”.format(z=x+y))

It’s nothing too profound.

blami · 2025-04-13T11:30:09 1744543809

I am not very familiar with Python. How do you localize (translate) first one?

wzdd · 2025-04-15T02:10:50 1744683050

You don't with f-strings because they're substituted eagerly. You could with the new t-strings proposed here because you can get at the individual parts.

BiteCode_dev · 2025-04-21T07:09:29 1745219369

That's what t-strings are about. They are lazy, so you can mark them for translation "as-is".

oliwarner · 2025-04-11T07:23:05 1744356185

It's especially weird how hard people have to fight for string interpolation given it has had implementations since the 1970s.

Even PEP 498 (fstrings) was a battle.

bjourne · 2025-04-12T14:05:56 1744466756

Superficially f-strings reminds you of php and everyone remembers how awful that was. But Python's implementation is leagues better and we also have better tooling (ie smart parsers) for handling fstrings.

amitport · 2025-04-21T04:56:29 1745211389

are you just going to ignore Javascript?

dionian · 2025-04-10T23:35:21 1744328121

Looks great - unlike java which is somehow recommending the format:

STR."Hello \{this.user.firstname()}, how are you?\nIt's \{tempC}°C today!"

compared to scala

s"Hello ${this.user.firstname()}, how are you?\nIt's ${tempC}°C today!"

STR."" ? really?

paulddraper · 2025-04-11T01:37:26 1744335446

Yeah, I hate to bikeshed, but this is the worst syntax possible without being a full-out prank.

nsonha · 2025-04-11T00:14:33 1744330473

also a syntax for braces that looks like escaping

dionian · 2025-04-17T13:52:52 1744897972

Yeah, that almost bothers me more than "STR."

Symbiote · 2025-04-21T07:13:08 1745219588

\{ is currently invalid syntax in Java, so it can be given a meaning.

${ is valid syntax (the string value "$\") so giving it a new meaning would break existing programs. That's not acceptable.