That's not really a C implementation of GPT-2 since it cannot be used to do the thing everyone cares about: self-supervised learning from text. In fact, it doesn't even use the weights in the same way GPT-2 does, so it's not clear how close it is to GPT-2's inference mode. The source isn't even on the page.
This is very cool, thanks for sharing! From the readme (https://bellard.org/nncp/readme-gpt2tc.txt), the program benchmarks very comparably to CMIX, which is the top algorithm on the Large Text Compression Benchmark
(http://mattmahoney.net/dc/text.html). I'm guessing any GPT implementation would be ineligible for the benchmark because of its file size, but it's impressive nonetheless.
I don't disagree that hardware acceleration is key to enabling these models, but I still find it interesting how simple the core techniques are.
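For a sense of what "simple" means here: the whole trick is that a language model's next-symbol probabilities can drive an arithmetic coder, so better prediction directly means smaller output. Below is a minimal Python sketch of that idea; it is not Bellard's gpt2tc code, and a toy adaptive order-0 byte model stands in for GPT-2, but the structure is the same: the ideal code length is just the sum of -log2 p(next symbol).

    import math
    from collections import Counter

    def ideal_code_length_bits(data, predict):
        # Sum of -log2 p(symbol | preceding context): the size in bits an
        # ideal arithmetic coder would reach under this probability model.
        total = 0.0
        for i, ch in enumerate(data):
            p = predict(data[:i], ch)
            total += -math.log2(p)
        return total

    def order0_predict(context, symbol):
        # Toy adaptive order-0 model (NOT GPT-2): probability proportional to
        # how often the symbol has appeared so far, with +1 smoothing over a
        # 256-symbol byte alphabet.
        counts = Counter(context)
        return (counts[symbol] + 1) / (len(context) + 256)

    text = "the quick brown fox jumps over the lazy dog " * 20
    bits = ideal_code_length_bits(text, order0_predict)
    print(f"{len(text)} input bytes -> about {bits / 8:.0f} bytes under the toy model")

Swap the toy model for GPT-2's conditional probabilities and add a real arithmetic coder for the bit-level I/O, and you essentially have a language-model-based compressor; the modeling is where all the hardware goes, not the coding step.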