With all the great progress in large language models lately, and given that they're excellent text compressors, I've started to wonder if you couldn't just replace a search engine with a ~100 MB file of weights that lets you query essentially Google-scale results, except all locally.
Yeah, you picked the biggest SOTA model of them all, but there are smaller ones, like https://bellard.org/libnc/gpt2tc.html, that run well even on CPUs and might do fine when fine-tuned specifically on search results (or at least just code queries).
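For what it's worth, the querying side is trivial to sketch. Here's roughly what it looks like in Python with the Hugging Face transformers library and stock GPT-2 small, as a stand-in for gpt2tc; no search-specific fine-tuning, so treat it as an illustration of the interface rather than of the answer quality:

```python
# Rough sketch: use a small local LM as a "search engine" by prompting it
# with a query and sampling a completion. Note GPT-2 small is ~500 MB in
# fp32, so hitting a 100 MB target would take quantization or a smaller model.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # CPU is fine

query = "how to reverse a linked list in C"
# The model "answers" by continuing the prompt: it produces plausible text
# from its weights, not retrieved documents.
out = generator(f"Q: {query}\nA:", max_new_tokens=60, do_sample=True)
print(out[0]["generated_text"])
```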
The only significant difference between those models is the amount of data they were trained on, and the main (or at least second most important) reason you use a search engine is how much data it has indexed.
If you want to search through an incredibly limited percentage of the web, then yeah, it can be a solution, but even the lamest search engine company out there would outperform a GPT-2-like model running on your laptop.