I don't know of any 180B models that outperform gpt-3.5, including Falcon (accor...

selfhoster11 · on Sept 12, 2023

Benchmarks don't necessarily reflect real-world performance, especially since they poorly measure the more esoteric aspects of a model that, for now, can only be judged qualitatively. I would wait a bit to see what the community comes up with before writing off Falcon-180B.

Der_Einzige · on Sept 11, 2023

Note that quantization has a serious impact on the model's capabilities.