The fundamental techniques they use are highly lossy and far inferior to ultra-long-context models, where you can do it all in one prompt. Hate to break it to you and all the others.
The methods they employ improve the context given to the model, irrespective of the context length. Even as context lengths grow, these methods will still be used to shrink the search space and the resources required for a single task (think stream search vs. indexed search; a toy sketch follows below).
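To make the analogy concrete, here's a minimal sketch with a hypothetical toy corpus (the names and data are mine, just for illustration): a stream search rescans every document on every query, while a one-time inverted index turns each query into a cheap lookup over a much smaller candidate set.

    # Stream search: O(N * doc length) per query, touches every document.
    # Indexed search: one O(N) build pass, then each query is a dict lookup.
    from collections import defaultdict

    corpus = {  # hypothetical toy corpus
        "doc1": "long context windows are expensive",
        "doc2": "retrieval narrows the search space",
        "doc3": "indexes trade build time for query time",
    }

    def stream_search(query: str) -> list[str]:
        # Scans the full corpus for every query.
        return [doc_id for doc_id, text in corpus.items() if query in text]

    def build_index() -> dict[str, set[str]]:
        # One-time pass; afterwards lookups skip non-matching documents entirely.
        index: dict[str, set[str]] = defaultdict(set)
        for doc_id, text in corpus.items():
            for token in text.split():
                index[token].add(doc_id)
        return index

    index = build_index()
    print(stream_search("search"))   # ['doc2']
    print(sorted(index["search"]))   # ['doc2']

Same results either way; the point is that the index bounds the work per query no matter how large the corpus (or the context window) gets.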
I’m also curious which paper you’re referencing that finds more context, rather than more relevant context, yields better results?