I’ve actually found the opposite. At work, we moved from a fine-tuned model to a RAG system for internal and external documentation, plus a generic coding-focused model for code.
Fine-tuning against in-house code seems like a small gain over a base model plus search. It’s unlikely your code is so unique, special, and large that a base model can’t get good results on it. You’ll be pinned to one version of one model, and you won’t be able to upgrade to future models nearly as quickly. Of course, every commit changes the code, so you’re also fighting time unless you continually re-fine-tune.
A RAG model might still struggle with a super vague question like “where does foo call bar with baz set”, but it’s unlikely fine-tuning would handle that any better. This is where static code search by symbols really should be used.
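As a rough illustration of what I mean by symbol search, here's a minimal sketch that shells out to ripgrep; the rg dependency and the foo/bar names are just assumptions carried over from the example question, not any particular tool's API:

    # Hypothetical sketch: answer "where does foo call bar" with plain symbol
    # search instead of retrieval or fine-tuning. Assumes ripgrep ("rg") is
    # installed; "foo" and "bar" are the made-up symbols from the question.
    import subprocess

    def find_call_sites(symbol: str, path: str = ".") -> list[str]:
        """Return matching lines where 'symbol(' appears, a rough proxy for call sites."""
        result = subprocess.run(
            ["rg", "--line-number", "--with-filename", rf"{symbol}\s*\(", path],
            capture_output=True, text=True,
        )
        return result.stdout.splitlines()

    # Narrow to files that mention foo, then look for bar calls inside them.
    foo_files = {line.split(":")[0] for line in find_call_sites("foo")}
    for f in sorted(foo_files):
        for hit in find_call_sites("bar", f):
            print(hit)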
There are frameworks for graph-based RAG that mix both approaches. One LLM encodes information as a knowledge graph, gradually building up an ontology. Another LLM queries this knowledge graph by emitting speculative queries. As the database grows, the second LLM is fine-tuned again and again on example queries that use the ontology the first LLM came up with.
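In rough Python, the build/query split looks something like the sketch below; the llm() helper, the prompts, and the triple format are all assumptions, and the periodic fine-tuning of the query model is left out:

    # A minimal sketch of the two-LLM graph-RAG loop described above.
    import json
    import networkx as nx

    def llm(prompt: str) -> str:
        """Placeholder for whatever chat-completion client is actually used."""
        raise NotImplementedError

    def extract_triples(document: str) -> list[tuple[str, str, str]]:
        # First LLM: encode the document as (subject, relation, object) triples.
        raw = llm("Extract knowledge-graph triples as a JSON array from:\n" + document)
        return [tuple(t) for t in json.loads(raw)]

    def build_graph(documents: list[str]) -> nx.MultiDiGraph:
        graph = nx.MultiDiGraph()
        for doc in documents:
            for subj, rel, obj in extract_triples(doc):
                graph.add_edge(subj, obj, relation=rel)
        return graph

    def answer(question: str, graph: nx.MultiDiGraph) -> str:
        # Second LLM: emit a speculative query (here simplified to "which
        # entities to look up"), then answer from those entities' neighborhoods.
        entities = json.loads(llm("List relevant entities as a JSON array for: " + question))
        facts = [
            f"{u} -[{d['relation']}]-> {v}"
            for e in entities if e in graph
            for u, v, d in graph.edges(e, data=True)
        ]
        return llm("Answer using only these facts:\n" + "\n".join(facts) + "\n\nQ: " + question)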
RAG definitely is helpful! Fine-tuning imo is extremely powerful, but it's still relatively close to alchemy - technically GPT-4, Claude, or any large model is a fine-tune of a base model! Reasoning fine-tuning is also very powerful!
Tbh the hardest part is the lifecycle - i.e. new data, updating, serving, etc. - that seems to be the biggest issue.
Is anyone having success with iteratively feeding chunks of code (or other documents) to an LLM for search? I understand 'needle in a haystack' issues with LLMs are quite bad, but RAG is quite bad too, and a lot of that haystack research seems to be about feeding in very large contexts at once.
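For what it's worth, the version of this I've seen sketched is a map/reduce loop over chunks rather than one giant context; everything below (the llm() placeholder, the chunk size, the prompts) is an assumption rather than a tested recipe:

    # Hedged sketch of "iteratively feed chunks to the LLM" search.
    def llm(prompt: str) -> str:
        """Placeholder for an actual chat-completion call."""
        raise NotImplementedError

    def chunked(text: str, size: int = 8000) -> list[str]:
        return [text[i:i + size] for i in range(0, len(text), size)]

    def search(question: str, documents: list[str]) -> str:
        # Map step: ask about each chunk separately, keeping only relevant
        # notes, so no single call has to survive a huge haystack.
        notes = []
        for doc in documents:
            for chunk in chunked(doc):
                reply = llm(
                    "Question: " + question + "\n\nChunk:\n" + chunk +
                    "\n\nIf the chunk is relevant, summarize the relevant parts; "
                    "otherwise reply IRRELEVANT."
                )
                if "IRRELEVANT" not in reply:
                    notes.append(reply)
        # Reduce step: answer from the collected notes only.
        return llm("Question: " + question + "\n\nNotes:\n" + "\n\n".join(notes))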
Well, why not both? If you've already got a tuned model, why not use RAG on top of it to get even better results? It already knows the big picture; it just needs the details so it doesn't have to hallucinate them.