(Disclaimer: I'm the founder of OpenPipe, one of the fine-tuning services OP tried and ultimately the one that produced the highest performing model, it appears.)
Data extraction is a use case that fine-tuned models are fantastic at, so I'm not surprised that OP got good results. That said, I've also found it's pretty easy to beat GPT-4 across many task types if you have a way of getting strong training data. We published some research[1] a week ago where we found that across 4 example tasks (creative summarization, question answering, data extraction, and classification), a fine-tuned Llama 3 8B outperformed GPT-4 on 3 of them. The key was creating a repeatable way of generating high-quality training data, which is also addressed in the post.
Is this something that, as a tech enthusiast who's no expert, I can easily fine-tune and run?
My use case would be fine-tuning on technical docs: specific news, 2 years of blog posts, primary source material, and Twitter explainer threads. I want to gather all the niche information on a topic from the last two years, dump it in, and end up with an LLM that's a subject-matter expert.
Fine-tuning doesn't quite work that way. You have to format the training data as request/response pairs. The point of fine-tuning is to get the model to output things in a specific format, style, or structure.
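To make the request/response point concrete, here's a minimal sketch of what a training set typically looks like, one JSON object per line (JSONL), using the OpenAI-style chat format; the task and field names are just illustrative:

```python
import json

# Each training example is a request/response pair: the prompt the model
# will see, and the exact completion you want it to learn to produce.
examples = [
    {
        "messages": [
            {"role": "system", "content": "Extract the product name and price as JSON."},
            {"role": "user", "content": "The Acme Widget is on sale for $19.99."},
            {"role": "assistant", "content": '{"product": "Acme Widget", "price": 19.99}'},
        ]
    },
]

# Serialize as JSONL: one example per line, ready to upload to a
# fine-tuning service.
jsonl = "\n".join(json.dumps(ex) for ex in examples)
```

Note there's no natural request/response structure in a pile of blog posts and tweets, which is why raw docs don't slot directly into fine-tuning.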
Your use case is better suited to RAG. This is where you retrieve data from a large dataset and inject it into the user's request so the AI model has the context it needs to answer accurately.
But that's not a silver bullet either: you'd need to spend significant time on chunking strategy and result ranking to get decent answer accuracy.
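The retrieve-and-inject loop can be sketched in a few lines. This is a toy version using bag-of-words overlap as the retrieval score; a real system would use embeddings, proper chunking, and a reranker, and the documents here are made up:

```python
import math
from collections import Counter

# Toy corpus standing in for "2 years of blog posts and threads".
docs = [
    "The Foo API rate limit is 100 requests per minute.",
    "Blog post: how we migrated from REST to gRPC in 2023.",
    "Twitter thread: the new auth flow uses rotating tokens.",
]

def score(query: str, doc: str) -> float:
    # Crude relevance: word overlap, length-normalized.
    q, d = Counter(query.lower().split()), Counter(doc.lower().split())
    return sum((q & d).values()) / math.sqrt(len(doc.split()) + 1)

def retrieve(query: str, k: int = 1) -> list[str]:
    return sorted(docs, key=lambda d: score(query, d), reverse=True)[:k]

def build_prompt(query: str) -> str:
    # Inject the retrieved chunks into the request so the model
    # has the context it needs to answer accurately.
    context = "\n".join(retrieve(query))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer using only the context."

prompt = build_prompt("What is the Foo API rate limit?")
```

The hard parts the parent mentions (chunking, ranking) all live inside `score` and `retrieve` in a real system.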
Here is an example on the Predibase platform, which is referenced in the article for the Solar model but can also train Llama-3, Phi-3, and Mistral: https://www.youtube.com/watch?v=R2JQhzfaOFw&themeRefresh=1 I think you can judge for yourself whether it's easy enough for you. (Predibase founder here)
Why isn't someone providing a "meta model" that uses an LLM to choose between various fine-tuned models depending on the question, to get better overall results than GPT-4?
Founding AI Engineer at OpenPipe here. Using a fine-tuned "router LLM" to route between various specialized models (fine-tuned or not) depending on the input is becoming a common pattern in more modern "graph-like" LLM applications.
You can see how that "routing function" could include a call to a "router LLM." And yes, fine-tuning is a great way to improve the routing intelligence of said router LLM.
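A minimal sketch of that routing function, assuming `call_router_llm` stands in for the fine-tuned router model (stubbed here with keyword rules so the sketch runs standalone; the model names are invented):

```python
# Map of router labels to hypothetical specialized models.
SPECIALISTS = {
    "extraction": "ft-llama3-extract",
    "summarization": "ft-llama3-summarize",
    "general": "gpt-4",
}

def call_router_llm(user_input: str) -> str:
    # In production this would be a call to a small fine-tuned router LLM
    # that returns exactly one label; stubbed with keyword matching here.
    text = user_input.lower()
    if "extract" in text:
        return "extraction"
    if "summarize" in text:
        return "summarization"
    return "general"

def route(user_input: str) -> str:
    label = call_router_llm(user_input)
    return SPECIALISTS.get(label, SPECIALISTS["general"])

model = route("Extract all email addresses from this page")
```

Fine-tuning the router is attractive because the label space is small and fixed, which is exactly the kind of classification task small fine-tuned models handle well.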
Worth mentioning that you don't even need separate models to implement this. Dynamically loading LoRA adapters onto a single base model is much more efficient, and is the approach Apple took.
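A toy illustration of why adapter swapping is so cheap: each LoRA adapter is just a low-rank pair (A, B) whose product patches the shared base weight matrix, so per-task state is tiny compared to a full model copy. Shapes and task names are illustrative, not any real model's:

```python
import random

d, r = 8, 2  # hidden size, adapter rank (real models: d in the thousands, r ~ 8-64)
random.seed(0)

def mat(rows: int, cols: int) -> list[list[float]]:
    return [[random.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def matadd(X, Y):
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(X, Y)]

W_base = mat(d, d)  # shared base weights, loaded once

# One small (A, B) pair per task, swapped in per request.
adapters = {
    "extraction": (mat(r, d), mat(d, r)),
    "summarization": (mat(r, d), mat(d, r)),
}

def effective_weights(task: str):
    A, B = adapters[task]
    return matadd(W_base, matmul(B, A))  # W + BA, no reload of W_base
```

Here each adapter costs 2*r*d = 32 floats versus d*d = 64 for another full matrix, and the gap widens as d grows; that's why serving many adapters on one base model beats hosting many separate fine-tuned models.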
It seems like you don't work for OpenPipe (OP), so it probably doesn't matter for you, but it could (and should) matter a whole lot to OpenPipe and/or their customers.
[1]: https://openpipe.ai/blog/mixture-of-agents