Most LLMs are deterministic, but the tooling around them samples randomly from the output distribution to let users explore the nearby space of responses without having to write infinitely nuanced prompts. You can turn this off (e.g. by setting the sampling temperature to 0).
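To make the distinction concrete, here's a minimal sketch of what "turning it off" means. The function name and numbers are hypothetical; real serving stacks do the same thing on the logits the model emits for the next token:

```python
import numpy as np

def sample_next_token(logits, temperature=1.0, rng=None):
    """Pick the next token id from raw logits.

    temperature == 0 falls back to greedy argmax decoding, which is
    deterministic; temperature > 0 samples from the softmax distribution,
    which is what most chat tooling does by default.
    """
    if temperature == 0.0:
        return int(np.argmax(logits))          # deterministic: same logits, same token
    if rng is None:
        rng = np.random.default_rng()
    scaled = logits / temperature
    scaled = scaled - scaled.max()             # shift for numerical stability
    probs = np.exp(scaled) / np.exp(scaled).sum()
    return int(rng.choice(len(logits), p=probs))

logits = np.array([1.0, 5.0, 2.0])
sample_next_token(logits, temperature=0.0)     # always token 1
sample_next_token(logits, temperature=1.0)     # usually token 1, sometimes not
```

With temperature 0 the same logits always yield the same token; any positive temperature reintroduces run-to-run variation.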
However, the structure of OpenAI's GPT-4 is not deterministic. The most likely explanation I've seen is that they only activate some parts of the model for each input, and the parts are load-balanced so sometimes a different part of the model will be responding. https://news.ycombinator.com/item?id=37006224
This non-deterministic sampling isn't only for users to explore the space of responses. Without it, the LLM itself is prone to generating overly repetitive text.
> they only activate some parts of the model for each input
Perhaps you see seemingly random results because OpenAI is A/B testing multiple model versions, or different combinations of hyperparameters, to gather data for training GPT-5.
Nah; the paper mentioned above (from a few days ago here on HN) shows how GPT-4 is nondeterministic because the sparse mixture-of-experts technique it uses is nondeterministic with respect to batch positioning.
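A toy sketch of the mechanism, under the assumption (common in sparse-MoE serving) of top-1 routing with per-expert capacity limits; all names and numbers here are hypothetical:

```python
import numpy as np

def route(scores, capacity):
    """Top-1 MoE routing with a per-expert capacity cap.

    scores: (tokens, experts) gating scores. Tokens are processed in batch
    order; once an expert is full, later tokens spill to their next choice.
    """
    n_tokens, n_experts = scores.shape
    load = [0] * n_experts
    assignment = []
    for t in range(n_tokens):
        for e in np.argsort(-scores[t]):       # preferred experts, best first
            if load[e] < capacity:
                load[e] += 1
                assignment.append(int(e))
                break
    return assignment

# The SAME token can land on a different expert depending on its batchmates.
token = [0.9, 0.1]                              # prefers expert 0
batch_a = np.array([token, [0.2, 0.8]])         # batchmate prefers expert 1
batch_b = np.array([[0.95, 0.05], token])       # batchmate also wants expert 0

route(batch_a, capacity=1)   # our token gets expert 0
route(batch_b, capacity=1)   # expert 0 already full -> our token spills to expert 1
```

Since batches are assembled from whatever requests arrive together, which expert processes your token (and hence the exact floating-point result) depends on traffic you can't see or control.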
That article went past my level of expertise, which suggests that "easily" is, as you imply, a matter of perspective. It's possible the current behavior is a result of tradeoffs made for performance or cost. Modifications to make the model deterministic could depend on making unacceptable tradeoffs.