Open main menu
Article
Quizzes
Tools
EN
Article
Quizzes
Tools
All quizzes
/
LLM Fundamentals
/
Why might ...
Why might an LLM produce slightly different outputs even with temperature=0?
Floating-point non-determinism in GPU hardware means identical inputs can produce subtly different probability values
The model randomly samples from the top-2 tokens even at temperature=0
Temperature=0 is actually treated as temperature=0.01 by the API
The system prompt changes between requests, altering the output
Submit answers