What is Inference?
The process of running a trained model to generate output. Inference cost and speed (latency) are key practical factors when choosing a model.
This is one of 36 terms in the Benchquill AI glossary. Knowing it helps when you compare AI tools and AI models on price and capability.