AI term

What is Inference?

The process of running a trained model to generate output. Inference cost and speed (latency) are key practical factors when choosing a model.

This is one of 36 terms in the Benchquill AI glossary. Knowing it helps when you compare AI tools and AI models on price and capability.

More AI terms

All 36 →