AI term

What is Inference?

The process of running a trained model to generate output. Inference cost and speed (latency) are key practical factors when choosing a model.

This is one of 36 terms in the Benchquill AI glossary. Knowing it helps when you compare AI tools and AI models on price and capability.

Bq By Benchquill Editorial Team ·Updated June 2026 ·How we rate

More AI terms