What is Quantization?
Compressing a model to use less precision and memory, making it cheaper and faster to run with minimal quality loss.
This is one of 36 terms in the Benchquill AI glossary. Knowing it helps when you compare AI tools and AI models on price and capability.