Empirical research on AI inference cost optimisation. Benchmarks, methodology, and production findings.
May 2026
How four production frontier models perform under batch, compression, and output-cap levers — 1,280 scored responses across four task categories.