This article describes how to build infrastructure for measuring and controlling overly verbose LLM responses.