
New method lowers cost of reasoning models

Reasoning models already exist, but they come with a significant drawback: because problems are broken down and handled in separate blocks, their cost rises quickly. Researchers have found a new method to impose a budget constraint on the model without sacrificing quality.

Researchers at Carnegie Mellon University in the United States have found a new technique for reducing the cost of reasoning models. However, the method must be applied during model development.

LCPO

Developers of AI models can use the technique of length controlled policy optimization (LCPO) to shorten the reasoning of these LLMs. The strength of reasoning models, however, lies in thinking longer and treating different parts of the information separately, so the research sounds counterproductive. According to the researchers, though, response quality does not drop to the level of LLMs that skip the reasoning step.

The LLM’s thinking is constrained by giving the model a maximum number of tokens in which to find the answer. A correct answer that uses too many tokens results in a penalty. The model must then come up with a new reasoning plan that fits within the given token budget.
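To make the idea concrete, below is a minimal sketch of a length-penalized reward signal in that spirit. It is not the researchers’ actual LCPO implementation; the penalty weight `ALPHA`, the function name, and the exact reward shape are assumptions chosen for illustration.

```python
# Illustrative sketch of a length-penalized reward in the spirit of LCPO.
# The exact reward formulation and hyperparameters used in the CMU study may
# differ; ALPHA and the function name are assumptions for this example.

ALPHA = 0.003  # assumed penalty weight per token over the budget


def length_controlled_reward(is_correct: bool, tokens_used: int, token_budget: int) -> float:
    """Reward a correct answer, but penalize exceeding the token budget.

    During RL fine-tuning, a signal like this pushes the model toward
    reasoning traces that fit within the budget given in the prompt.
    """
    correctness = 1.0 if is_correct else 0.0
    overshoot = max(0, tokens_used - token_budget)
    return correctness - ALPHA * overshoot


# Example: a correct answer that overshoots a 512-token budget by 200 tokens
print(length_controlled_reward(True, tokens_used=712, token_budget=512))  # 0.4
print(length_controlled_reward(True, tokens_used=480, token_budget=512))  # 1.0
```

In this sketch, a correct but overly long answer earns less reward than a correct answer that stays within budget, which is the pressure that forces the model to find a shorter reasoning plan.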

For the study, the models L1-Max and L1-Exact were created. These reasoning models contain 1.5 billion parameters. “To our knowledge, this is the first demonstration that a 1.5B model can outperform frontier models such as GPT-4o, despite using the same generation length,” the researchers write. The gain is two percent.

Tip! Claude 3.7 Sonnet offers as much reasoning as you want