Tech
tech
Jon Keegan

Quit the yapping: New AI technique could cut costs 90% by saying less

A consensus is emerging in AI circles that the way forward involves models that use “chain of reasoning” to get better performance, at the expense of costlier computing resources. This process involves instructing the model to break a problem down into detailed step-by-step instructions. The problem is that these steps can be pretty verbose, and when it comes to AI, more words = more cost.

A new paper from researchers at Zoom shows that using a new technique dubbed “chain of draft,” if you tell a model to simply limit those steps to succinct “drafts” of only five words or so, rather than wordy sentences, not only can you still achieve high performance on responses, but you can cut computing costs by up to 90%.

AI models are priced by the number of “tokens” — or portions of words — that are input and output by the model. For example: OpenAI’s o3-mini “reasoning” model costs $1.10 per million tokens input, and $4.40 per million tokens of output. That may seem cheap, but when you’re processing millions of queries, this can really add up.

“By reducing verbosity and focusing on critical insights, CoD matches or surpasses CoT in accuracy while using as little as only 7.6% of the tokens, significantly reducing cost and latency across various reasoning tasks,” the paper reports.

Translation: it’s faster, cheaper, and sometimes better than chain of thought.

This approach is also notable for its ease of use. You can simply change the prompts you enter to get this benefit. That said, most of the gains were found using larger models like OpenAI’s GPT-4o and Anthropic’s Claude 3.5 Sonnet, while using smaller models resulted in poorer performance.

Go deeper: Here are OpenAI’s 50 Laws of Robotics

A new paper from researchers at Zoom shows that using a new technique dubbed “chain of draft,” if you tell a model to simply limit those steps to succinct “drafts” of only five words or so, rather than wordy sentences, not only can you still achieve high performance on responses, but you can cut computing costs by up to 90%.

AI models are priced by the number of “tokens” — or portions of words — that are input and output by the model. For example: OpenAI’s o3-mini “reasoning” model costs $1.10 per million tokens input, and $4.40 per million tokens of output. That may seem cheap, but when you’re processing millions of queries, this can really add up.

“By reducing verbosity and focusing on critical insights, CoD matches or surpasses CoT in accuracy while using as little as only 7.6% of the tokens, significantly reducing cost and latency across various reasoning tasks,” the paper reports.

Translation: it’s faster, cheaper, and sometimes better than chain of thought.

This approach is also notable for its ease of use. You can simply change the prompts you enter to get this benefit. That said, most of the gains were found using larger models like OpenAI’s GPT-4o and Anthropic’s Claude 3.5 Sonnet, while using smaller models resulted in poorer performance.

Go deeper: Here are OpenAI’s 50 Laws of Robotics

More Tech

See all Tech
tech
Rani Molla

Tesla investors like the idea of merging with SpaceX

Tesla is trading up about 2.5% in early trading Friday after reports Thursday that the Elon Musk-led company was considering a merger with SpaceX, another of Musk’s many companies.

That’s a better showing than the stock’s reaction to its better-than-expected earnings a day earlier, after which shares closed down 3.5%. Acquiring a very valuable, entirely different company, it turns out, is a more attractive prospect than watching an existing one’s revenue and profit decline.

Musk is also reportedly considering merging SpaceX with xAI, his artificial intelligence company, which recently combined with his social media platform, X.

Musk is also reportedly considering merging SpaceX with xAI, his artificial intelligence company, which recently combined with his social media platform, X.

tech
Jon Keegan

WSJ: OpenAI plans Q4 IPO in race to be the first AI startup to enter public markets

OpenAI was the first to the generative-AI market with ChatGPT, and now it hopes to be the first of its AI startup cohort to pull off an initial public offering, according to a report from The Wall Street Journal. The $500 billion startup is in a race against its $350 billion competitor Anthropic, which has also been exploring an IPO.

Per the report, OpenAI is in talks with banks to try for a fourth-quarter IPO this year, which has the potential to be one of the largest IPOs ever in a year that is expected to see many record-breaking tech companies tap into public markets to raise sizable new rounds of capital.

Ahead of a potential public listing, OpenAI is reportedly attempting to raise a massive round of private investment. The company is reportedly aiming to raise $100 billion, with Amazon potentially accounting for up to half of that target. Other investors in talks with OpenAI over the private fundraising round include Nvidia, Microsoft, and SoftBank.

Per the report, OpenAI is in talks with banks to try for a fourth-quarter IPO this year, which has the potential to be one of the largest IPOs ever in a year that is expected to see many record-breaking tech companies tap into public markets to raise sizable new rounds of capital.

Ahead of a potential public listing, OpenAI is reportedly attempting to raise a massive round of private investment. The company is reportedly aiming to raise $100 billion, with Amazon potentially accounting for up to half of that target. Other investors in talks with OpenAI over the private fundraising round include Nvidia, Microsoft, and SoftBank.

tech
Rani Molla

SpaceX is actually considering a merger with Tesla or xAI: Report

Bloomberg reports that Elon Musk’s SpaceX is considering merging with Musk’s Tesla. Earlier today, Reuters had reported that SpaceX was thinking of potentially merging with xAI ahead of SpaceX’s IPO this year.

From Bloomberg:

The firm has discussed the feasibility of a tie-up between SpaceX and Tesla, an idea that some investors are pushing, the people said, asking not to be identified as the information isn’t public. Separately, they are also exploring a tie-up between SpaceX and xAI ahead of an IPO, some of the people said.

Musk’s companies already have numerous relationships between themselves, including most recently Tesla’s $2 billion investment in xAI. At Tesla’s shareholder meeting last year, shareholders voted to invest in the company but the board didn’t approve the measure due to significant abstentions.

In 2024, SpaceX incurred about $2.4 million in expenses under commercial, licensing, and support agreements with Tesla, and Tesla incurred about $800,000 in expenses for Musk’s use of SpaceX’s jet.

From Bloomberg:

The firm has discussed the feasibility of a tie-up between SpaceX and Tesla, an idea that some investors are pushing, the people said, asking not to be identified as the information isn’t public. Separately, they are also exploring a tie-up between SpaceX and xAI ahead of an IPO, some of the people said.

Musk’s companies already have numerous relationships between themselves, including most recently Tesla’s $2 billion investment in xAI. At Tesla’s shareholder meeting last year, shareholders voted to invest in the company but the board didn’t approve the measure due to significant abstentions.

In 2024, SpaceX incurred about $2.4 million in expenses under commercial, licensing, and support agreements with Tesla, and Tesla incurred about $800,000 in expenses for Musk’s use of SpaceX’s jet.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.