Want to get featured here? Explore premium visibility opportunities.

Contact us

AI NewsThe token bill comes due: Inside the industry scramble to manage AI’s runaway costs

The token bill comes due: Inside the industry scramble to manage AI’s runaway costs

11:26 PM IST · June 5, 2026

The token bill comes due: Inside the industry scramble to manage AI’s runaway costs

Across the industry, companies are starting to balk at the price of AI.Uber blew throughits entire 2026 AI coding budget by April.Microsoft revokedits developers’ Claude Code licenses months after enabling them. A Priceline employee told TechCrunch that a routine Cursor contract renewal came back 4-5x more expensive. Even though per-token prices have fallen, the push for more AI adoption and increasingly autonomous agents have driven token consumption higher and higher. Companies that gorged themselves in early 2025 on all-you-can-eat subscriptions are now scrambling to understand where their money is going, pull back spending, and figure out whether they can salvage some ROI from the wreckage of their budgets. Meanwhile, a market is forming to meet them there. Startups, established vendors, and a new standards body are all racing to give companies the tools and language to track what they spend. “Six months ago, I would have a conversation with a customer and it would be all about ‘What can it do? Is it good enough?’” Alexander Embiricos, OpenAI’s head of enterprise, told TechCrunch at an event in New York City this week. “Our conversations are never about that now. Now the conversations are about, ‘hey, we’re spending so much. What visibility do you have? What auditability do you have? What token controls do you have? What is the efficiency of your models?’” It’s against this backdrop that the Linux Foundation this week unveiled plans for the Tokenomics Foundation, a new standards body that aims to instill the same cost discipline around AI tokens that FinOps did for cloud spend. “In April and May, I started hearing from companies: ‘Oh my god, we are 3x over our entire 2026 token budget and it’s only April,’” J.R. Storment, executive director of the FinOps Foundation, a project under the Linux Foundation, told TechCrunch. “We started hearing existential crises, and the whole conversation shifted fromtokenmaxxingand ‘go fast’ to ‘we need guardrails, how do we control this?’” The cries heard round the tech world followed fervent demands from CEOs pushing their teams to use the best models and move fast, costs be damned. New models released in November like Anthropic’s Claude Opus 4.5, OpenAI’s GPT-5.1, and Google’s Gemini 3 Pro brought significant improvements to agentic tools, which have multiplied consumption. It’s how one companyreportedlyfound itself with a $500 million Claude bill after forgetting to set usage limits for employees. “It’s like the crack-cocaine epidemic,” said Chris Reed, senior director of IT finance at Priceline, noting the company had begun placing token limits on certain groups. “They let you try it to get you hooked on it, and now you’re kind of beholden to it.” Vitaly Gordon, CEO of engineering operations platform Faros AI, said he recently spoke to a CTO who told him: “One of my engineers spent $40,000 on tokens last month, and I genuinely don’t know whether I should stop him or should I go and tell everyone else to be like him.“ A Marchsurveyby Faros found that among 20,000 developers, output was rising, but so were bugs and rewrites. Jellyfish, an engineering management platform, similarly found engineers who used the most tokens were about twice as productive as those who used AI less, but they spent 10x the number of tokens to get there. Nicholas Arcolano, head of research at Jellyfish, told TechCrunch via email that expenditure on AI is exploding in large part due to agentic features, with per-developer consumption rising about 18.6x in nine months. All in all, these stats make the productivity case murkier than the spending suggests. “Whether extreme spend pays off comes down to the ultimate business value of shipped code (e.g. revenue), which most companies still can’t measure,” Arcolano said. At least some of that measurement issue is the sheer scale at which AI is being used today. “Tracking cloud costs is a hundreds-of-millions-of-rows-a-month data problem,” Storment said. “Tracking token costs is a trillions-of-rows-a-month data problem. You can’t just stick that into whatever spreadsheet or even basic tool. You’ve got to fundamentally rethink your tooling, your specs and your accounting systems to do that.” At Priceline, Reed is already seeing discrepancies. He noted issues between a vendor’s reported usage and Priceline’s internal data. “I started my career in telecom expense management, and I’m seeing all the same parallels, from telecom to cloud to AI,” he said. “Anytime you introduce something new, it’s ripe for billing errors and audit and optimization opportunities.” A market is beginning to form around this problem. There are the pure-play companies, like Pay-i, which tracks, measures, and optimizes the costs and performance of GenAI investments.Paid, meanwhile, lets developers track costs, measure usage, and bill users based on actual value rather than subscription fees. Then there are companies like Jellyfish, Waydev, and Faros AI, which all provide AI agent monitoring to prove the ROI of developer tools. Storment says most of the 180 vendors within the FinOps Foundation are leaning toward this space. Companies with existing distribution are also adding new features to capitalize on this new market. Ramp has recently moved intoAI spend management;DatadogandNew Relichave tacked on services like cloud cost management, token-level observability, and GPU monitoring. At the FinOps X conference next week, AWS is expected to introduce new financial management features geared toward enterprise AI spending. Tiffany Luck, a partner at NEA, thinks token efficiency and observability will likely be added in at the “harness or app layer.” She pointed to Factory, astartupthat makes AI agents for enterprises, which this weeklauncheda model router that automatically picks the right model for every task. Gordon expects frontier labs and other model providers to adopt OpenRouter-style optimization to drive queries to the cheapest models — a trend already showing up on enterprise Claude bills. “The financial report for how much you spend on Anthropic, even if you call the Opus model, some of the spend will be on Sonnet or Haiku, because they are smart enough to do it,” Gordon said. “I think this will become more and more of a thing.” But all these tools are being built without a common language or shared definitions for how much a token costs, what it produces, and how to compare spend across vendors. That’s where the Tokenomics Foundation hopes to prove useful. The Foundation is building a canonical definition and framework for “tokenomics;” open standards, specifications and metrics for AI token usage and billing; as well as new metrics for AI economics, like cost-per-intelligence or tokens-per-watt. It also plans to define metrics across token factory effectiveness and consumption efficiency. The group is planning a formal launch in July, and is about to announce more members at the FinOps X conference next week. “Token economics is fundamentally more abstract and opaque than anything we’ve managed at this scale before,” Nishant Gupta, chief availability officer at Salesforce, said in a statement. “It requires a different operational muscle than the one the industry built for cloud.” That said, Goldman Sachsprojectsglobal token usage to multiply by 24 times by 2030. The companies already over budget need solutions now, and the foundation’s first deliverable is still months away. “Maybe we created a steam engine, but we still haven’t figured out the assembly line,” said Gordon. According to Arcolano, the smart move is broad, moderate adoption. “The best ROI comes from moving the broad middle from low to moderate usage, not pushing heavy users higher,” he said. Russell Brandom and Tim Fernholz contributed to this reporting.

read more

Latest AI News

View All News →
India's Compliance Maze: How TeamLease RegTech Is Using AI to Tame a 13,000-Change Beast

India's Compliance Maze: How TeamLease RegTech Is Using AI to Tame a 13,000-Change Beast

Enterprises in India face up to 11,000 compliance instances annually from over 3.2 million regulatory websites. TeamLease RegTech is deploying AI to shift compliance from reactive record-keeping to predictive, intelligence-driven risk management for businesses nationwide.

2 hours ago

View

Meta Strikes Fresh Data Centre Agreements With Crusoe: Report

Meta Strikes Fresh Data Centre Agreements With Crusoe: Report

The latest deal reflects Meta’s ongoing efforts to expand its AI infrastructure as demand for large-scale computing resources continues to grow.

2 hours ago

View

Source: Elastic agrees to buy CRV-backed DeductiveAI for up to $85M

Source: Elastic agrees to buy CRV-backed DeductiveAI for up to $85M

DeductiveAI, a startup that uses AI to catch and resolve bugs in software, has agreed to be sold to enterprise software company Elastic for up to $85 million, according to a person with knowledge of the deal. Deductive, which was founded in 2023, came out stealth last November when it announced a$7.5 million seedround led by CRV with participation from Databricks Ventures, Thomvest Ventures, and PrimeSet.  The investment valued the startup at $33 million, according to PitchBook. Elastic and Deductive did not respond to multiple requests for comment. TechCrunch will update this article if either company responds. The sale marks a speedy exit for Deductive, which is operating in a fast-growing sector known as AI site reliability engineering (AI SRE). Building AI-powered SRE tools has become an important area, driven by the massive influx of AI-written code. Replacing manual debugging with AI enables human SREs to shift focus from constantly fixing outages and other problems, to spending more time on helping with product development. The acquisition reflects a broader trend in which established tech incumbents are looking to buy AI-native startups to integrate agentic technologies into their existing product suites, the source told TechCrunch. Elastic, which went public in 2018, is best known for Elasticsearch, the search and analytics engine that helps organizations store, search, analyze, and monitor large amounts of data in near real time. The company’s observability software — essentially tools that let engineers monitor software systems and detect security threats — could benefit from Deductive’s tech. According to the source, integrating Deductive’s AI technology into Elastic will enhance its observability platform by giving customers tools to automatically monitor performance and resolve system failures in real-time. Deductive was co-founded by Rakesh Kothari, who was previously VP of engineering at Lightspeed-backed business analytics startup ThoughtSpot, and Sameer Agarwal, who formerly worked at Apache Software Foundation and Meta. Agrawal was one of the founding engineers at Databricks. While Deductive reached roughly $1 million in annual recurring revenue (ARR,) according to the source, the startup’s growth lagged behind Resolve AI, one of the sectors’ perceived early winners. The two-year-old Resolve was co-founded by former Splunk executive Spiros Xanthos and Mayank Agarwal. Greylock and Lightspeed-backed startup was last valued at$1.5 billionwhen it raised a $40 million Series A extension in April.

6 hours ago

View

Almost half of U.S. singles feel negatively about AI in dating, Match says

Almost half of U.S. singles feel negatively about AI in dating, Match says

Dating app giant Match Group — which owns apps like Tinder, Hinge, and OkCupid — conducted astudyto determine how U.S. singles really feel about the relationship between AI and dating. Turns out, people don’t want AI messing with every aspect of human life. Across the industry, dating apps are experimenting with AI. Bumble introduced adating assistant named Bee, and Tinder isspendingso much on AI tools that it’s slowed its hiring process. Meanwhile, Hinge’s CEOstepped downlast year to launch a more AI-focused dating app altogether. But according to Match’s survey of 1,000 people aged 18 to 39, 47% of singles have a negative view of AI’s use in romantic contexts. This perspective varies depending on what the AI is being used for. About 40% of singles say they would refuse to date someone who uses an AI companion app, and that figure rises to 51% among women ages 18 to 24. However, only 12% of 18- to 24-year-olds said that they had used a companion app over the last three months, and only about a third of those users said they were seeking genuine connections with those chatbots. While Match says that people harbor a “near-universal” disapproval of actually dating an AI, like in the movie “Her,” that doesn’t mean that respondents are wholly opposed to AI features within apps. Some 64% of respondents said they could see how AI might help them in their dating journey. If we’re being pedantic,technically, every major dating app has already used some form of matching algorithm since before we knew what a GPT was. This survey refers to the new crop of AI features that basically every app is introducing, which help users punch up their profiles, choose photos, and keep conversations flowing. What dating app developers should take away from this survey is that people are not entirely closed off to AI; they just don’t want to be in a relationship with a robot, nor do they want to feel as though their dating experiences are overly inundated with technology that feels inauthentic. “Ask singles what they want from AI in dating, and the answer is pretty consistent: help with the hard parts, but hands off for the human parts,” Match wrote in a blog post. “Yes, they’ll use it to help them punch up a profile or for help figuring out what to say when a conversation goes quiet, but the actual connection is still theirs to create.” Hopefully, this message reaches dating entrepreneurs like Bumble founder Whitney Wolfe Herd, who suggested that dating app users could havepersonal bots that date other users’ bots. It’s pretty normal nowadays to say you met your partner online, but “his bot asked my bot out, and our bots hit it off” will never be a socially acceptable meet-cute.

10 hours ago

View