In a recent post on social media platform X, Coinbase CEO Brian Armstrong shared crucial insights into how the company plans to manage the rising costs associated with artificial intelligence while continuing to maximize token utilization. Highlighting five key strategies, Armstrong emphasized the importance of balancing innovation with efficiency as AI continues to transform operational practices.
Armstrong’s first strategy involves selecting superior default large language models (LLMs) that engineers use for their initial prompts. Notably, he indicated that Coinbase is testing Chinese LLMs, which are generally more affordable than those developed by prominent American AI firms such as Anthropic and OpenAI. He pointed out the organization’s move toward adopting models like GLM 5.2 and Kimi 2.7, created by Chinese companies Z.ai and Moonshot AI, respectively. This pivot aims to promote cost-effectiveness without compromising the quality of AI outputs.
The second strategy Armstrong discussed revisits a concept he introduced earlier in June: efficiently routing prompts to the most appropriate models based on their complexity. He elaborated that certain tasks may warrant the use of advanced models for initial planning phases, while simpler tasks could benefit from less sophisticated models to avoid unnecessary expenditure. Armstrong expressed optimism that AI technologies could automate the model selection process, thus relieving engineers from making these decisions manually.
In addition to optimizing model choices, Armstrong emphasized the need for better caching techniques to help reduce inference costs. His fourth recommendation encourages engineers to maintain a lean context by starting new sessions when changing tasks, thereby improving computational efficiency. Lastly, he outlined a strategy to enhance visibility into AI-related expenditures, encouraging engineers to use tokens freely while fostering awareness of their consumption patterns. With this approach, Coinbase seeks to drive more impactful contributions from employees who engage more with AI resources.
Accompanying his insights was a graph illustrating the trajectory of token usage and AI spending within the company. Although it did not specify a timeline, the graph revealed a significant surge in token usage, which has soared to historic highs, while associated AI spending has notably decreased, nearly halving from its peak. Armstrong clarified that the aim is not to stifle usage but to build an infrastructure that supports sustained exponential growth.
His remarks come on the heels of Coinbase’s recent decision to lay off 14% of its workforce, a move largely attributed to shifts in operational dynamics influenced by AI technologies. Reflecting on this evolution, Armstrong noted that in the past year, the efficiency gains achieved by engineers leveraging AI have transformed project timelines, enabling teams to accomplish in days what previously required weeks. This shift underscores a broader trend within the industry, moving away from the fleeting tokenmaxxing practices toward implementing strategic usage caps to better manage resources amid climbing operational demands.



