The Full-Stack Sovereignty Doctrine: How OpenAI's Jalapeño Chip Just Cut Inference Costs 50%, And Why Every Business Now Has To Choose Which Layers To Own

Stephen Diaz

Published on: 26/06/2026

On June 24, 2026, OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom AI chip, purpose-built for LLM inference. Broadcom CEO Hock Tan says early lab samples deliver ~50% lower per-token inference cost than current Nvidia GPUs. The chip was designed in 9 months (the fastest ASIC cycle ever), with OpenAI's own models assisting the design. TSMC manufactures it, Celestica handles system integration, and Broadcom supplies the Tomahawk networking. OpenAI spent ~$14B on third-party GPUs to serve ChatGPT in 2025. Initial deployment is set for the end of 2026, with full scale in H1 2028. Microsoft is taking ~40% of the first batch. The effort is backed by the Broadcom-Apollo-Blackstone $35B AI XPV Platform for 20GW+ of compute through 2028. Run The Full-Stack Sovereignty Doctrine 5-question audit on your own stack this week.

AI News & Industry Updates

The Compute Concentration Test: What Anthropic's $200B Google Deal Means for Your Business

Stephen Diaz

Published on: 07/05/2026

Anthropic just committed roughly $200 billion to Google Cloud over five years. Together with OpenAI, that means two AI labs now account for about half of $2 trillion in cloud backlog. Here is The Compute Concentration Test, a 3-question framework every business owner should run on their AI stack this week.

AI News & Industry Updates