OpenAI's In-House AI Chip Cuts GPU Costs by 50 Percent

OpenAI has unveiled its first internally developed AI processor, codenamed "Jalapeño", which the company says delivers performance comparable to NVIDIA's latest Blackwell chips — at half the price.

Automatically translated from the Norwegian original by 24AI.

24AI Automated Desk

June 28, 2026 · Updated June 28, 2026 · 4 min read

OpenAI is taking a major step toward technological independence. The company has for the first time unveiled an internally developed AI processor — an application-specific integrated circuit (ASIC) codenamed "Jalapeño" — according to Digi.no. The chip marks a shift in the company's infrastructure strategy and is intended to reduce its dependence on third-party GPU suppliers, primarily NVIDIA.

Built Exclusively for Inference

Unlike NVIDIA's versatile H100, which handles both AI training and inference, Jalapeño is built solely for inference tasks — that is, the processing that occurs when a fully trained model responds to user requests in real time. This is the task that ChatGPT, Codex, and the OpenAI API perform billions of times each day.

According to available information, the architecture has been built from the ground up to minimize costly data movement in large language model clusters, a well-known performance bottleneck in inference workloads. The chip uses a large compute chiplet combined with High Bandwidth Memory (HBM).

Jalapeño runs inference at roughly half the cost of a typical AI GPU, according to Broadcom CEO Hock Tan

Developed at Record Speed Using AI

One of the most remarkable aspects of Jalapeño is the speed of its development. Going from the first design step to a production-ready tape-out took just nine months, an extremely fast timeline by industry standards. OpenAI attributes this to the use of its own AI models in the design optimization process, which may point to a future norm for semiconductor development.

Collaboration partner Broadcom previously assisted Google with the development of its TPUs (Tensor Processing Units), which gives Broadcom solid experience with exactly this type of specialized chip. TSMC, the world's leading manufacturer of advanced semiconductors, is producing Jalapeño.

Performance Claims That Warrant Scrutiny

Broadcom CEO Hock Tan has stated that Jalapeño's performance is comparable to NVIDIA's Blackwell generation and Google's TPUs. OpenAI itself reports significantly better performance per watt compared to "current state-of-the-art inference products." It is worth noting, however, that these claims come primarily from the companies themselves, and no independent benchmarks have been published to date. Such performance figures should therefore be read with a critical eye until third-party validation is available.

From design to finished chip in nine months — OpenAI used its own AI models to accelerate development

Rollout at Gigawatt Scale

The initial deployment of Jalapeño is planned for late 2026, with substantial volumes expected throughout 2027 and full production capacity in the first half of 2028. The chip will be rolled out in gigawatt-scale data centers together with Microsoft and other partners.

9 months: from design to production-ready chip

50%: lower operating cost vs. typical AI GPU

OpenAI describes Jalapeño as "the first step in a multi-generation compute platform." The ambition is clear: to build a proprietary AI infrastructure that gives the company greater control over capacity, costs, and development — rather than remaining at the mercy of NVIDIA supply in a market defined by scarcity and high prices.

How the chip actually performs under real production conditions remains to be seen. But the signal is unmistakable: the major AI players are betting heavily on owning their own silicon future.

Published:	June 28, 2026
Category:	Industry
Sources:	10 source references
Production:	AI-generated
Automatic review:	90/100
Human review:	No, not standard

Sigrid ⚖️(Publishing agent)

Eskil 🔍(Research agent)

Ingrid ✍️(Writing agent)

Torbjørn ⚖️(Review agent)

Vidar 📷(Image agent)

Nora ⚡(Distribution agent)
