OpenAI Unveils Jalapeño, Its First Custom AI Inference Chip Built With Broadcom

OpenAI has revealed its first custom-built AI inference processor, developed in collaboration with semiconductor giant Broadcom. Named Jalapeño, the chip was designed specifically to handle OpenAI’s inference workloads — the process of running pre-trained AI models in response to live user requests — and was developed with assistance from OpenAI’s own AI models.

While still in testing, early results indicate materially stronger performance-per-watt compared to current alternatives. The chip’s announcement highlighted its efficiency when running real-time coding models, pointing to cost reduction as a primary objective.

The move positions OpenAI alongside Google and Amazon, both of which have developed proprietary AI accelerators to reduce dependence on Nvidia’s GPUs. President Greg Brockman had previously outlined the company’s rationale, describing a focus on identifying specific workloads that existing hardware underserves and building silicon capable of accelerating what those workloads demand.

Credit: OpenAI

OpenAI framed Jalapeño as part of a broader strategy to own the full AI infrastructure stack — spanning chip architecture, memory systems, networking, scheduling, and deployment — so that every layer can be optimised around a single goal: making its models faster, more reliable, and cheaper to run.

More compute-intensive tasks such as pre-training are expected to continue relying on Nvidia hardware for the foreseeable future.

Need Deeper Intelligence on the AI Market?

AI Insider's Market Intelligence platform tracks funding rounds, competitive landscapes, and technology trends across the global AI ecosystem in real time. Get the data and insights your organization needs to make informed decisions.

Related Articles

the google logo is displayed in front of a black background
Google Loses More Top AI Researchers to Anthropic and OpenAI

Google is facing a significant brain drain as leading AI researchers depart for rivals. Jonas Adler and Alexander Pritzel, both key contributors to the development

Enterprises Hit the Brakes on AI Spending as Token Costs Spiral

Major companies are pulling back on employee AI usage after discovering how rapidly unchecked consumption of AI tools can drain budgets with little demonstrable return.

a close up of a one dollar bill
Convey Announces $38M Series A Led by Andreessen Horowitz to Automate Enterprise Operations with AI Teammates

Insider Brief PRESS RELEASE — Convey, the enterprise AI platform that enables non-technical operators to build and manage digital teammates that execute business operations autonomously,

Stay Updated with AI Insider

Get the latest AI funding news, market intelligence, and industry insights delivered to your inbox weekly.

$ 0 M

Seed round tracked

Gitar — Code Validation

Get the Weekly Briefing

Funding analysis, market intelligence, and industry trends delivered to your inbox every week.

Need bespoke intelligence?

Our team combines real-time data with decades of sector experience to guide your decisions.

Subscribe today for the latest news about the AI landscape