OpenAI has unveiled ‘Jalapeño’, a custom chip specialized for LLM inference. By cutting costs by 50% compared to existing GPUs, it is expected to accelerate the democratization of AI services.
Imagine a world where the ChatGPT we use every day answers faster, more cheaply, and more intelligently. Until now, AI has relied on general-purpose Graphics Processing Units (GPUs), the core components that handle computer graphics and data, to process massive amounts of data. It was like cooking every dish in the world in one giant pot. OpenAI has now decided to change this cooking method with its own custom AI chip, ‘Jalapeño’. (Source: OpenAI unveils its first custom chip, built by Broadcom)
OpenAI and semiconductor design firm Broadcom unveiled ‘Jalapeño’, their first jointly designed custom AI processor, on the 24th. (Source: OpenAI unveils its first custom chip, built by Broadcom) This is an attempt not just to build a faster chip, but to fundamentally reshape how AI services operate. (Source: OpenAI and Broadcom unveil LLM-optimized inference chip)
Why Is This Important?
For everyday users, the most tangible change will be the cost-effectiveness of AI services. Running AI is currently enormously expensive. Industry estimates suggest that building a 1-gigawatt large-scale data center (a massive computer warehouse for AI operations) costs about $50 billion, with approximately $35 billion of that, or about 70%, going to chips alone. (Source: OpenAI and Broadcom announce first custom AI chip, in strike at nvidia)
If the cost of operating the AI apps we use decreases, companies can offer their services more cheaply, and AI will reach deeper into every corner of our daily lives. Jalapeño is reported to reduce costs by 50% compared to existing general-purpose GPUs. (Source: OpenAI Unveils Jalapeño — Its First AI Chip, Built With Broadcom) When costs drop, complex AI agent services that we can currently only imagine will be able to reach us more easily. (Source: OpenAI Unveils Jalapeño, Its First Custom AI Chip Built With Broadcom)
To put it simply, if a general-purpose GPU is an all-around driver who can handle cars, motorcycles, trucks, and even boats, Jalapeño is a specialized high-speed train built solely to carry the ‘freight of data’ as efficiently as possible. As a result, AI services should run far more economically.
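The cost figures above can be turned into a quick back-of-the-envelope calculation. The short Python sketch below uses only the two estimates quoted in the article ($50 billion total, $35 billion for chips). The article does not say which part of the bill the 50% figure applies to, so the constant `ASSUMED_CHIP_SAVING` and the idea that the saving applies to chip spending only are assumptions made purely for illustration.

```python
# Figures quoted in the article (industry estimates, in billions of USD).
DATA_CENTER_COST_B = 50.0  # building a 1-gigawatt data center
CHIP_COST_B = 35.0         # portion of that spent on chips

# ASSUMPTION (not from the article): the reported 50% saving is applied
# to chip spending only, and every other cost stays the same.
ASSUMED_CHIP_SAVING = 0.50


def chip_share(total_b: float, chips_b: float) -> float:
    """Fraction of the build cost that goes to chips."""
    return chips_b / total_b


def total_after_saving(total_b: float, chips_b: float, saving: float) -> float:
    """Total build cost if only the chip portion shrinks by `saving`."""
    return total_b - chips_b * saving


share = chip_share(DATA_CENTER_COST_B, CHIP_COST_B)
new_total = total_after_saving(DATA_CENTER_COST_B, CHIP_COST_B, ASSUMED_CHIP_SAVING)
overall_cut = 1 - new_total / DATA_CENTER_COST_B

print(f"Chip share of build cost: {share:.0%}")
print(f"Hypothetical total after saving: ${new_total:.1f}B")
print(f"Hypothetical overall reduction: {overall_cut:.0%}")
```

Under that assumption, halving the chip bill would lower the total by roughly a third rather than by half, because the remaining costs (buildings, power, networking) are untouched. Real savings would depend on details the article does not give.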
Understanding It Better: Why a ‘Dedicated Chip’?
To understand Jalapeño, you must first know the difference between ‘general-purpose chips’ and ‘custom chips’.
A general-purpose GPU is like an ‘honor student’ who must be good at math, science, language, and art. It does everything reasonably well, but it is hard to optimize it fully for any one task. Jalapeño, on the other hand, is an ‘expert’ that scores 100 in one specific subject: ‘LLM inference’ (the process by which a trained AI generates an answer to a question). (Source: OpenAI unveils first custom AI inference chip, Jalapeño, with Broadcom — and its development was sped-up with OpenAI’s own models)
In particular, OpenAI designed this chip from a ‘blank sheet of paper’. (Source: OpenAI Unveils Jalapeño, Its First Custom AI Chip Built With Broadcom) Interestingly, OpenAI drastically shortened the development time by using its own artificial intelligence models in the chip design process. (Source: OpenAI unveils first custom AI inference chip, Jalapeño, with Broadcom — and its development was sped-up with OpenAI’s own models) A virtuous cycle has begun, in which AI helps design the chips it runs on.
Current Status
Jalapeño is more than a standalone chip. Broadcom and Celestica are collaborating to integrate it into actual data center server racks and network systems. (Source: OpenAI, Broadcom unveil first AI inference chip)
This chip is slated to become the core engine powering ChatGPT, Codex (code-writing AI), the OpenAI API, and future AI agents yet to emerge. (Source: OpenAI Unveils Jalapeño, Its First Custom AI Chip Built With Broadcom) OpenAI and Broadcom began collaborating on this chip about 18 months ago, and full deployment is expected to begin late next year. (Source: OpenAI and Broadcom announce first custom AI chip, in strike at nvidia)
What Happens Next?
The arrival of Jalapeño shows that major AI companies are reducing their dependence on general-purpose hardware and strengthening ‘vertical integration’ (managing everything from semiconductor design to services directly).
What readers should keep an eye on is how quickly this chip is rolled out to large-scale data centers. If Jalapeño is deployed in earnest starting next year, AI services are likely to respond faster, and the cost burden we feel when using AI is likely to be much lower than it is now. The future Jalapeño points to is one in which AI moves beyond being a premium technology for a few and becomes an affordable, essential tool in our daily lives.
References
- OpenAI and Broadcom unveil LLM-optimized inference chip
- OpenAI unveils its first custom chip, built by Broadcom
- OpenAI unveils first chip as part of Broadcom deal in effort
- OpenAI just announced its first custom chip to help ChatGPT
- OpenAI Unveils Jalapeño, Its First Custom AI Chip Built With
- OpenAI Unveils Jalapeño — Its First AI Chip, Built With
- [OpenAI, Broadcom unveil first AI inference chip](https://www.constellationr.com/insights/news/openai-broadcom-unveil-first-ai-inference-chip) - Constellation Research
- OpenAI Reveals Its First AI Chip: Jalapeño - Gadget Review
- [OpenAI unveils first custom AI inference chip, Jalapeño, with Broadcom — and its development was sped-up with OpenAI’s own models](https://venturebeat.com/infrastructure/openai-unveils-first-custom-ai-inference-chip-jalapeno-with-broadcom-and-its-development-was-sped-up-with-openais-own-models) - VentureBeat
- OpenAI and Broadcom announce first custom AI chip, in strike at nvidia
- [OpenAI, Broadcom join forces on AI chips](https://cybernews.com/ai-news/openai-broadcom-build-first-ai-processor-chip-deal/) - Cybernews
- OpenAI partners with Broadcom custom AI chips alongside