OpenAI and Broadcom unveiled Jalapeño, OpenAI’s first custom inference accelerator for large language models. The chip is less about replacing Nvidia overnight than controlling the cost, latency, and supply of the compute that runs products like ChatGPT, Codex, and the API.