OpenAI and the American AI semiconductor unicorn Cerebras have formally announced a landmark three-year strategic alliance. Under this agreement, OpenAI will deploy a staggering 750MW (megawatts) of Cerebras’ Wafer-Scale Systems, a move designed to construct the world’s most expansive high-speed AI inference platform. Sources familiar with the matter indicate that the transaction is valued in excess of $10 billion.
This deployment is slated to commence in 2026, progressing through several developmental phases until its anticipated completion by 2028. Cerebras’ architectural philosophy diverges sharply from traditional GPU designs; it is celebrated for its sheer magnitude, boasting the capacity to integrate 4 trillion transistors onto a singular, colossal silicon wafer. By consolidating immense computational power, memory, and bandwidth on a single chip, Cerebras effectively eliminates the bottlenecks that typically impede inference velocity in conventional hardware. Official claims suggest that in terms of large language model responsiveness, the Cerebras system outperforms GPU-based alternatives by a factor of fifteen—a metric of paramount importance to OpenAI. Rapid, real-time responses are essential for augmenting user productivity, increasing platform engagement, and facilitating high-value workloads.
The relationship underpinning this transaction is deeply rooted, as OpenAI CEO Sam Altman is a personal investor in Cerebras. Their collaboration has been gestating since 2017, when both nascent entities frequently engaged in research exchanges and explored potential synergies. Formal negotiations for this accord commenced last autumn, culminating in a signed Letter of Intent prior to Thanksgiving.
Cerebras CEO Andrew Feldman noted that the primary catalyst for this market shift is the extraordinary demand for “rapid computation.” Buoyed by this monumental contract, Cerebras is currently negotiating a $1 billion financing round, which could see its market valuation surge to $220 billion—nearly triple its previous assessment. Although the company withdrew its 2024 initial public offering application, it remains committed to pursuing a listing on the public markets. For OpenAI, this alliance represents a critical element of its broader strategy to diversify its computational supply chain. In its quest for more cost-effective and efficient alternatives to NVIDIA, OpenAI has previously disclosed a partnership with Broadcom for custom silicon development and an agreement to utilize AMD’s MI450 chips to accelerate its computational infrastructure.
Related Posts:
Support Our Threat Intelligence
If you find our CVE report and cybersecurity news helpful, consider supporting our work.