New AI supercomputer, the largest in the cloud, to deliver up to 131,072 NVIDIA GPUs to enable customers to build, train, and inference AI at scale
Oracle CloudWorld, Las Vegas—Sep 11, 2024Oracle today announced the first zettascale cloud computing clusters accelerated by the NVIDIA Blackwell platform. Oracle Cloud Infrastructure (OCI) is now taking orders for the largest AI supercomputer in the cloud—available with up to 131,072 NVIDIA Blackwell GPUs.
“We have one of the broadest AI infrastructure offerings and are supporting customers that are running some of the most demanding AI workloads in the cloud,” said Mahesh Thiagarajan, executive vice president, Oracle Cloud Infrastructure. “With Oracle’s distributed cloud, customers have the flexibility to deploy cloud and AI services wherever they choose while preserving the highest levels of data and AI sovereignty.”
OCI is now taking orders for the largest AI supercomputer in the cloud—available with up to 131,072 NVIDIA Blackwell GPUs—delivering an unprecedented 2.4 zettaFLOPS of peak performance. The maximum scale of OCI Supercluster offers more than three times as many GPUs as the Frontier supercomputer and more than six times that of other hyperscalers. OCI Supercluster includes OCI Compute Bare Metal, ultra-low latency RoCEv2 with ConnectX-7 NICs and ConnectX-8 SuperNICs or NVIDIA Quantum-2 InfiniBand-based networks, and a choice of HPC storage.
OCI Superclusters are orderable with OCI Compute powered by either NVIDIA H100 or H200 Tensor Core GPUs or NVIDIA Blackwell GPUs. OCI Superclusters with H100 GPUs can scale up to 16,384 GPUs with up to 65 ExaFLOPS of performance and 13Pb/s of aggregated network throughput. OCI Superclusters with H200 GPUs will scale to 65,536 GPUs with up to 260 ExaFLOPS of performance and 52Pb/s of aggregated network throughput and will be available later this year. OCI Superclusters with NVIDIA GB200 NVL72 liquid-cooled bare-metal instances will use NVLink and NVLink Switch to enable up to 72 Blackwell GPUs to communicate with each other at an aggregate bandwidth of 129.6 TB/s in a single NVLink domain. NVIDIA Blackwell GPUs, available in the first half of 2025, with fifth-generation NVLink, NVLink Switch, and cluster networking will enable seamless GPU-GPU communication in a single cluster.
“As businesses, researchers and nations race to innovate using AI, access to powerful computing clusters and AI software is critical,” said Ian Buck, vice president of Hyperscale and High Performance Computing, NVIDIA. “NVIDIA’s full-stack AI computing platform on Oracle’s broadly distributed cloud will deliver AI compute capabilities at unprecedented scale to advance AI efforts globally and help organizations everywhere accelerate research, development and deployment.”
Customers such as WideLabs and Zoom are leveraging OCI’s high-performing AI infrastructure with powerful security and sovereignty controls.
WideLabs, an applied AI startup in Brazil, is training one of Brazil’s largest LLMs, Amazonia IA, on OCI. They developed bAIgrapher, an application that uses its LLM to generate biographical content based on data collected from patients with Alzheimer’s disease to help them preserve important memories.
WideLabs uses the Oracle Cloud São Paulo Region to run its AI workloads, ensuring that sensitive data remains within country borders. This enables WideLabs to adhere to Brazilian AI sovereignty requirements by being able to control where its AI technology is deployed and operated. WideLabs uses OCI AI infrastructure with NVIDIA H100 GPUs to train its LLMs, as well as Oracle Kubernetes Engine to provision, manage, and operate GPU-accelerated containers across an OCI Supercluster consisting of OCI Compute connected with OCI’s RMDA-based cluster networking.
“OCI AI infrastructure offers us the most efficiency for training and running our LLMs,” said Nelson Leoni, CEO, WideLabs. “OCI’s scale and flexibility is invaluable as we continue to innovate in the healthcare space and other key sectors.”
Zoom, a leading AI-first collaboration platform, is using OCI to provide inference for Zoom AI Companion, the company’s AI personal assistant available at no additional cost. Zoom AI Companion helps users draft emails and chat messages, summarize meetings and chat threads, generate ideas during brainstorms with colleagues, and more. OCI’s data and AI sovereignty capabilities will help Zoom keep customer data locally in region and support AI sovereignty requirements in Saudi Arabia, where OCI’s solution is being rolled out initially.
“Zoom AI Companion is revolutionizing the way organizations work, with cutting-edge generative AI capabilities available at no additional cost with customers’ paid accounts,” said Bo Yan, head of AI, Zoom. “By harnessing OCI’s AI inference capabilities, Zoom is able to deliver accurate results at low latency, empowering users to collaborate seamlessly, communicate effortlessly, and boost productivity, efficiency, and potential like never before.”
Oracle offers integrated suites of applications plus secure, autonomous infrastructure in the Oracle Cloud. For more information about Oracle (NYSE: ORCL), please visit us at www.oracle.com.
CloudWorld is where our customers and partners can see the latest innovations in cloud technology, discover methods for getting the most business value from AI today, and explore ways to increase productivity and efficiency through automation. You’ll learn from experts and your peers who build and use the applications, cloud infrastructure, databases, developer tools, and AI services that help solve complex business challenges in every industry. Join us to develop new skills and see new capabilities in action. Register now at oracle.com/cloudworld or follow the news and conversation at oracle.com/news and linkedin.com/company/oracle.
The preceding is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, timing, and pricing of any features or functionality described for Oracle’s products may change and remains at the sole discretion of Oracle Corporation.
Statements in this article relating to Oracle’s future plans, expectations, beliefs, and intentions are “forward-looking statements” and are subject to material risks and uncertainties. Many factors could affect Oracle’s current expectations and actual results, and could cause actual results to differ materially. A discussion of such factors and other risks that affect Oracle’s business is contained in Oracle’s Securities and Exchange Commission (SEC) filings, including Oracle’s most recent reports on Form 10-K and Form 10-Q under the heading “Risk Factors.” These filings are available on the SEC’s website or on Oracle’s website at oracle.com/investor. All information in this article is current as of September 11, 2024 and Oracle undertakes no duty to update any statement in light of new information or future events.
Many of the products and features described herein remain in various stages and will be offered on a when-and-if-available basis. The statements above are not intended to be, and should not be interpreted as a commitment, promise, or legal obligation, and the development, release, and timing of any features or functionalities described for our products is subject to change and remains at the sole discretion of NVIDIA. NVIDIA will have no liability for failure to deliver or delay in the delivery of any of the products, features or functions set forth herein.
Oracle, Java, MySQL and NetSuite are registered trademarks of Oracle Corporation. NetSuite was the first cloud company—ushering in the new era of cloud computing.
Other company and product names may be trademarks of the respective companies with which they are associated. Features, pricing, availability and specifications are subject to change without notice.