The Oracle and NVIDIA partnership lets you develop, customize, and deploy AI with leading performance, scale, and cost efficiency. Build and deliver AI through the broadest set of deployment options that only Oracle Cloud Infrastructure (OCI) and NVIDIA together can provide.
We’re excited to announce the general availability of Oracle Cloud Infrastructure Supercluster with NVIDIA H200 Tensor Core GPUs.
OCI Superclusters, based on NVIDIA's full stack of accelerated computing hardware and software, deliver a powerful platform for training and deploying AI workloads and accelerating data processing. OCI Superclusters offer OCI Compute Bare Metal, an ultralow-latency RDMA over Converged Ethernet cluster based on NVIDIA GPUs and networking, and a choice of storage architectures. Accelerate GenAI workloads on OCI Superclusters of more than 64,000 GPUs with NVIDIA software such as NVIDIA NeMo.
Together, Oracle and NVIDIA offer a complete full-stack platform for running AI workloads anywhere you need them. NVIDIA-powered OCI Superclusters let you train, customize, and deploy state-of-the-art AI models quickly and cost-effectively with NVIDIA software that’s available via the OCI Marketplace through the NVIDIA AI Enterprise private offer.
Optimize inferencing and run AI anywhere with NVIDIA technologies on OCI’s distributed cloud. Use three NVIDIA L4 GPUs in a single edge appliance or scale to an OCI Supercluster in your data center—or to the largest publicly available supercluster in the world, with 65,536 NVIDIA GPUs.
Governments and customers in highly regulated industries can choose from a rich set of deployment models to help satisfy stringent data sovereignty and compliance requirements. NVIDIA AI Enterprise on OCI Supercluster is built from the ground up to help meet security and sovereign AI requirements while providing the best of NVIDIA software and Oracle Cloud AI infrastructure in one solution.
Oracle has collaborated with Cohere to power Oracle Cloud Infrastructure’s generative AI services. Leveraging the performance of OCI to train their models, Cohere is working with Oracle to bring enterprise AI technology to businesses.
MosaicML is a software development provider that offers infrastructure and tools for building large-scale machine learning models to help enterprises extract more value from their data. With OCI’s high-performance AI infrastructure, MosaicML has seen up to 50% faster performance and cost savings of up to 80% compared to other cloud providers.
Modal Labs lets you run data/AI jobs in the cloud by just writing a few lines of Python. Customers use Modal to deploy GenAI models at large scale, fine-tune LLM models, run protein folding simulations, and much more. Modal Labs uses Oracle's bare metal A10 instances because of the unbeatable combination of price and performance.
Headquartered in San Francisco, California, Evidium is a health technology startup that has created a referenced AI platform to give healthcare organizations grounded and trustworthy AI. To power its model training, the company leverages GPUs on OCI for its diverse product line.
Founded by leaders from PyTorch and Meta, Fireworks AI offers the fastest and highest quality platform to serve generative AI models aimed at accelerating product innovation and disruption. The company selected Oracle Cloud Infrastructure to run inferencing and training workloads.
Yurts is a generative AI integration platform on OCI that’s trusted by the world’s most secure organizations. The company offers high-quality attributed outputs, faster time to value, and seamless integration with source-of-truth applications.
Oracle and NVIDIA are expanding access to accelerated AI computing in the cloud so organizations can solve their most complex business challenges.