The companies are extending their AI partnership, and one key initiative is a supercomputer that will be integrated with AWS services and used by Nvidia’s own R&D teams.

Amazon Web Services and Nvidia have announced an expansion of their alliance that includes plans to add supercomputing capabilities to AWS’s artificial intelligence (AI) infrastructure. The companies made the announcement at the AWS re:Invent conference in Las Vegas.

The biggest of the initiatives is Project Ceiba, a supercomputer that will be hosted by AWS for Nvidia’s own research and development teams. It will feature 16,384 Nvidia GH200 Superchips and deliver 65 exaflops of AI processing power, the companies said.

The Project Ceiba supercomputer will be integrated with a number of AWS services, including Amazon Virtual Private Cloud (VPC) encrypted networking and Amazon Elastic Block Store (EBS) high-performance block storage.

Nvidia plans to use the supercomputer for research and development to advance AI for large language models (LLMs), graphics and simulation, digital biology, robotics, self-driving cars, Earth-2 climate prediction, and more.
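As a back-of-the-envelope check on those figures, dividing the claimed 65 exaflops across 16,384 Superchips implies roughly 4 petaflops per chip, which lines up with low-precision AI throughput (FP8 with sparsity) rather than traditional FP64 supercomputer performance. A minimal sketch of the arithmetic:

```python
# Back-of-the-envelope check of the Project Ceiba figures from the announcement.
# Assumes "65 exaflops of AI" refers to aggregate low-precision (e.g. FP8)
# throughput, which is how AI supercomputer peaks are typically quoted.

TOTAL_AI_FLOPS = 65e18    # 65 exaflops, per the AWS/Nvidia announcement
NUM_SUPERCHIPS = 16_384   # GH200 Grace Hopper Superchips in Project Ceiba

per_chip_flops = TOTAL_AI_FLOPS / NUM_SUPERCHIPS
print(f"Implied per-chip throughput: {per_chip_flops / 1e15:.2f} petaflops")
# -> ~3.97 petaflops per chip, consistent with GH200's published FP8
#    (sparse) peak rather than its FP64 rate.
```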
New Amazon EC2 G6e instances featuring Nvidia L40S GPUs and G6 instances powered by L4 GPUs are also in the works, AWS announced. The L4 is scaled back from the Hopper H100 but is far more power-efficient. The new instances are aimed at startups, enterprises, and researchers looking to experiment with AI.

Nvidia also shared plans to integrate its NeMo Retriever microservice into AWS to help users develop generative AI tools such as chatbots. NeMo Retriever is a generative AI microservice that lets enterprises connect custom LLMs to enterprise data, so that responses are grounded in a company’s own information.

“Generative AI is transforming cloud workloads and putting accelerated computing at the foundation of diverse content generation,” said Jensen Huang, founder and CEO of Nvidia, in a statement. “Driven by a common mission to deliver cost-effective, state-of-the-art generative AI to every customer, Nvidia and AWS are collaborating across the entire computing stack, spanning AI infrastructure, acceleration libraries, foundation models, and generative AI services.”

In other news, AWS will be the first cloud provider to bring Nvidia’s GH200 Grace Hopper Superchips to the cloud. The Nvidia GH200 NVL32 multi-node platform connects 32 Grace Hopper Superchips via Nvidia’s NVLink and NVSwitch interconnects. The platform will be available on Amazon Elastic Compute Cloud (EC2) instances connected with Amazon’s networking, virtualization (AWS Nitro System), and hyperscale clustering (Amazon EC2 UltraClusters).

AWS will also host Nvidia’s DGX Cloud, a cluster of GPUs for AI training. DGX Cloud on AWS will accelerate training of generative AI models and LLMs that reach beyond 1 trillion parameters.
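To put that trillion-parameter figure in perspective, the weights of such a model alone occupy about 2 TB at 16-bit precision, before gradients, optimizer state, and activations are counted, which is why training at this scale demands multi-node GPU clusters. A rough sketch of the arithmetic, assuming FP16/BF16 weights and Adam-style optimizer state (assumptions not taken from the article):

```python
# Rough memory footprint for training a 1-trillion-parameter model.
# Assumptions (illustrative, not from the article): 2-byte (FP16/BF16)
# weights, and ~16 bytes of training state per parameter (FP16 weights
# + FP16 gradients + FP32 master weights and two Adam moments).

PARAMS = 1e12                      # "beyond 1 trillion parameters"

weights_tb = PARAMS * 2 / 1e12     # 2 bytes per parameter
train_state_tb = PARAMS * 16 / 1e12

print(f"Weights alone:       ~{weights_tb:.0f} TB")
print(f"With training state: ~{train_state_tb:.0f} TB")
# -> ~2 TB of weights and on the order of ~16 TB of training state,
#    far beyond any single GPU's memory.
```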
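For developers who want to try the G6e and G6 instances described earlier once they launch, provisioning should work like any other EC2 instance type. Below is a minimal boto3 sketch; the instance-type string and AMI ID are placeholders, since AWS had not yet published instance sizes at announcement time:

```python
# Minimal sketch: launching one of the announced GPU instances with boto3.
# "g6e.xlarge" and the AMI ID are illustrative placeholders -- check the EC2
# console or `aws ec2 describe-instance-types` for the real names and sizes.

import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

response = ec2.run_instances(
    ImageId="ami-0123456789abcdef0",  # placeholder: substitute a Deep Learning AMI ID
    InstanceType="g6e.xlarge",        # hypothetical G6e (L40S-backed) size
    MinCount=1,
    MaxCount=1,
)
print(response["Instances"][0]["InstanceId"])
```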
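Finally, the pattern behind NeMo Retriever, mentioned above, is retrieval-augmented generation (RAG): embed a query, fetch the most relevant enterprise documents, and feed them to the LLM alongside the question. The toy sketch below illustrates that flow only; the bag-of-words “embedding” is a stand-in and does not reflect NeMo Retriever’s actual API:

```python
# Toy retrieval-augmented generation (RAG) loop, illustrating the pattern
# NeMo Retriever packages as a managed microservice. The bag-of-words
# "embedding" and prompt format here are illustrative, not Nvidia's API.

from collections import Counter
import math

def embed(text: str) -> Counter:
    """Toy embedding: lowercase bag-of-words counts."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

documents = [
    "Refunds are processed within 5 business days of approval.",
    "Enterprise support tickets are answered within 4 hours.",
]

def retrieve(query: str) -> str:
    """Return the stored document most similar to the query."""
    q = embed(query)
    return max(documents, key=lambda d: cosine(q, embed(d)))

query = "How long do refunds take?"
context = retrieve(query)
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
print(prompt)  # this grounded prompt would then be sent to the custom LLM
```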