by Charlotte Trueman

Senior Writer

Nvidia unveils new GPU-based platform to fuel generative AI performance

News

Nov 14, 20234 mins

Generative AIHigh-Performance Computing

The new Nvidia HGX H200 has been designed to support the high performance computing workloads required to train generative AI models.

italy-data-center-man-woman-it-specialist-mainframe

Credit: Shutterstock

Nvidia has announced a new AI computing platform called Nvidia HGX H200, a turbocharged version of the company’s Nvidia Hopper architecture powered by its latest GPU offering, the Nvidia H200 Tensor Core.

The company also is teaming up with HPE to offer a supercomputing system, built on the Nvidia Grace Hopper GH200 Superchips, specifically designed for generative AI training.

A surge in enterprise interest in AI has fueled demand for Nvidia GPUs to handle generative AI and high-performance computing workloads. Its latest GPU, the Nvidia H200, is the first to offer HBM3e, high bandwidth memory that is 50% faster than current HBM3, allowing for the delivery of 141GB of memory at 4.8 terabytes per second, providing double the capacity and 2.4 times more bandwidth than its predecessor, the Nvidia A100.

Nvidia unveiled the first HBM3e processor, the GH200 Grace Hopper Superchip platform, in August “to meet [the] surging demand for generative AI,” founder and CEO of Nvidia, Jensen Huang, said at the time.

The introduction of the Nvidia H200 will lead to further performance leaps, the company said in a statement, adding that when compared to its H100 offering, the new architecture will nearly double the inference speed on Meta’s 70 billion-parameter LLM Llama-2. Parameters relate to how neural networks are configured.

“To create intelligence with generative AI and HPC applications, vast amounts of data must be efficiently processed at high speed using large, fast GPU memory,” said Ian Buck, vice president of hyperscale and HPC at Nvidia in a statement accompanying the announcement. “With Nvidia H200, the industry’s leading end-to-end AI supercomputing platform just got faster to solve some of the world’s most important challenges.”

H200-powered systems are expected to start shipping in the second quarter of 2024, with the Nvidia H200 Tensor Core GPU available in HGX H200 server boards with four- and eight-way configurations.

An eight-way HGX H200 provides over 32 petaflops of FP8 deep learning compute and 1.1TB of aggregate high-bandwidth memory for the highest performance in generative AI and HPC applications, Nvidia said.

A petaflop is a measure of performance for a computer that can calculate at least one thousand trillion, or one quadrillion, floating point operations per second. An FP8 is an eight-bit floating point format specification, designed to ease the sharing of deep learning networks between hardware platforms.

The H200 can be deployed in any type of data center, including on premises, cloud, hybrid-cloud and edge and will also be available in the GH200 Grace Hopper Superchip platform.

Nvidia powers new HPE AI training solution with GH200 Grace Hopper Superchips

Two weeks after it was revealed that the UK’s Isambard-AI supercomputer would be built with HPE’s Cray EX supercomputer technology and powered by Nvidia GH200 Grace Hopper Superchips, the two companies have once again teamed up to provide a new supercomputing turnkey system that supports the development of generative AI.

The new system comprises preconfigured and pretested AI and machine learning software, and also includes liquid-cooled supercomputers, accelerated compute, networking, storage, and services. Based on the same architecture as Isambard-AI, the solution will integrate with HPE Cray supercomputing technology and be powered by Nvidia Grace Hopper GH200 Superchips, allowing AI research centers and large enterprises to speed up the training of a model by 2-3 times.

“Together, this solution offers organizations the unprecedented scale and performance required for big AI workloads, such as large language model (LLM) and deep learning recommendation model (DLRM) training,” HPE in a press release.

The system will be generally available in December through HPE in more than 30 countries.

by Charlotte Trueman

Senior Writer

Follow Charlotte Trueman on X

Charlotte Trueman is a staff writer at Computerworld. She joined IDG in 2016 after graduating with a degree in English and American Literature from the University of Kent. Trueman covers collaboration, focusing on videoconferencing, productivity software, future of work and issues around diversity and inclusion in the tech sector.

Show me more

Palo Alto Networks firewall bug being exploited by threat actors: Report

By Howard Solomon

Feb 14, 20253 mins

FirewallsVulnerabilitiesZero-day vulnerability

Nvidia forges healthcare partnerships to advance AI-driven genomics, drug discovery

By Zeus Kerravala

Feb 14, 20256 mins

Networking

Americas

Topics

About

Policies

Our Network

More

Nvidia unveils new GPU-based platform to fuel generative AI performance

The new Nvidia HGX H200 has been designed to support the high performance computing workloads required to train generative AI models.

Nvidia powers new HPE AI training solution with GH200 Grace Hopper Superchips

More from this author

China clears Broadcom’s $69B VMware acquisition, allowing deal to close

ASML boosts chip manufacturing capacity to meet global demand

Japanese government pledges further $13.3B to shore up domestic chip sector

UK to house three new supercomputers by 2025

Nokia to cut 14,000 jobs in an attempt to salvage falling profit

UK regulator launches antitrust probe into Microsoft and Amazon cloud services

UK gov’t announces supercomputer, research facility to drive R&D, innovation

UK competition agency provisionally OKs Broadcom’s $61B VMware acquisition

Show me more

Palo Alto Networks firewall bug being exploited by threat actors: Report

Nvidia forges healthcare partnerships to advance AI-driven genomics, drug discovery

Juniper CEO: 'I am disappointed and somewhat puzzled' by DOJ merger rejection

Has the hype around ‘Internet of Things’ paid off? | Ep. 145

Episode 1: Understanding Cisco’s Converged SDN Transport

Episode 2: Pluggable Optics and the Internet for the Future

How to use the lsblk command

How to use the fdisk command

How to use the du command

Nvidia unveils new GPU-based platform to fuel generative AI performance

The new Nvidia HGX H200 has been designed to support the high performance computing workloads required to train generative AI models.

Nvidia powers new HPE AI training solution with GH200 Grace Hopper Superchips

From our editors straight to your inbox

More from this author

China clears Broadcom’s $69B VMware acquisition, allowing deal to close

ASML boosts chip manufacturing capacity to meet global demand

Japanese government pledges further $13.3B to shore up domestic chip sector

UK to house three new supercomputers by 2025

Nokia to cut 14,000 jobs in an attempt to salvage falling profit

UK regulator launches antitrust probe into Microsoft and Amazon cloud services

UK gov’t announces supercomputer, research facility to drive R&D, innovation

UK competition agency provisionally OKs Broadcom’s $61B VMware acquisition

Show me more

Palo Alto Networks firewall bug being exploited by threat actors: Report

Nvidia forges healthcare partnerships to advance AI-driven genomics, drug discovery

Juniper CEO: 'I am disappointed and somewhat puzzled' by DOJ merger rejection

Has the hype around ‘Internet of Things’ paid off? | Ep. 145

Episode 1: Understanding Cisco’s Converged SDN Transport

Episode 2: Pluggable Optics and the Internet for the Future

How to use the lsblk command

How to use the fdisk command

How to use the du command