AWS re:Invent: An Arm-based CPU for high-performance computing, a networking chip that doubles performance of its predecessor, and a TCP protocol replacement to speed up networks in the AWS cloud. Credit: CIS Amazon Web Services has introduced a new CPU customized for high-performance computing (HPC) and the next generation of its Nitro smart networking chip, plus instances that take full advantage of the hardware. The Arm-based CPU is called the Graviton3E and has been optimized for floating point math, key in HPC, the company announced at AWS re:Invent conference. Amazon said Hpc7g instances powered by the new Graviton3E chips offer up to double the floating point and vector performance compared to the current generation of instances. The vast datasets that accompany HPC need to be moved around, so Amazon also introduced the fifth generation of its Nitro smartNICs, offering up to twice the network bandwidth and up to 50% higher packet processing-per-second performance compared to current generation networking-optimized instances. Nitro v5 has about twice the transistors as the previous generation Nitro chip, which gives it about twice the computational power, said Peter DeSantis, senior vice president of AWS utility computing during his keynote. It also has 50% more memory bandwidth and a PCIe adapter that provides about twice the bandwidth and supports a 60% higher packet-per-second rate with a 30% reduction in packet latency. Accompanying the new chip is a new Elastic Compute Cloud (EC2) instance, the networking optimized C7G. The C7G uses the Graviton 3 processor and is designed to deliver optimized networking performance for the most network-intensive workloads. It offers 200Gbps. of throughput and up to 50% higher packet-processing performance over the previous network optimized instance. Replacing TCP To accelerate data movement for HPC workloads, AWS created the elastic fabric adapter (EFA) for scalable, high-speed inter-node communication. As part of EFA is another AWS creation, Scalable Reliable Datagram (SRD), an alternative to the widely used TCP Ethernet protocol. DeSantis said AWS internal networking relies on multipath routing, but creaky old TCP uses a single path and isn’t too good at noting if performance is compromised. TCP also transmits packets in order, which can cause latency. SRD makes use of multipath routing so doesn’t transmit packets to arrive order, but it reorders the packets at the receiving end. DeSantis said it will retransmit dropped packets “in microseconds, not milliseconds” and speed up networks hosted on the AWS cloud. And now all of AWS will gain use of SRD, because AWS has built a new version of the network driver offered with EC2 instances called ENA Express to offer native SRD support. So SRD is being rolled out site-wide. Related content news High-bandwidth memory nearly sold out until 2026 While it might be tempting to blame Nvidia for the shortage of HBM, it’s not alone in driving high-performance computing and demand for the memory HPC requires. By Andy Patrizio May 13, 2024 3 mins CPUs and Processors High-Performance Computing Data Center news CHIPS Act to fund $285 million for semiconductor digital twins Plans call for building an institute to develop digital twins for semiconductor manufacturing and share resources among chip developers. By Andy Patrizio May 10, 2024 3 mins CPUs and Processors Data Center news HPE launches storage system for HPC and AI clusters The HPE Cray Storage Systems C500 is tuned to avoid I/O bottlenecks and offers a lower entry price than Cray systems designed for top supercomputers. By Andy Patrizio May 07, 2024 3 mins Supercomputers Enterprise Storage Data Center news Lenovo ships all-AMD AI systems New systems are designed to support generative AI and on-prem Azure. By Andy Patrizio Apr 30, 2024 3 mins CPUs and Processors Data Center PODCASTS VIDEOS RESOURCES EVENTS NEWSLETTERS Newsletter Promo Module Test Description for newsletter promo module. Please enter a valid email address Subscribe