Cloud GPU provider sees AMD chips now as a faster alternative to Nvidia chips tomorrow. Credit: Shutterstock In the competitive realm of artificial intelligence, dominated by Nvidia’s formidable GPUs, a bold newcomer is betting on a different contender. Nscale, a little-known GPU cloud provider based in Norway, has launched a new AI Cloud Services platform built around AMD’s MI300X accelerators. The move is a bold gambit to challenge Nvidia’s dominance and offer customers a potentially cheaper and greener alternative for running AI workloads in the cloud. For buyers of cloud services and builders of AI models, Nscale’s embrace of AMD could signal a small shift in the competitive landscape. “AMD may offer a cost advantage over Nvidia,” said University of Pennsylvania engineering professor Benjamin Lee. “Data center operators and their clients optimize the total cost of ownership, which includes both hardware purchases costs and operating energy costs, and AMD’s hardware costs may be lower than Nvidia’s. AMD may offer competitive or comparable performance for inference, the computation required when a trained model responds to a user prompt.” Nscale’s vertical integration, spanning from its modular data centers to its high-performance compute clusters and its use of natural cooling solutions and low-cost renewable power, could potentially result in customer cost savings. However, the company must provide evidence of significant cost savings and performance advantages to attract buyers away from established providers. Challenges and Opportunities in the AI Infrastructure Market For AI model builders and cloud service buyers, Nscale’s offering represents a new option in the market. However, observers say the AMD MI300X accelerators and ROCm open software ecosystem have yet to establish themselves as a widely adopted alternative to Nvidia’s offerings regarding performance, ease of use, and developer support. Lee also discussed the implications for various stakeholders in the AI industry. “Developers must understand more precisely the nature of the AI computation. General data processing, training, and inference are different types of computation that stress different types of hardware in a data center server. Increasingly, developers might use one type of server for model training and another for inference,” he said. “In addition to AMD’s offering, AI developers will increasingly see data centers deploy other custom chips for inference from Intel, Microsoft, Google, Meta, and others.” When asked how this move differs from other AI-focused computing infrastructures provided by hyperscalers like Azure, AWS, or GCP, Lee pointed to AMD’s long-standing efforts in creating and popularizing the ROCm software ecosystem. “Whether AMD’s chips will gain traction depends on whether ROCm provides sufficient support for inference computations compared to hyperscaler alternatives,” he noted. Olivier Blanchard, Research Director from The Futurum Group, suggested several factors that may have influenced Nscale’s decision to work with AMD. “Nscale already has a good working relationship with AMD and decided to strengthen it by choosing their GPUs over NVIDIA’s,” he explained. Additionally, Blanchard pointed out that there might be a cost-benefit, as “NVIDIA GPUs tend to price high.” Supply chain considerations could also have played a role in Nscale’s choice. “It could also be a supply chain decision: Nvidia GPUs can come with six-month lead times, and high demand could create supply bottlenecks. It is possible that AMD can provide better lead times and a lower risk of supply chain disruptions,” Blanchard noted. Related content how-to Compressing files using the zip command on Linux The zip command lets you compress files to preserve them or back them up, and you can require a password to extract the contents of a zip file. By Sandra Henry-Stocker May 13, 2024 4 mins Linux news High-bandwidth memory nearly sold out until 2026 While it might be tempting to blame Nvidia for the shortage of HBM, it’s not alone in driving high-performance computing and demand for the memory HPC requires. By Andy Patrizio May 13, 2024 3 mins CPUs and Processors High-Performance Computing Data Center opinion NSA, FBI warn of email spoofing threat Email spoofing is acknowledged by experts as a very credible threat. By Sandra Henry-Stocker May 13, 2024 3 mins Linux how-to Download our SASE and SSE enterprise buyer’s guide From the editors of Network World, this enterprise buyer’s guide helps network and security IT staff understand what Secure Access Service Edge (SASE) and Secure Service Edge) SSE can do for their organizations and how to choose the right solut By Neal Weinberg May 13, 2024 1 min SASE Remote Access Security Network Security PODCASTS VIDEOS RESOURCES EVENTS NEWSLETTERS Newsletter Promo Module Test Description for newsletter promo module. Please enter a valid email address Subscribe