by Andy Patrizio

HPE to ship a dedicated inference server for the edge

News Analysis

Aug 04, 20223 mins

The small form factor HPE Edgeline EL8000 is designed for AI tasks such as computer vision and natural-language processing.

Later this month, HP Enterprise will ship what looks to be the first server aimed specifically at AI inferencing for machine learning.

Machine learning is a two-part process, training and inferencing. Training is usign powerful GPUs from Nvidia and AMD or other high-performance chips to “teach” the AI system what to look for, such as image recognition.

Inference answers if the subject is a match for trained models. A GPU is overkill for that task, and a much lower power processor can be used.

Enter Qualcomm’s Cloud AI100 chip, which is designed for artificial intelligence on the edge. It has up to 16 “AI cores” and supports FP16, INT8, INT16, FP32 data formats, all of which are used in inferencing. These are not custom Arm processors, they are entirely new SoCs designed for inferencing.

The AI100 is a part of the HPE Edgeline EL8000 edge gateway system that integrates compute, storage, and management in a single edge device. Inference workloads are often larger in scale and often require low-latency and high-throughput to enable real-time results.

The HPE Edgeline EL8000 is a 5U system that supports up to four independent server blades clustered using dual-redundant chassis-integrated switches. Its little brother, the HPE Edgeline EL8000t is a 2U design supports two independent server blades.

In addition to performance, Cloud AI100 has a low power draw. It comes in two form factors, a PCI Express card and dual M.2 chips mounted on the motherboard. The PCIe card has a 75 watt power envelope while the two M.2 form factor units draw either 15 watts or 25 watts. A typical CPU is draws more than 200 watts, and a GPU over 400 watts.

Qualcomm says Cloud AI 100 supports all key industry-standard model formats including ONNX, TensorFlow, PyTorch, and Caffe that can be imported and prepared from pre-trained models that can be compiled and optimized for deployment. Qualcomm has a set of tools for model porting and preparation including support for custom operations.

Qualcomm says the Cloud AI100 is targeting manufacturing/industrial customers, as well as those with edge AI requirements. Use cases for AI inference computing at the edge include computer vision and natural language processing (NLP) workloads.

For computer vision, this could include quality control and quality assurance in manufacturing, object detection and video surveillance, and loss prevention and detection. For NLP it ncludes programming-code generation, smart assistant operations, and language translation.

Edgeline servers will be available for purchase or lease through HPE GreenLake later this month.

by Andy Patrizio

Andy Patrizio is a freelance journalist based in southern California who has covered the computer industry for 20 years and has built every x86 PC he’s ever owned, laptops not included.

The opinions expressed in this blog are those of the author and do not necessarily represent those of ITworld, Network World, its parent, subsidiary or affiliated companies.

Show me more

Palo Alto Networks firewall bug being exploited by threat actors: Report

By Howard Solomon

Feb 14, 20253 mins

FirewallsVulnerabilitiesZero-day vulnerability

Nvidia forges healthcare partnerships to advance AI-driven genomics, drug discovery

By Zeus Kerravala

Feb 14, 20256 mins

Networking

Americas

Topics

About

Policies

Our Network

More

HPE to ship a dedicated inference server for the edge

The small form factor HPE Edgeline EL8000 is designed for AI tasks such as computer vision and natural-language processing.

More from this author

Nvidia partners with cybersecurity vendors for real-time monitoring

FPGAs lose luster in genAI era

Nvidia claims near 50% boost in AI storage speed

Taiwan chip tariff would raise industry costs, analysts say

Verizon brings AI suite to enterprise infrastructure customers

More questions than answers around Trump’s Stargate AI plans

What Intel needs to do to get its mojo back

Oracle updates Exadata systems to speed database operations

Show me more

Palo Alto Networks firewall bug being exploited by threat actors: Report

Nvidia forges healthcare partnerships to advance AI-driven genomics, drug discovery

Juniper CEO: 'I am disappointed and somewhat puzzled' by DOJ merger rejection

Has the hype around ‘Internet of Things’ paid off? | Ep. 145

Episode 1: Understanding Cisco’s Converged SDN Transport

Episode 2: Pluggable Optics and the Internet for the Future

How to use the lsblk command

How to use the fdisk command

How to use the du command

HPE to ship a dedicated inference server for the edge

The small form factor HPE Edgeline EL8000 is designed for AI tasks such as computer vision and natural-language processing.

From our editors straight to your inbox

More from this author

Nvidia partners with cybersecurity vendors for real-time monitoring

FPGAs lose luster in genAI era

Nvidia claims near 50% boost in AI storage speed

Taiwan chip tariff would raise industry costs, analysts say

Verizon brings AI suite to enterprise infrastructure customers

More questions than answers around Trump’s Stargate AI plans

What Intel needs to do to get its mojo back

Oracle updates Exadata systems to speed database operations

Show me more

Palo Alto Networks firewall bug being exploited by threat actors: Report

Nvidia forges healthcare partnerships to advance AI-driven genomics, drug discovery

Juniper CEO: 'I am disappointed and somewhat puzzled' by DOJ merger rejection

Has the hype around ‘Internet of Things’ paid off? | Ep. 145

Episode 1: Understanding Cisco’s Converged SDN Transport

Episode 2: Pluggable Optics and the Internet for the Future

How to use the lsblk command

How to use the fdisk command

How to use the du command