GPU Options for ThinkSystem Servers

Reference Information

Updated: 28 Sep 2023
Form Number: LP0767
PDF size: 11 pages, 2.8 MB

* Only available in selected markets

Abstract

Learn more about GPU technology that accelerates different computing workloads and maximizes performance for graphic design, virtualization, artificial intelligence, and high performance computing applications in Lenovo servers.

Introduction

Lenovo ThinkSystem servers support GPU and accelerator technology from NVIDIA, Intel, AMD, and Qualcomm to accelerate different computing workloads and maximize performance for graphic design, virtualization, artificial intelligence, and high performance computing applications.

NVIDIA AI and Virtualization

SXM GPUs

The following SXM GPUs from NVIDIA are offered for ThinkSystem servers.

  • ThinkSystem NVIDIA H100 SXM5

    The NVIDIA H100 GPU delivers unprecedented performance, scalability, and security for every workload. The GPUs use breakthrough innovations in the NVIDIA Hopper™ architecture to deliver industry-leading conversational AI, speeding up large language models by 30X over the previous generation. The NVIDIA H100 is available in both double-wide PCIe adapter form factor and in SXM form factor. The H100 SXM5 GPU is used in Lenovo's Neptune direct-water-cooled ThinkSystem SD665-N V3 server for the ultimate in GPU performance and heat management.

  • ThinkSystem NVIDIA A100 SXM

    NVIDIA A100 Tensor Core GPUs deliver outstanding acceleration and flexibility to power the world’s highest-performing elastic data centers for AI, data analytics, and HPC applications. As the engine of the NVIDIA data center platform, A100 provides up to 20X higher performance over V100 GPUs and can efficiently scale up to thousands of GPUs, or be partitioned into seven isolated GPU instances to accelerate workloads of all sizes. NVIDIA A100 is available in both double-wide PCIe adapter form factor and in SXM form factor. The A100 SXM GPU is used in Lenovo's Neptune direct-water-cooled ThinkSystem SD650-N V2 server for the ultimate in GPU performance and heat management.

NVIDIA dual-slot adapters

The following dual-slot (double-wide) GPUs from NVIDIA are offered for ThinkSystem and ThinkAgile servers.

  • ThinkSystem NVIDIA H100 & H100 NVL GPUs

    The ThinkSystem NVIDIA H100 PCIe Gen5 GPU delivers unprecedented performance, scalability, and security for every workload. The GPUs use breakthrough innovations in the NVIDIA Hopper™ architecture to deliver industry-leading conversational AI, speeding up large language models by 30X over the previous generation. The NVIDIA H100 is available in both double-wide PCIe adapter form factor and in SXM form factor. The NVIDIA H100 NVL Tensor Core GPU is optimized for large language model (LLM) inference, with its high compute density, high memory bandwidth, high energy efficiency, and unique NVLink architecture.

  • ThinkSystem NVIDIA H800 & H800 NVL GPUs

    The ThinkSystem NVIDIA H800 PCIe Gen5 GPU delivers high performance, scalability, and security for every workload. It uses breakthrough innovations in the NVIDIA Hopper architecture to deliver industry-leading conversational AI.

    Note: The ThinkSystem NVIDIA H800 is only available in the China, Hong Kong and Macau markets.

  • ThinkSystem NVIDIA L40S GPU

    The ThinkSystem NVIDIA L40S 48GB PCIe Gen4 Passive GPU is a powerful universal GPU for the data center, delivering breakthrough multi-workload acceleration for generative AI and large language model (LLM) inference and training, graphics, and video applications. AI models are exploding in complexity and popularity with the disruption led by LLMs such as ChatGPT and generative AI diffusion models. The L40S’s fourth-generation Tensor Cores with the Transformer Engine and new FP8 data format enable AI performance that exceeds that of the NVIDIA A100 Tensor Core GPU for many AI training and inference workloads.

  • ThinkSystem NVIDIA L40 GPU

    The ThinkSystem NVIDIA L40 48GB PCIe Gen4 Passive GPU delivers unprecedented visual computing performance for the data center and provides revolutionary neural graphics, compute, and AI capabilities to accelerate the most demanding visual computing workloads. The NVIDIA L40, based on the NVIDIA Ada Lovelace GPU architecture, features new-generation RT Cores and Tensor Cores, delivering in combination over a petaflop of inferencing performance. These new features are combined with the latest-generation CUDA Cores and 48GB of graphics memory to accelerate visual computing workloads from high-performance virtual workstation instances to large-scale digital twins in NVIDIA Omniverse.

  • ThinkSystem NVIDIA A100 GPU

    The NVIDIA A100 Tensor Core GPU delivers acceleration at every scale for AI, data analytics, and HPC to tackle the world’s toughest computing challenges. As the engine of the NVIDIA data center platform, A100 can efficiently scale up to thousands of GPUs or, using new Multi-Instance GPU (MIG) technology, can be partitioned into seven isolated GPU instances to accelerate workloads of all sizes. A100’s third-generation Tensor Core technology now accelerates more levels of precision for diverse workloads, speeding time to insight as well as time to market.

  • ThinkSystem NVIDIA A800 GPU

    The NVIDIA A800 Tensor Core GPU delivers outstanding acceleration and flexibility to power the highest-performing elastic data centers for AI, data analytics, and HPC applications. As the engine of the NVIDIA data center platform, A800 provides significantly higher performance than V100 GPUs and can efficiently scale up to thousands of GPUs, or be partitioned into seven isolated GPU instances to accelerate workloads of all sizes.

    Note: The ThinkSystem NVIDIA A800 is only available in the China, Hong Kong and Macau markets.

  • ThinkSystem NVIDIA Tesla V100S GPU

    The NVIDIA Tesla V100S GPU adapter is a dual-slot 10.5 inch PCIe 3.0 card with a single NVIDIA Volta GV100 graphics processing unit (GPU). The GPU supports double precision (FP64), single precision (FP32) and half precision (FP16) compute tasks, as well as unified virtual memory and a page migration engine. The V100S GPU offers improved performance over the V100, featuring a ~25% increase in memory bandwidth and higher FLOPS.

  • ThinkSystem NVIDIA Tesla V100 GPU

    The NVIDIA Tesla V100 GPU adapter is a dual-slot 10.5 inch PCIe 3.0 card with a single NVIDIA Volta GV100 graphics processing unit (GPU). The GPU supports double precision (FP64), single precision (FP32) and half precision (FP16) compute tasks, as well as unified virtual memory and a page migration engine. It is available with either 16GB or 32GB of HBM2 high-bandwidth memory.

  • ThinkSystem NVIDIA A30 GPU

    The NVIDIA A30 offers versatile compute acceleration for mainstream enterprise servers. With NVIDIA Ampere architecture Tensor Cores and Multi-Instance GPU (MIG), it delivers speedups securely across diverse workloads, including AI inference at scale and HPC applications. The A30 combines fast memory bandwidth and low-power consumption in a PCIe form factor to enable an elastic data center and delivers maximum value for enterprises.

  • ThinkSystem NVIDIA A16 GPU

    Take remote work to the next level with NVIDIA A16. Combined with NVIDIA Virtual PC (vPC) or NVIDIA RTX Virtual Workstation (vWS) software, the A16 enables virtual desktops and workstations with the power and performance to tackle any project from anywhere. Purpose-built for high-density, graphics-rich virtual desktop infrastructure (VDI) and leveraging the NVIDIA Ampere architecture, A16 provides double the user density versus the previous generation, while ensuring the best possible user experience.

  • ThinkSystem NVIDIA Tesla P100 GPU

    The NVIDIA Tesla P100 is a high-performance computing GPU for HPC workloads and deep learning training workloads. The P100 GPU accelerator features 16GB memory capacity and is powered by the NVIDIA Pascal architecture. It is designed to boost throughput and save money for HPC and hyperscale data centers.

  • ThinkSystem NVIDIA Tesla P40 GPU

    The NVIDIA Tesla P40 GPU accelerator is purpose-built to deliver maximum throughput for deep learning deployment. The P40 is powered by the NVIDIA Pascal architecture to provide the computational engine for the new era of artificial intelligence, enabling amazing user experiences by accelerating deep learning applications at scale.

  • ThinkSystem NVIDIA Tesla M60 GPU

    The NVIDIA Tesla M60 delivers high performance for virtualization applications. The M60 GPU accelerator features 16GB memory capacity and works with NVIDIA GRID software to provide the industry’s highest user performance for virtualized workstations, desktops, and applications.

  • ThinkSystem NVIDIA Tesla M10 GPU

    The ThinkSystem NVIDIA Tesla M10 GPU accelerator works with NVIDIA GRID software to provide the industry’s highest user density for virtualized desktops and applications. It supports 64 desktops per board and 128 desktops per server, giving your business the power to deliver great experiences to all of your employees at an affordable cost.
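
The Tesla M10 density figures above (64 desktops per board, 128 per server) imply two boards per server. The following Python sketch turns those two numbers into a rough upper-bound sizing estimate. The function names are hypothetical, and it assumes every desktop runs at the maximum quoted density; real deployments depend on the vGPU profile chosen and are usually less dense.

```python
import math

DESKTOPS_PER_BOARD = 64    # quoted in the text
DESKTOPS_PER_SERVER = 128  # quoted in the text


def boards_per_server() -> int:
    """Number of M10 boards per server implied by the two quoted densities."""
    return DESKTOPS_PER_SERVER // DESKTOPS_PER_BOARD


def size_deployment(users: int) -> tuple[int, int]:
    """Return (boards, servers) needed for `users` desktops at maximum density.

    Assumption: every desktop uses the smallest profile, so the quoted
    per-board and per-server maxima apply. Treat the result as a lower
    bound on hardware, not a validated design.
    """
    if users < 0:
        raise ValueError("users must be non-negative")
    boards = math.ceil(users / DESKTOPS_PER_BOARD)
    servers = math.ceil(users / DESKTOPS_PER_SERVER)
    return boards, servers


print(boards_per_server())   # 2
print(size_deployment(300))  # (5, 3)
```

For 300 users the estimate is five boards across three servers, which leaves one board slot unused on the third server.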

NVIDIA single-slot adapters

The following single-slot (single-wide) GPUs from NVIDIA are offered for ThinkSystem, ThinkEdge and ThinkAgile servers.

  • ThinkSystem NVIDIA L4 GPU

    The ThinkSystem NVIDIA L4 24GB PCIe Gen4 Passive GPU delivers universal acceleration and energy efficiency for video, AI, virtual workstations, and graphics in the enterprise, in the cloud, and at the edge. With NVIDIA’s AI platform and full-stack approach, L4 is optimized for video and inference at scale for a broad range of AI applications to deliver the best in personalized experiences.

  • ThinkSystem NVIDIA A10 GPU

    The NVIDIA A10 Tensor Core GPU, combined with NVIDIA RTX Virtual Workstation (vWS) software, brings mainstream graphics and video with AI services to mainstream enterprise servers, delivering the solutions that designers, engineers, artists, and scientists need to meet today’s challenges. Built on the NVIDIA Ampere architecture, the A10 combines second-generation RT Cores, third-generation Tensor Cores, and new streaming multiprocessors with 24 GB of GDDR6 memory for versatile graphics, rendering, AI, and compute performance. From virtual workstations accessible anywhere in the world, to render nodes, to data centers running a variety of workloads, A10 is built to deliver optimal performance in a single-wide, full-height, full-length PCIe form factor.

  • ThinkSystem NVIDIA A2 GPU

    The NVIDIA A2 Tensor Core GPU provides entry-level inference with low power, a small footprint, and high performance for NVIDIA AI at the edge. Featuring a low-profile PCIe Gen4 card and a low 40-60W configurable thermal design power (TDP) capability, the A2 brings versatile inference acceleration to any server for deployment at scale.

  • ThinkSystem NVIDIA Tesla V100 FHHL GPU

    The NVIDIA Tesla V100 FHHL GPU Accelerator is the latest NVIDIA Volta family product in a full-height half-length (FHHL) form factor, suitable for advanced data center functions to accelerate AI, HPC, and graphics. The Tesla V100 FHHL offers significant performance and great power efficiency.

  • ThinkSystem NVIDIA Tesla T4 GPU

    The NVIDIA Tesla T4 GPU supports diverse cloud workloads, including high-performance computing, deep learning training and inference, machine learning, data analytics, and graphics. Based on the new NVIDIA Turing Architecture and packaged in an energy-efficient 70-watt, small PCIe form factor, Tesla T4 is optimized for scale-out computing environments with its multi-precision Turing Tensor Cores and new RT Cores.

  • ThinkSystem NVIDIA Tesla P4 GPU

    The NVIDIA Tesla P4 is a single-slot, low-profile, PCIe 3.0 GPU accelerator with an NVIDIA Pascal engine. The Tesla P4 has 8 GB of GDDR5 memory and a 75 W maximum power limit, and features optimized INT8 instructions aimed at deep learning inference computations. The GPU delivers up to 22 TOPS of inference performance, enabling smart, responsive AI-based services.

NVIDIA 3D Graphics

NVIDIA dual-slot graphics adapters

The following dual-slot (double-wide) GPUs from NVIDIA are offered for ThinkSystem and ThinkAgile servers.

  • ThinkSystem NVIDIA A40 GPU

    The NVIDIA A40 is a powerful data center GPU for visual computing, delivering high performance and capabilities to professionals for graphics-based workloads such as ray traced rendering, high-performance virtual workstations, simulation, 3D design, VR, and virtual production. The A40 GPU is a graphics-based virtualization solution for designers, engineers, scientists, and creatives who need this performance from anywhere in the world.

  • ThinkSystem NVIDIA Quadro RTX 8000 GPU

    Bring the power of RTX to the data center with the NVIDIA Quadro RTX 8000, and Quadro Virtual Data Center Workstation (Quadro vDWS) software, built on the NVIDIA Turing architecture and the NVIDIA RTX platform for powerful server-based visual computing solutions. Accelerate multiple data center workloads including batch rendering, data science, virtual workstation, simulation, and augmented or virtual reality over 5G networks. Customers can also serve multiple powerful virtual workstations with Quadro vDWS software.

  • ThinkSystem NVIDIA Quadro RTX A6000 GPU

    Unlock the next generation of revolutionary designs, scientific breakthroughs, and immersive entertainment with the NVIDIA RTX A6000, the world's most powerful visual computing GPU. With cutting-edge performance and features, the RTX A6000 lets you work at the speed of inspiration to tackle the urgent needs of today and meet the rapidly evolving, compute-intensive tasks of tomorrow.

  • ThinkSystem NVIDIA Quadro RTX 6000 GPU

    NVIDIA Quadro RTX 6000, powered by the NVIDIA Turing architecture and the NVIDIA RTX platform, brings the most significant advancement in computer graphics in over a decade to professional workflows. Designers and artists can now wield the power of hardware-accelerated ray tracing, deep learning, and advanced shading to dramatically boost productivity and create amazing content faster than ever before.

  • ThinkSystem NVIDIA Quadro RTX 5000 GPU

    Shatter the boundaries of what’s possible with the NVIDIA Quadro RTX 5000, powered by the NVIDIA Turing GPU to bring real-time ray tracing and accelerated AI to next-generation workflows. Creative and technical professionals can supercharge demanding design and visualization workloads and make more informed decisions faster than ever before.

  • ThinkSystem NVIDIA Quadro P6000 GPU

    The NVIDIA Quadro P6000 is an advanced professional graphics solution. It features 24GB memory capacity and brings unprecedented power, performance, and capabilities to professional users.

  • ThinkSystem NVIDIA RTX A4500 GPU

    Based on the groundbreaking NVIDIA Ampere Architecture graphics processing unit (GPU), NVIDIA RTX A4500 delivers hardware-accelerated ray tracing, revolutionary AI features, advanced shading, and powerful simulation capabilities to creative professionals. With 20 GB of GDDR6 graphics memory, the A4500 GPU enables the most graphics-intensive applications to run with the highest level of user experience, even with the largest data sets.

  • ThinkSystem NVIDIA RTX A2000 GPU

    The NVIDIA RTX A2000 brings the power of NVIDIA RTX technology, real-time ray tracing, AI-accelerated compute, and high-performance graphics to more professionals. Built on the NVIDIA Ampere architecture, the VR-ready RTX A2000 combines 26 second-generation RT Cores, 104 third-generation Tensor Cores, 3,328 next-generation CUDA cores, and 6 or 12GB of GDDR6 graphics memory with error correction code (ECC) support for error-free computing. The RTX A2000 GPU features a power-efficient low-profile, dual-slot PCIe form factor, and the RTX A2000 12GB doubles memory for even larger models and datasets. Design bigger, render faster, and work smarter than ever before with RTX A2000 GPUs.
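
The RTX A2000 core counts quoted above (26 RT Cores, 104 Tensor Cores, 3,328 CUDA cores) can be cross-checked with a little arithmetic. The Python sketch below assumes one RT Core per streaming multiprocessor (SM), which holds for the Ampere graphics GPUs but is not stated in the text. The variable names are illustrative only.

```python
# Core counts quoted in the text for the RTX A2000.
RT_CORES = 26
TENSOR_CORES = 104
CUDA_CORES = 3328

# Assumption (not from the text): one RT Core per streaming multiprocessor,
# so the RT Core count doubles as the SM count.
sm_count = RT_CORES

# Per-SM figures implied by the quoted totals.
cuda_cores_per_sm, cuda_remainder = divmod(CUDA_CORES, sm_count)
tensor_cores_per_sm, tensor_remainder = divmod(TENSOR_CORES, sm_count)

print(f"SMs: {sm_count}")
print(f"CUDA cores per SM: {cuda_cores_per_sm} (remainder {cuda_remainder})")
print(f"Tensor Cores per SM: {tensor_cores_per_sm} (remainder {tensor_remainder})")
```

Both divisions come out even (128 CUDA cores and 4 Tensor Cores per SM), so the three quoted totals are mutually consistent under that assumption.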

NVIDIA single-slot graphics adapters

The following single-slot (single-wide) GPUs from NVIDIA are offered for ThinkSystem and ThinkAgile servers.

  • ThinkSystem NVIDIA Quadro RTX 4000 GPU

    Meet the challenge of today’s demanding professional workflows with NVIDIA Quadro RTX 4000, powered by NVIDIA Turing architecture and the NVIDIA RTX platform. The NVIDIA Quadro RTX 4000 delivers GPU accelerated ray tracing, deep learning, and advanced shading in an accessible single slot form factor. It gives designers the power to accelerate their creative efforts with faster time to insight and faster time to solution.

  • ThinkSystem NVIDIA Quadro RTX T1000 GPU

    The NVIDIA T1000, built on the NVIDIA Turing GPU architecture, is a powerful, low profile solution that delivers the full-size features, performance and capabilities required by demanding professional applications in a compact graphics card. Featuring 896 CUDA cores and 8GB of GDDR6 memory, the T1000 enables professionals to tackle multi-app workflows, from 3D modeling to video editing. Support for up to four 5K displays gives you the expansive visual workspace to view your work in stunning detail.

  • ThinkSystem NVIDIA Quadro RTX T400 GPU

    The NVIDIA T400, built on the NVIDIA Turing GPU architecture, delivers amazing performance and capabilities to power a range of professional workflows. The T400 GPU features 384 CUDA cores and 2GB of GDDR6 memory, and has native support for up to three 5K displays.

  • ThinkSystem NVIDIA Quadro P4000 GPU

    The NVIDIA Quadro P4000 combines a 1792 CUDA core Pascal GPU, 8 GB of GDDR5 memory, and advanced display technologies to deliver the performance and features that are required by demanding professional applications.

  • ThinkSystem NVIDIA Quadro P2200 GPU

    The NVIDIA Quadro P2200 offers a balance of performance, compelling features, and compact form factor. It is powered by an NVIDIA Pascal GPU with 1280 CUDA cores and 5 GB of memory. It also enables an expansive visual workspace with the ability to drive up to four 5K displays.

  • ThinkSystem NVIDIA Quadro P2000 GPU

    The NVIDIA Quadro P2000 is powered by an NVIDIA Pascal GPU with 1024 CUDA cores and 5 GB of memory. It also enables an expansive visual workspace with the ability to drive up to four 5K displays, combining outstanding performance and features in a compact form factor.

  • ThinkSystem NVIDIA Quadro P620 GPU

    The NVIDIA Quadro P620 combines a 512 CUDA core Pascal GPU, 2 GB of GDDR5 on-board memory, and advanced display technologies to deliver amazing performance for a range of professional workflows. It is suitable for professional CAD, DCC, and visualization designers, engineers, and users.

  • ThinkSystem NVIDIA Quadro P600 GPU

    The NVIDIA Quadro P600 combines a 384 CUDA core Pascal GPU, large on-board memory and advanced display technologies to deliver amazing performance for a range of professional workflows. Offers 2 GB of ultra-fast GPU memory to enable the creation of complex 2D and 3D models.

Intel AI and Virtualization

  • Intel Max Series 1550 GPU

    The Intel Max Series 1550 GPUs are optimized for machine learning and high-performance computing applications while also containing media decode and encode engines to support certain media analytics use cases. These highly specialized GPUs are enabled with Intel’s oneAPI, an open cross-architecture programming model. Intel Max Series 1550 GPUs are used in Lenovo's Neptune direct-water-cooled ThinkSystem SD650-I V3 server for the ultimate in GPU performance and heat management.

AMD AI and Virtualization

  • ThinkSystem AMD Instinct MI210 Accelerator

    The ThinkSystem AMD Instinct MI210 Accelerator is a compute workhorse optimized for accelerating single-precision and double-precision HPC-class systems. The accelerator can also be deployed for training large-scale machine intelligence workloads. The accelerator's powerful compute engine, new matrix math FP64 cores, and advanced memory architecture, combined with AMD’s ROCm open software platform and ecosystem, provide a powerful, flexible heterogeneous compute solution that is designed to help datacenter designers meet the challenges of a new era of compute.

  • ThinkSystem AMD Radeon Pro V340 GPU

    The AMD Radeon Pro V340 datacenter graphics card delivers an impressively smooth GPU experience from the cloud to virtually any device, anywhere. With MxGPU technology at its core, this hardware-based virtualized graphics solution provides high levels of predictable performance, enhanced security and support for up to 32 VMs per card. This MxGPU solution is easy to set up and manage, and does not require end user licenses, providing enterprises with a lower cost per user.

  • ThinkSystem AMD Radeon Instinct MI25 GPU

    The AMD Radeon Instinct MI25 accelerator provides a powerful, flexible heterogeneous compute solution that allows datacenter designers to meet the challenges of a new era of compute and machine intelligence. The GPU delivers 24.6 TFLOPS of FP16 and 12.3 TFLOPS of FP32 peak performance through its 64 compute units with 4,096 stream processors.
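
The MI25 peak figures above can be related to each other with a short calculation. The Python sketch below is a back-of-envelope check, not a vendor formula: it assumes each stream processor completes one fused multiply-add (two floating-point operations) per clock at FP32, and derives the clock speed that the quoted 12.3 TFLOPS would then imply. The function and constant names are hypothetical.

```python
# Figures quoted in the text for the Radeon Instinct MI25.
COMPUTE_UNITS = 64
STREAM_PROCESSORS = 4096
FP32_PEAK_TFLOPS = 12.3
FP16_PEAK_TFLOPS = 24.6

# Assumption (not from the text): one fused multiply-add per stream
# processor per clock, counted as 2 floating-point operations.
FLOPS_PER_LANE_PER_CLOCK = 2


def implied_clock_ghz(peak_tflops: float, lanes: int, flops_per_clock: int) -> float:
    """Clock speed (GHz) at which `lanes` units reach `peak_tflops`."""
    return peak_tflops * 1e12 / (lanes * flops_per_clock) / 1e9


stream_processors_per_cu = STREAM_PROCESSORS // COMPUTE_UNITS
clock_ghz = implied_clock_ghz(
    FP32_PEAK_TFLOPS, STREAM_PROCESSORS, FLOPS_PER_LANE_PER_CLOCK
)
fp16_to_fp32_ratio = FP16_PEAK_TFLOPS / FP32_PEAK_TFLOPS

print(f"Stream processors per compute unit: {stream_processors_per_cu}")
print(f"Implied peak clock: {clock_ghz:.2f} GHz")
print(f"FP16 : FP32 peak ratio: {fp16_to_fp32_ratio:.1f}")
```

Under that assumption the quoted numbers are self-consistent: 64 stream processors per compute unit, a peak clock of roughly 1.5 GHz, and FP16 throughput at twice the FP32 rate.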

Qualcomm AI

  • ThinkSystem Qualcomm Cloud AI 100 Accelerator

    The Qualcomm Cloud AI 100 is designed for AI inference acceleration, and addresses the unique requirements in the cloud, including power efficiency, scale, process node advancements, and signal processing. The AI 100 enables data centers to run inference on the edge cloud faster and more efficiently. Qualcomm Cloud AI 100 is designed to be a leading solution for datacenters that increasingly rely on infrastructure at the edge cloud. The ThinkSystem Qualcomm Cloud AI 100 accelerator is offered on ThinkEdge servers to enable customers to deploy AI workloads at the edge of their network. The AI 100 supports over 150 neural networks across multiple categories, including image classification, object detection, semantic segmentation, and natural language processing.

Trademarks

Lenovo and the Lenovo logo are trademarks or registered trademarks of Lenovo in the United States, other countries, or both. A current list of Lenovo trademarks is available on the Web at https://www.lenovo.com/us/en/legal/copytrade/.

The following terms are trademarks of Lenovo in the United States, other countries, or both:
Lenovo®
ThinkAgile®
ThinkEdge®
ThinkSystem®

The following terms are trademarks of other companies:

Intel® is a trademark of Intel Corporation or its subsidiaries.

Other company, product, or service names may be trademarks or service marks of others.