Lenovo Big Data Reference Design for Cloudera Data Platform on ThinkSystem Servers
Reference Architecture

Updated: 29 Sep 2021
Form Number: LP1458
PDF size: 35 pages, 1.4 MB

Abstract

This document describes the reference design for Cloudera Data Platform software on ThinkSystem servers. It provides architecture guidance for designing optimized hardware infrastructure for the Cloudera Data Platform Private Cloud edition, a distribution of Apache Hadoop and Apache Spark with enterprise-ready capabilities from Cloudera. The reference design covers planning, design considerations, and best practices for implementing Cloudera Data Platform with Lenovo products. It also includes considerations for GPU acceleration of Apache Spark 3.0 on ThinkSystem servers.

The intended audience for this reference design is IT professionals, technical architects, sales engineers, and consultants who assist in planning, designing, and implementing a big data solution with Lenovo hardware. It is assumed that you are familiar with Cloudera Data Platform components and capabilities.

Table of Contents

  1. Introduction
  2. Business problem and business value
  3. Requirements
  4. Architectural Overview
  5. Component Model
  6. Operational Model
  7. Resources

Change History

Changes in the September 29, 2021 update:

  • Updated Apache Spark 3.0 with GPU acceleration

Related product families

Product families related to this document are the following: