This document describes the reference design for Cloudera Data Platform software on ThinkSystem servers. It provides architecture guidance for designing optimized hardware infrastructure for the Cloudera Data Platform Private Cloud edition, a distribution of Apache Hadoop and Apache Spark with enterprise-ready capabilities from Cloudera. The reference design covers planning, design considerations, and best practices for implementing Cloudera Data Platform with Lenovo products. It also includes considerations for GPU acceleration of Apache Spark 3.0 on ThinkSystem servers.
The intended audience for this reference design is IT professionals, technical architects, sales engineers, and consultants who assist in planning, designing, and implementing a big data solution with Lenovo hardware. It is assumed that you are familiar with Cloudera Data Platform components and capabilities.
Table of Contents
- Business problem and business value
- Architectural Overview
- Component Model
- Operational Model
Changes in the September 29, 2021 update:
- Updated Apache Spark 3.0 with GPU acceleration