Authors
Updated
27 Mar 2024
Form Number
LP1458
PDF size
38 pages, 1.4 MB
Abstract
This document describes the reference design for Cloudera Data Platform software on ThinkSystem servers. It provides architecture guidance for designing optimized hardware infrastructure for the Cloudera Data Platform Private Cloud edition, a distribution of Apache Hadoop and Apache Spark with enterprise-ready capabilities from Cloudera. The reference design covers planning, design considerations, and best practices for implementing Cloudera Data Platform with Lenovo products. It also includes considerations for GPU acceleration of Apache Spark 3.0 on ThinkSystem servers.
The intended audience for this reference design is IT professionals, technical architects, sales engineers, and consultants who assist in planning, designing, and implementing this big data solution on Lenovo hardware. Readers are assumed to be familiar with Cloudera Data Platform components and capabilities.
Table of Contents
- Introduction
- Business problem and business value
- Requirements
- Architectural Overview
- Component Model
- Operational Model
- Customer Case
- Resources
Change History
Changes in the March 27, 2024 update:
- Added Intel Emerald Rapids description to section 6.1.1
- Updated accelerators in section 6.1.3