Usability in the Design of the ThinkSystem SR950Article

Published
16 Aug 2017
Form Number
LP0741
PDF size
12 pages, 1.1 MB

Abstract

In the data center, a server’s usability is very important to support installation, maintenance and upgrade activities. The Lenovo ThinkSystem SR950 provides a high level of usability in the system design. This article highlights the usability features in the SR950 such as, fan replacement, processor and memory access and XClarity management.

Introduction

Lenovo data center systems run many mission critical applications. In datacenters with hundreds to thousands of servers, system maintenance and uptime are critical to business. In order to help ease the burden of installing, servicing, and upgrading systems Lenovo strives to make these tasks as easy as possible by designing in high levels of usability into the systems. Usability that is designed into the server hardware and software can reduce time and cost associated with installing and maintaining systems, and can reduce the chance of errors occurring while working with the system.

Lenovo ThinkSystem SR950
Figure 1. Lenovo ThinkSystem SR950

The overall design of the Lenovo ThinkSystem SR950 is based on a modular service model where access is from the front and rear only. This means that nearly all parts can be removed from the front or rear of the system, even parts that are located in the center of the machine (e.g., fans, memory DIMMs, and processors). This design philosophy means the system does not require thick slide rails required to slide the system out for service actions, which maximizes the space inside the chassis for components. This provides more width for the system to increase system density to fit 8 processors in a 4U form factor.

Let’s look at several areas the ThinkSystem SR950 has designed ease of use into the system.

Identifying the SR950 Server

When you contact Lenovo for help, the machine type, model, and serial number information help support technicians to identify your server and provide faster service. The machine type, model number, and serial number, are provided in both text and 2D barcode format on the ID label on the front of the SR950 server.


Figure 2. 2D barcode on the front of the server

In addition, at the top rear of the Compute System Tray inside the server, we provide a quick reference (QR) code for mobile access to service information. You can scan the QR code with a mobile device using a QR code reader application and get quick access to the Service Information on the SR950 support web page that provides additional information for parts installation and replacement videos, and error codes for server support.

Service label with QR Code
Figure 3. Service label on the server

Fan replacement

The fan design is one of the most unique usability features in the system. Many rack systems have hot-swap fans that allow you to replace fans while the server continues to run. The SR950 process to replace the fans is much quicker and simpler than most servers. Typical rack servers require you to slide the server out from the rack, being careful not to pull out any of the IO cabling from the rear, and lift off the top cover to access the interior of the server. Once inside the server you can replace a fan being careful not to cut your fingers in the process.

The SR950 greatly simplifies the fan replacement process with a very novel design. There is no need to remove or pull-out the SR950 from the rack to replace a fan. Instead you just remove the front bezel and pull on the fan handle. The fan then pivots within its guiding tracks inside the system and then slides out the front of the server. To replace the fan just slide it in the same front slot until it stops and the latch engages, then replace the bezel. This design was key to allowing hot swap fans to be installed between the storage and compute sections of the system.


Figure 4. Fans are accessible from the front of the server

Accessing CPUs and memory

On typical rack servers, to access server CPU or memory, you are required to slide the server out from the rack, being careful not to pull out any of the I/O cabling from the rear, and lift off the top cover to access the interior of the server.

With the SR950, you rack the server once and access all components either from the front or rear of the server. To access the CPU or memory, you slide out either the Upper or Lower Compute Tray out the front of the server. At this point you can directly access the top Compute System board which contains 2 CPUs and up to 24 DIMMs. To access the other 2 CPUs and associated DIMMs, you slide out the top Compute Tray which then provides access the lower 2 CPUs and associated DIMMs.

Access to CPUs and memory
Figure 5. Access to CPUs and memory

Accessing rear I/O

As mentioned above to access interior components on typical rack servers you must first pull the server out of the rack and lift up the top cover. Then, access to the rear IO area typically requires extra reaching and awkward postures while accessing the rear IO area to remove or install adapters.

To access the rear IO (PCIe adapters, LOM, ML2) on the SR950 you pull the IO Tray out the rear of the server with the 2 latching cam handles. From this point all the PCIe adapters, LOM and ML2 cards are easily accessible to replace or add new adapters to empty slots.

Access to PCIe I/O slots
Figure 6. Access to PCIe I/O slots

Front storage

As with most rack systems the front storage devices (HDD, SSD, NVMe) are easily accessed individually from the front of the server by releasing the latch to unlock the drive handle, rotate the handle outward and pull the drive out of the server.

Each SR950 storage backplane supports 4 drives (2x2). You can have up to three backplanes in each Compute Tray (thus 6 backplanes or 24 drives in total). The SR950 supports storage backplanes that support SAS, SATA and NVMe drives in the same 2x2 backplane allowing the flexibility to choose and mix storage types within the same backplane.

Three of the six drive backplanes
Figure 7. Three of the six drive backplanes

Front Operator Panel & LCD

Most rack servers only have front LEDs that inform you of on/off power status, connection to network or not, and if some type of system error has occurred. You will need to connect a separate device to the server or access the management software online to pull detailed system information in order to know further details about the status of the server.

The SR950 contains not only the typical LEDs for power, network and errors, but also has a LCD panel that provides quick access to system status, firmware, network, and health information. The LCD allows you to scroll up and down to access different menu options.

LCD displays the following type of information:

  • System errors such as fan failure, power supply failure, PCI error, etc.
  • System VPD information (such as machine type and serial number)
  • System firmware levels such as UEFI and XCC code levels
  • LXCC network information such as MAC address and IP address
  • System environmental information such as temperature, voltage, power consumption

LCD pull-out operator panel
Figure 8. LCD pull-out display panel

The following image shows the LCD display panel menu options flow.

LCD display panel menu structure
Figure 9. LCD display panel menu structure

Light Path Diagnostics

The SR950 has a unique method to help guide the way to server issues. Light path diagnostics is a system of LEDs on various external and internal components of the server that leads you to the failed component. When an error occurs, LEDs are lit on the front operator panel on the front of the server, then on the failed component. By viewing the LEDs in a particular order, you can often identify the source of the error.

It begins with the Front Operator Panel and LCD that provides quick access to system status, firmware, network and health information (discussed above in further detail).

The SR950 has a unique Light Path Diagnostics on each Compute System Board. Each system board has LEDs that light up to indicate which component is causing the error that was generated. The Light Path LEDs on the system board can even be used when the compute tray is pulled out of the server by pushing the Light Path power button on the compute system board.

Compute System Board LEDs:

  • Each Processor (two per system board)
  • System board in general
  • Each of the 24 DIMM slots

In addition to the front panel/LCD and Compute System Board, the SR950 also has LEDs on the rear of the server. The SR950 has over 20 different LEDS for Power Supplies, PCIe, LOM and IO ports. Power Supplies and Storage drives also have their own error LEDs.

SR950 Compute Board with Light Path Diagnostics button
Figure 10. SR950 Compute Board with Light Path Diagnostics button

Lenovo XClarity Administrator

XClarity Administrator is an easy-to-use centralized, resource-management application that simplifies infrastructure management, speeds responses, and enhances the availability of Lenovo server systems and solutions. It runs as a virtual appliance that automates infrastructure tasks for Lenovo infrastructure:

  • Discovery and tracking inventory of managed hardware endpoints
  • Real-time monitoring and fault handling, such as call home and forwarding events and logs
  • Configuration management
  • Firmware compliance and updates
  • Installation of operating systems and hypervisors

To further increase data center productivity, speed, and precision, XClarity Administrator can be integrated into external management and automation platforms through open REST application programming interfaces (APIs) and XClarity Integrator software plugins. Leveraging a centralized point of integration and running XClarity in external data center platforms helps you focus more on building proficiencies in services and less on resource deployment and management. In addition, you can automate tasks using the PowerShell toolkit or the Python toolkit.

Other XClarity Management Tools

The SR950 supports several XClarity management tools to help manage the server more easily and efficiently.

  • Lenovo XClarity Integrators are a suite of software plugins that integrate XClarity Administrator into popular data center applications such as VMware vCenter, Microsoft System Center, and SAP ITOA.
  • Lenovo XClarity Energy Manager is a web-based power and temperature management solution designed for data center administrators. It monitors and manages the power consumption and temperature of servers.
  • Lenovo XClarity Controller is the embedded management engine in every ThinkSystem server. It delivers an all-new experience through an intuitive, modernized web-based graphical user interface and Redfish-complaint REST APIs.
  • XClarity Mobile is the mobile app that can run on a mobile device to help you manage your system while at the system. This can be accessed very easily by connecting a device to one of the USB ports on the front of the system for quick access to more detailed information on the health of the system.
  • Lenovo XClarity Provisioning Manager is an all-new, modernized UEFI tool used to configure ThinkSystem servers.
  • Lenovo Capacity Planner is a power consumption evaluation tool that enhances data center planning by enabling IT administrators to understand important parameters of different type of racks, servers, and other devices.
  • Lenovo Business Vantage is a security software tool suite designed to work with the Trusted Cryptographic Module (TCM) adapter for enhanced security, to keep user data safe, and to erase confidential data completely from a hard disk drive.

About the authors

Randall Lundin is the Mission Critical Product Manager in the Lenovo Infrastructure Solutions Group. He is responsible for managing and planning Lenovo’s 4-socket and 8-socket servers. Randall has also authored and contributed to numerous Lenovo Press publications in the Mission Critical space.

Dan Kelaher is a human factors engineer in Lenovo’s Infrastructure Solutions Group. He works on the usability for the Lenovo server hardware within the Lenovo User Experience Team. Dan has over 35 issued patents in the area of server design and usability, and is a Lenovo Master Inventor.

Related product families

Product families related to this document are the following:

Trademarks

Lenovo and the Lenovo logo are trademarks or registered trademarks of Lenovo in the United States, other countries, or both. A current list of Lenovo trademarks is available on the Web at https://www.lenovo.com/us/en/legal/copytrade/.

The following terms are trademarks of Lenovo in the United States, other countries, or both:
Lenovo®
ThinkAgile
ThinkSystem
XClarity®

The following terms are trademarks of other companies:

Intel® and Xeon® are trademarks of Intel Corporation or its subsidiaries.

Microsoft®, PowerShell, and SQL Server® are trademarks of Microsoft Corporation in the United States, other countries, or both.

Other company, product, or service names may be trademarks or service marks of others.