The NNSA Office of Advanced Simulation and Computing (ASC) faces significant challenges from ongoing technology advancements: it must continue to meet the mission needs of current applications while adapting to both revolutionary and evolutionary changes in computing technology. ASC recognizes that the simulation environment of the future will be transformed by new computing architectures and new programming models, and has therefore committed to developing and deploying a series of Advanced Technology (AT) systems. The ASC roadmap states “work in this timeframe will establish the technological foundation to build toward exascale computing environments, which predictive capability may demand.” It is critical for ASC both to explore the rapidly changing technology of future systems and to provide platforms with more capability and higher performance for predictive capability. Trinity is the first instantiation of an AT system and will balance usability for the current simulation codes with adaptation to new computing technologies and programming methodologies.
The Trinity supercomputer is provided by Cray, Inc. and is based on its XC30 platform architecture. Trinity combines Intel Haswell and Knights Landing (KNL) processors. The Haswell partition provides a natural transition path for many of the legacy codes running on the Cielo supercomputer, Trinity’s predecessor. To use the KNL processor to its full potential, the ASC code teams must expose higher levels of thread- and vector-level parallelism than has been necessary for traditional multicore architectures. To facilitate this transition, the Trinity Center of Excellence was established, with staff from the ASC tri-Labs, Cray, and Intel.
Trinity introduces tightly integrated nonvolatile “burst buffer” storage capabilities. Embedded within the high-speed fabric are nodes with attached solid-state disk drives. The burst buffer capability will accelerate checkpoint/restart performance and relieve much of the pressure normally placed on the back-end storage arrays. In addition, the burst buffer will support novel workload management strategies such as in-situ analysis, which opens a new space in which projects can manage their overall workflows.
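The checkpoint/restart pattern that a burst buffer enables can be illustrated with a minimal sketch. It is not Trinity's or Cray's actual interface: the class `TwoTierCheckpointer` and its methods are hypothetical names, and two temporary directories stand in for the burst buffer (fast tier) and the back-end storage arrays (slow tier). The application writes each checkpoint to the fast tier and resumes computing at once, while a background thread drains the file to the slow tier.

```python
import os
import shutil
import tempfile
import threading


class TwoTierCheckpointer:
    """Hypothetical helper: write to a fast tier, drain to a slow tier in the background."""

    def __init__(self, fast_dir, slow_dir):
        self.fast_dir = fast_dir  # stands in for the burst buffer
        self.slow_dir = slow_dir  # stands in for the back-end storage arrays
        self._drains = []

    def checkpoint(self, step, state):
        """Write `state` (bytes) to the fast tier, then start an asynchronous drain."""
        name = f"ckpt_{step:06d}.bin"
        fast_path = os.path.join(self.fast_dir, name)
        tmp_path = fast_path + ".tmp"
        with open(tmp_path, "wb") as f:
            f.write(state)
            f.flush()
            os.fsync(f.fileno())
        # Rename only after the data is written, so a partial file is never visible.
        os.replace(tmp_path, fast_path)
        drain = threading.Thread(target=self._drain, args=(name,))
        drain.start()
        self._drains.append(drain)
        return fast_path

    def _drain(self, name):
        """Copy one checkpoint from the fast tier to the slow tier."""
        src = os.path.join(self.fast_dir, name)
        dst = os.path.join(self.slow_dir, name)
        shutil.copyfile(src, dst + ".tmp")
        os.replace(dst + ".tmp", dst)

    def wait(self):
        """Block until every pending drain has finished."""
        for drain in self._drains:
            drain.join()
        self._drains.clear()

    def restart(self):
        """Return (step, state) for the newest checkpoint, preferring the fast tier."""
        for tier in (self.fast_dir, self.slow_dir):
            names = sorted(
                n for n in os.listdir(tier)
                if n.startswith("ckpt_") and n.endswith(".bin")
            )
            if names:
                latest = names[-1]
                with open(os.path.join(tier, latest), "rb") as f:
                    return int(latest[5:11]), f.read()
        return None


def demo():
    """Take three checkpoints, wait for the drains, and restart from the newest one."""
    with tempfile.TemporaryDirectory() as fast, tempfile.TemporaryDirectory() as slow:
        ckpt = TwoTierCheckpointer(fast, slow)
        for step in range(3):
            ckpt.checkpoint(step, f"state at step {step}".encode())
        ckpt.wait()
        drained = sorted(os.listdir(slow))
        return ckpt.restart(), drained


if __name__ == "__main__":
    print(demo())
```

In this sketch the application is blocked only for the duration of the fast-tier write; the slower copy to back-end storage overlaps with computation, which is the source of the relief on the storage arrays described above.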
Trinity also introduces advanced power management functionality that allows monitoring and control of power consumption at the system, application, and component levels. Although advanced power management is not needed to stay within the current power and operational budget, the functionality is being used to gain a better understanding of future system requirements and features.