NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Expanding a Supercomputer Facility Using Modular Data Center TechnologyWith the expansion of high-end computing resources needed to support NASA's increasing demands for physics-based simulations, the facility housing Pleiades-the agency's largest supercomputer-recently reached its power and cooling capacity. In response, the NASA Advanced Supercomputing Division at Ames Research Center undertook a prototype project that resulted in a new facility based on modular data center technology. The facility, a ~1000 square-foot module on a concrete pad with room for 16-18 compute racks, was completed in fall 2016 and an SGI computer system, named Electra, was deployed there in early 2017. Cooling is performed via an evaporative system built into the module, and preliminary experience shows a Power Usage Effectiveness (PUE) of ~1.03. Electra achieved over a petaflop on the LINPACK benchmark, sufficient to rank number 96 on the November 2016 TOP500 list. The system consists of 1,152 InfiniBand-connected Intel Xeon Broadwell-based nodes. Its users access their files on a facility wide file system shared by all compute assets via Mellanox MetroX InfiniBand extenders, which connect the Electra fabric to Lustre routers InfiniBand fabric over fiber-optic links about 300 meters long. The prototype has exceeded expectations and is serving as a blueprint for future expansions.*†
Document ID
20180007549
Acquisition Source
Ames Research Center
Document Type
Conference Paper
Authors
Hood, Robert T.
(Computer Sciences Corp. Moffett Field, CA, United States)
Mehrotra, Piyush
(NASA Ames Research Center Moffett Field, CA, United States)
Thigpen, William W.
(NASA Ames Research Center Moffett Field, CA, United States)
Tanner, Christopher Bryan
(Computer Sciences Corp. Moffett Field, CA, United States)
Buchanan, Christopher J.
(Computer Sciences Corp. Moffett Field, CA, United States)
Chan, Davin S.
(Computer Sciences Corp. Moffett Field, CA, United States)
Date Acquired
November 7, 2018
Publication Date
November 12, 2017
Subject Category
Computer Systems
Report/Patent Number
ARC-E-DAA-TN41643
Meeting Information
Meeting: SC17
Location: Denver, CO
Country: United States
Start Date: November 12, 2018
End Date: November 17, 2018
Sponsors: IEEE Computer Society, ACM
Funding Number(s)
CONTRACT_GRANT: NNA07CA29C
Distribution Limits
Public
Copyright
Public Use Permitted.
Keywords
Facility
Modular Data Center
Supercomputer
No Preview Available