The Columbia project at NASA Ames includes twenty 512 processor Altix systems. Four of those systems are joined together into a Numalink based globally accessible non-coherent memory Altix SuperCluster. PBSPro 7.0 and 7.1 have been used to schedule and run work on this system. In this presentation, we will go over features of PBSPro especially applicable to such systems, some problems we have encountered and how they have been addressed, and remaining, unresolved issues.
|Experience with PBSPro on a 4 x 512p Altix SuperCluster||2014-07-31|