Daily Bulletin Archive

January 23, 2019

CISL has determined that a hardware failure in the UCAR enterprise Ethernet network was the root cause of last night’s problems on Cheyenne. The network problem caused Cheyenne to lose communications with GLADE, which resulted in a significant number of failed jobs and poor system performance.

CISL system and storage administrators implemented a workaround to restore Cheyenne-GLADE communications, and no further unscheduled interruptions are expected. It may be necessary to schedule a brief outage in the near future to implement a more permanent repair. Users will be notified well in advance if such an outage is scheduled.

January 22, 2019

CISL is now accepting large-scale allocation requests from university-based researchers for the 5.34-petaflops Cheyenne supercomputer and the Casper data analysis and visualization cluster. Submissions are due March 5. Researchers are encouraged to review these allocation instructions before preparing their requests.

In addition to requesting computing allocations, university projects should request long-term space on the NCAR Campaign Storage resource instead of HPSS. Unlike HPSS, Campaign Storage has no default minimum amount; users are asked to justify the amount requested. The CISL HPC Allocations Panel (CHAP) is applying increased scrutiny to data management plans and storage requests.  

At the spring meeting, CISL will allocate up to 275 million core-hours on Cheyenne, up to 2 PB of Campaign Storage space, and up to 200 TB of GLADE project space. Large allocations on Cheyenne are those requesting more than 400,000 core-hours. CISL accepts requests from university researchers for these large-scale allocations every six months. Please contact cislhelp@ucar.edu if you have any questions.

 

January 17, 2019

A video recording and slides from the January 14 NCAR/CISL tutorial for new Cheyenne supercomputer users have been added to the CISL Course Library. The 50-minute Introduction to Cheyenne tutorial covers basic usage and typical user workflows. Topics discussed include:

  • The Cheyenne computing environment

  • Accessing software, including compilers and MPI libraries

  • Submitting batch jobs using the PBS scheduler

January 15, 2019

The next regular maintenance operations on NCAR’s HPC systems are scheduled for Tuesday, February 5. The Cheyenne and Casper clusters and the GLADE file system are expected to be unavailable from 7 a.m. until 6 p.m. MST, but every effort will be made to restore the systems to users earlier. More details on the outage will be published in the Daily Bulletin later this month.

January 15, 2019

HPSS: Tuesday, 7:30-8:30 a.m.

No downtime: Cheyenne, Casper, Campaign Storage, GLADE   

January 15, 2019

New installations of Python – versions 3.6.8 and 2.7.15 – are now available on both Cheyenne and Casper. These installations were compiled with GCC 7 and use Intel's MKL for accelerated NumPy performance. You can add the executables to your path using environment modules, and then load our library of packages using the NPL utility. See this page for more information on using CISL's Python libraries.

The default version of Python will be switched from 2.7.14 to 3.6.8 on both systems on Tuesday, January 22. This move to Python 3 follows the end of new feature development on the scientific Python stack for 2.7.
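Because the default interpreter is changing, it can help to confirm which Python version a script actually runs under. The following is a minimal sketch rather than a CISL-provided tool; the function names `python_major_minor` and `is_python3` are hypothetical, and the code uses only the standard `sys` module so that it runs under both Python 2.7 and Python 3.

```python
import sys


def python_major_minor(version_info=None):
    """Return (major, minor) for a version tuple, or for the running interpreter."""
    if version_info is None:
        version_info = sys.version_info
    return (version_info[0], version_info[1])


def is_python3(version_info=None):
    """Return True if the given (or running) interpreter is Python 3 or later."""
    return python_major_minor(version_info)[0] >= 3


if __name__ == "__main__":
    major, minor = python_major_minor()
    # %-formatting is used because it works in both Python 2 and Python 3.
    print("Running Python %d.%d" % (major, minor))
    if not is_python3():
        print("This interpreter is Python 2; scripts may need updating "
              "before they run under the new default.")
```

Running the script before and after the switch shows whether a job picks up the new default or still relies on an explicitly loaded 2.7 installation.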

 

January 8, 2019

The NCAR Campaign Storage system will be unavailable on Tuesday, January 8, starting at approximately 9 a.m. MST to allow CISL staff to perform a system upgrade. It is expected to be returned to service before 5 p.m.

The High-Performance Storage System (HPSS) will be unavailable for about an hour beginning at 7:30 a.m. CISL will inform users through the Notifier service when all of the systems are restored.

Previously announced maintenance on the Cheyenne and Casper clusters and the GLADE file system has been deferred until February or will be accomplished without taking the expected outage.

January 8, 2019

CISL has scheduled routine maintenance downtimes on the following Tuesdays:

  • January 8, 2019 (details)

  • February 5, 2019

  • March 5, 2019

An extended downtime is being planned in May for a major operating system update to the Cheyenne system and for electrical work at the NCAR-Wyoming Supercomputing Center. Details will be announced in the Daily Bulletin when the dates are set.

Information on scheduled outages is also available on the CISL HPC calendar.

 

January 8, 2019

Casper: Monday, noon to 2 p.m.

Campaign Storage, HPSS: Details

No downtime: Cheyenne, GLADE   

January 4, 2019

The Fortran language has evolved considerably since its first version in 1957 to take advantage of new computing systems. Now, the use of vector (SIMD) instructions can result in substantial performance improvements on current and future architectures. Alessandro Fanfarillo, NCAR Research Applications Laboratory, will lead a workshop January 29 to help participants make their Fortran codes run more efficiently through vectorization and other techniques.

When: 9 a.m. to noon and 1 to 3 p.m., Tuesday, January 29

Where: Room 3131, Center Green campus (CG1), Boulder

Participants will get a detailed, practical explanation of how to obtain high performance from modern Fortran codes, with a particular focus on how to exploit the hardware instructions provided by modern processors. Prerequisite: Basic knowledge of Fortran 90 constructs, such as array syntax and allocation, recursion, modules, and intrinsic, elemental and pure functions. Participants are encouraged to bring their own codes and laptop computers.

Register here to attend.

 
