Daily Bulletin Archive

December 7, 2018

The CISL Help Desk and Consulting support will close at 15:00 MDT in December 7, so staff members can attend a UCAR function.

 

December 5, 2018

CISL system administrators were able to install a new version of PBS during an unscheduled Cheyenne outage on Tuesday, December 4, so no Cheyenne downtime is needed during the maintenance period scheduled for December 11. Other system maintenance remains scheduled for December 11 as follows.

CISL staff will update several key data analysis and visualization system components and the Campaign Storage file system beginning at 8 a.m. MST. The maintenance will affect the availability of the Casper, Geyser, and Caldera clusters and file transfers to Campaign Storage. The GLADE file system is expected to remain available throughout the day.

Maintenance on Casper, Geyser, and Caldera is expected to take until 12 noon MST. The work will include installing a new version of the SLURM resource scheduler. These clusters will not be accessible during the maintenance window. Also note:

  • All running jobs, interactive processes, and login sessions will be terminated when maintenance begins.

  • Jobs that are queued for execution or in a hold state will remain in those states until Casper, Geyser, and Caldera are returned to service.

Maintenance on the Campaign Storage file system is expected to take until 5 p.m. MST. Globus transfers will be suspended for the duration of the maintenance but will resume when the work is completed.

December 5, 2018

The CISL Help Desk and Consulting Office will be closed Wednesday 3:00 pm - 5:30 pm for a CISL function. For assistance with any urgent issues, call 303-497-2400 or email cislhelp@ucar.edu both of which are always monitored.

December 3, 2018

No downtime: Cheyenne, GLADE, Geyser_Caldera, HPSS and Casper

November 30, 2018

Users should migrate their data analysis and visualization (DAV) workflows to the new Casper cluster soon if they have not already done so. The Geyser and Caldera clusters will be decommissioned on December 31 and are experiencing more frequent hardware failures as they approach end of life. Several Geyser and Caldera nodes have already suffered irrecoverable hardware failures. They have been removed and will not be replaced.

During a scheduled maintenance downtime on December 11, the execdav command will be updated so it defaults to assigning Casper nodes for use in interactive jobs. When Geyser and Casper are decommissioned, the execgy and execca commands will be removed. Keep an eye out for more information about the December 11 downtime in upcoming Daily Bulletins.

Related support for using Casper:

November 30, 2018

Users of CRYPTOCard keypads who have not converted to a supported authentication method will not be able to log in with those tokens as of December 3, 2018. As announced earlier in the Daily Bulletin and in emails to users, there are two alternatives:

Contact cislhelp@ucar.edu for assistance.

November 28, 2018

Cheyenne system administrators will perform preventive maintenance on the PBS workload management system at 12:00 noon MST today, November 28. The maintenance is expected to take less than 30 minutes to complete.

Most PBS commands will not work during that time, including qstat, and new Cheyenne job submissions will not be possible. Jobs that are executing when the maintenance begins will continue to run without interruption. Jobs that are queued for execution or in a hold state will remain in those states until PBS is returned to service. Access to Cheyenne’s login nodes will not be interrupted.

November 27, 2018

No downtime: Cheyenne, GLADE, Geyser_Caldera

November 26, 2018

CISL has published new recommendations for optimizing the performance of the Weather Research and Forecasting (WRF) modeling system. The recommendations are based on the results of numerous jobs that were run on the Cheyenne system by the CISL Consulting Services Group. The jobs included small runs and others with a variety of domain sizes and time steps. See Optimizing WRF performance for details.

 

November 21, 2018

A recording and slides of the November 14 NCAR/CISL tutorial on using the new Casper system are now available on the CISL web site. See the course page to review the presentation and download the slides. The presentation introduces the capabilities of the new Casper system, describes how to access its features, and provides some best practices.

Topics covered include:

  • The four types of Casper nodes and their features

  • Accessing Casper resources using Slurm

  • Interactive jobs and remote virtual desktops (VNC)

  • Using the GPU capabilities of Casper

Also see: Casper documentation home page.

Pages