Daily Bulletin Archive

Jan. 19, 2018

The CISL Help Desk and Consulting Office will be closed Friday from 11:30 am - 2:00 pm for a USS function. For assistance with any urgent issues, call 303-497-2400 or email cislhelp@ucar.edu both of which are always monitored.

Jan. 18, 2018

CISL has added a new script for Cheyenne users that simplifies the launching of resource-intensive compilation jobs on the system’s batch nodes. Running qcmd as shown in this new documentation starts a non-interactive job on a single batch node in the Cheyenne "regular" queue with a default wall-clock time of 1 hour. It is recommended for running resource-intensive tasks such as CESM and WRF builds or any compiles with three or more threads.

Reminder: Use of the Cheyenne login nodes, as noted here, is restricted to running processes that do not consume excessive resources. This is to ensure an appropriate balance between user convenience and login node performance. Users are encouraged to compile on the Cheyenne batch nodes or on the Geyser or Caldera clusters, depending on where they want to run their programs.

Jan. 16, 2018

The Geyser and Caldera clusters will be unavailable from 8 a.m. to 8 p.m. MST on Tuesday, January 16, to allow CISL staff to update hardware and software components.

Users will be unable to log in to Geyser and Caldera during the outage. Reservations will be put in place on batch jobs before the outage begins. After the reservations are activated, jobs that are submitted with specified job times that overlap the reservation period will remain on hold until the systems are restored to service. All processes and running jobs that have not finished when the outage begins will be killed and will need to be resubmitted after the systems are returned to service.

Users will be informed via the CISL Notifier service when the systems are returned to production.

Jan. 15, 2018

A few slots remain open for the NCL Workshop, February 6-9, 2018, at NCAR’s Center Green Campus in Boulder.

The workshop will include introductory lectures on NCL, scientific data formats, and graphics, and daily hands-on labs.

During the labs, instructors will work with students to help them write NCL programs for analyzing their own data. The registration deadline is January 15.

Jan. 5, 2018

Two new Cheyenne batch queues, premium and standby, are now available to users. The queues are very similar to the Yellowstone queues with the same names.

Jobs submitted to the premium queue are dispatched for execution at a higher priority than other queues available to most users and will be charged at 1.5 times the rate of jobs in the regular queue.

Running jobs in the standby queue is allowed only for exhausted Cheyenne project allocations. Jobs submitted to other queues when an allocation is exhausted are rejected and the user is notified to resubmit the job to the standby queue. Those jobs are dispatched at the lowest priority level and do not incur charges.

Please see Job-submission queues and charges for more complete information on Cheyenne’s batch queues.

Jan. 4, 2018

A recording of the December 19 tutorial, “How to access Geyser and Caldera from Cheyenne,” is now available on the CISL web site. See the course page to review the presentation and download the slides.

The 55-minute tutorial covers how to start interactive jobs and batch jobs with the Slurm scheduler, performing graphically intensive tasks on Geyser, and considerations when using Cheyenne and Geyser/Caldera together in a workflow.

Jan. 3, 2018

The Cheyenne, Geyser and Caldera systems are expected to remain offline throughout today, January 3, as the NCAR-Wyoming Supercomputing Center (NWSC) facility continues to recover from the December 30 power outage.

Power has been restored to the NWSC and CISL staff have begun extensive hardware and system software testing to ensure that the systems were not damaged as a result of the sudden loss of power. Every effort is being made to restore Cheyenne, Geyser and Caldera as soon as possible.

CISL will inform users through the Notifier service of any significant updates and when the systems are restored. We apologize to all users for the inconvenience this will cause, and we thank you for your patience.

Also be aware that, as planned, the Yellowstone system has been powered down permanently.

Jan. 2, 2018

Scheduled Downtime: Cheyenne, Geyser and Caldera: Tuesday Jan. 2, 6 am to 6 pm

No downtime: GLADE and HPSS

Dec. 31, 2017

The Yellowstone supercomputing cluster will be decommissioned on December 31, 2017. The system will be powered down on that date and no longer available to users.

The Geyser and Caldera clusters will remain in production until the end of 2018. Users can learn about how to access those systems from the Cheyenne cluster in training sessions are scheduled for December 4 and 19. See Training: How to access Geyser and Caldera from Cheyenne for more details and how to register.

Users who have not already done so are strongly encouraged to migrate their work to Cheyenne as soon as possible to avoid unnecessary interruptions. Documentation available to help with the transition includes this Quick start overview, procedures and scripts for running jobs, and this recorded tutorial. For help in migrating job scripts or rebuilding executables, please contact cislhelp@ucar.edu.

It is not necessary to copy or move files to HPSS or to another disk space to ensure that they are available from Cheyenne after Yellowstone’s decommissioning. All files in the GLADE system file spaces—home, scratch, work, and project—are accessible via Cheyenne already. Copying or archiving files unnecessarily will waste expensive tapes and valuable disk space while increasing use of your allocation.

See our allocations documentation if you do not already have a Cheyenne allocation. University users who want to convert an existing Yellowstone allocation to a Cheyenne allocation should email that request to cislhelp@ucar.edu.

Dec. 27, 2017

CISL has resolved some issues that prevented the Systems Accounting Manager (SAM) from updating allocation usage records for several days recently. All records should be up to date and available now in SAM.

Pages