Daily Bulletin Archive

May 28, 2019

No scheduled downtime: Cheyenne, Casper, Campaign Storage, GLADE and HPSS.

May 23, 2019

CISL has released a pre-production implementation of the popular JupyterHub platform on the Cheyenne system. It is accessible at jupyterhub.ucar.edu with a valid Cheyenne user ID and YubiKey or Duo authentication.

As it is a pre-production offering, no documentation is available at this time for the installation, and CISL cannot guarantee its availability, robustness, or reliability. This JupyterHub instance will remain in a pre-production state for the lifetime of the Cheyenne cluster. A fully supported instance is expected to be available on the Cheyenne system’s successor, which is scheduled to be deployed in 2021.

After coming online earlier this year, NCAR's JupyterHub portal has been leveraged by several workshops, tutorials, and users with varying degrees of success. If you plan to employ JupyterHub in a workshop or tutorial, please notify CISL at cislhelp@ucar.edu at least one week in advance so we can provide the best possible experience and minimize potential issues.

May 20, 2019

When a Globus user authenticates to transfer files to or from an NCAR Campaign Storage endpoint or other endpoint in the Cheyenne/GLADE environment, the default credential lifetime is 24 hours. Users can minimize the need to authenticate that frequently – and simplify both attended and unattended Globus transfers – by extending that lifetime to up to 720 hours (30 days).

Globus file transfers describes how to use the web and command-line interfaces. It also links to other support resources.

Users can also contact the CISL Consulting Services Group for assistance.
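For unattended transfers, the lifetime request can be scripted. The following is a minimal sketch, not an official CISL procedure: it only builds the command line and does not run it. The `globus endpoint activate` subcommand and its `--myproxy` and `--myproxy-lifetime` flags are assumptions based on the Globus CLI of this period, so check `globus endpoint activate --help` on your system before relying on them. The helper name `activation_command` and the endpoint ID placeholder are hypothetical.

```python
from typing import List

DEFAULT_LIFETIME_HOURS = 24   # default credential lifetime stated in the bulletin
MAX_LIFETIME_HOURS = 720      # maximum stated in the bulletin (30 days)


def activation_command(endpoint_id: str,
                       lifetime_hours: int = MAX_LIFETIME_HOURS) -> List[str]:
    """Build (but do not run) a command requesting a longer credential lifetime.

    The subcommand and flag names are assumptions; verify them against the
    help output of the Globus CLI version you have installed.
    """
    if not 1 <= lifetime_hours <= MAX_LIFETIME_HOURS:
        raise ValueError(
            f"lifetime must be between 1 and {MAX_LIFETIME_HOURS} hours, "
            f"got {lifetime_hours}"
        )
    return [
        "globus", "endpoint", "activate",
        "--myproxy",
        "--myproxy-lifetime", str(lifetime_hours),
        endpoint_id,
    ]


if __name__ == "__main__":
    # Placeholder ID: substitute the ID of the endpoint you actually use.
    print(" ".join(activation_command("ENDPOINT-ID-PLACEHOLDER")))
```

Rejecting values above 720 before the command is issued keeps a typo from surfacing later as a failed unattended transfer.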

May 17, 2019

A major Cheyenne operating system update has been scheduled for the last week of June, and the routine maintenance times scheduled for June 4, July 2, and August 6 have been canceled. The Cheyenne system will be unavailable from Monday, June 24, through Monday, July 1, as CISL staff update from SUSE Linux Enterprise Server (SLES) Service Pack 1 to SLES Service Pack 4. The update is required to bring the system up to current security and support levels and is expected to be the last operating system upgrade in Cheyenne’s lifetime.

During the update outage all of the Cheyenne cluster will be unavailable, including the system’s login nodes. The Casper cluster, GLADE file system, and HPSS will all be available through the recently deployed Casper login nodes. Users should also be aware that all cron services will be unavailable throughout the outage.

Most users’ programs and executables will need to be rebuilt following the update, as many system libraries will change. Most scripts should not require modifications, but users are encouraged to thoroughly test their commonly used scripts after the system is returned to service.
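One way to build a rebuild-and-retest checklist is to list executable files that were last modified before the update. The sketch below is illustrative only: the cutoff date assumes the system returns to service on July 1 as scheduled, and the function name `stale_executables` is hypothetical. It also picks up scripts that carry the executable bit, which are candidates for retesting rather than rebuilding.

```python
import stat
from datetime import datetime
from pathlib import Path
from typing import List

# Assumed cutoff: the scheduled return-to-service date from the bulletin.
UPDATE_COMPLETED = datetime(2019, 7, 1)


def stale_executables(root: Path,
                      cutoff: datetime = UPDATE_COMPLETED) -> List[Path]:
    """Return executable regular files under root last modified before cutoff."""
    cutoff_ts = cutoff.timestamp()
    exec_bits = stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH
    stale = []
    for path in root.rglob("*"):
        if not path.is_file():
            continue  # skip directories and other non-regular entries
        info = path.stat()
        if info.st_mode & exec_bits and info.st_mtime < cutoff_ts:
            stale.append(path)
    return sorted(stale)


if __name__ == "__main__":
    # Scan the current directory; point this at your own build tree instead.
    for exe in stale_executables(Path(".")):
        print(exe)
```

Modification time is only a heuristic: a file copied or touched after the update can look current even though it was compiled against the old libraries.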

May 17, 2019

Checking your usage charges frequently can help ensure that you are using your supercomputing and storage allocations as efficiently as possible. Also make sure that others who are authorized to charge against your allocation understand how to use those resources efficiently.

More about managing allocations and other best practices.

May 14, 2019

The latest NCAR Package Library (NPL) version, 20190326, is now the default. It provides a script, start-jupyter, to help users launch JupyterLab on Cheyenne and Casper. JupyterLab is the new interface to the Jupyter ecosystem and provides many features beyond those of the traditional notebook interface. The earlier interface is still available via start-notebook.

Updated documentation is online: Jupyter and IPython.

May 10, 2019

The Cheyenne system’s compute nodes were rebooted, tested, and returned to service shortly after midnight today, a day earlier than planned. Worsening weather conditions in Cheyenne, Wyoming, resulted in some delays to this week’s electrical repair efforts at the NCAR-Wyoming Supercomputing Center (NWSC). However, most of the scheduled repairs and maintenance were completed as planned.

Some unfinished work that affected Cheyenne’s availability was suspended and will be rescheduled to be completed at a later date. CISL thanks everyone for their patience and cooperation during the extensive repair efforts at the NWSC facility this week.

May 10, 2019

NCAR HPC system users are reminded of the scheduled downtime for Cheyenne’s compute nodes Monday, May 6, through Saturday, May 11, while extensive electrical repairs take place at the NCAR-Wyoming Supercomputing Center. Cheyenne’s login nodes, the Casper cluster, and GLADE will remain available on UPS power. HPSS is scheduled to be down briefly for electrical recabling on Monday, May 6, from 7 a.m. to 1 p.m. MDT but otherwise is expected to be available during the week.

A major Cheyenne operating system update also is being planned and will require an extended downtime, most likely in late June or early July. Details will be announced in the Daily Bulletin when the dates are set.

May 9, 2019

Intel software engineers will conduct a half-day training session titled “Intel Developer Tools” on Wednesday, May 22, from 1 to 4:30 p.m. MDT. The training will be held at the VisLab (ML4), NCAR Mesa Lab, in Boulder and is open to all UCAR and NCAR employees and external collaborators. To attend, please register at one of the links below.

Topics to be covered include:

  • Intel Compilers

  • Intel Distribution for Python (MKL-optimized ML packages: numpy/scipy/sklearn; Intel ML library DAAL; MKL-DNN-optimized DL frameworks)

  • Intel Performance Libraries

  • Intel VTune

  • Intel Advisor (Flow Graph Analyzer; Roofline Analysis; Platform Profiler)

  • Intel Inspector

  • Intel MPI

  • Intel Trace Analyzer

Register to attend either in person or online.

May 7, 2019

The CISL User Services Section will present an in-person and online tutorial at 9:30 a.m. MDT on Friday, May 24, for new users of NCAR’s Cheyenne high-performance computing (HPC) system and the Casper data analysis and visualization cluster. The 90-minute tutorial is intended for individuals who are either new to HPC or unfamiliar with the Cheyenne user environment.

Topics will include:

  • Overview of compute and storage resources

  • Using software and building applications

  • Scheduling jobs on the batch resources

  • Workflow recommendations and best practices

Register to attend either in person or online.
