Daily Bulletin Archive

June 26, 2020

No scheduled downtime: Cheyenne, Casper, GLADE, Campaign Storage, Object Store, or HPSS

June 25, 2020

Video and slides from the June 24 tutorial "Casper Basics for New Users" are now available in the CISL training library. The 30-minute presentation by Shiquan Su of the CISL Consulting Services Group provides a basic understanding of how to run jobs on the Casper cluster. Casper is a heterogeneous system of specialized data analysis and visualization resources and large-memory, multi-GPU nodes.

June 22, 2020

Users are reminded that the High Performance Storage System (HPSS) will reach its end of life and be decommissioned in 2021. HPSS file owners and project leads have been contacted and instructed on how to access lists of their files. The lists are updated weekly.

For reference, the lists can be found here:

  • /glade/work/csgteam/hpssreports/current/byusers/<userID>.data.gz
  • /glade/work/csgteam/hpssreports/current/byprojects/<projectID>.data.gz
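
Because the lists are gzip-compressed, it can be convenient to peek at the first few entries without unpacking the whole file. The following is a minimal sketch, assuming the reports are plain text once decompressed; the bulletin does not describe their internal format, so the sketch only returns raw lines. The function name preview_report and its max_lines parameter are illustrative, not part of any CISL tool.

```python
import gzip
from itertools import islice


def preview_report(path, max_lines=5):
    """Return up to max_lines lines from a gzip-compressed text file.

    Assumption: the report decompresses to text. Bytes that are not
    valid UTF-8 are replaced rather than raising an error.
    """
    with gzip.open(path, mode="rt", encoding="utf-8", errors="replace") as handle:
        # islice stops after max_lines, so a large report is not read in full.
        return [line.rstrip("\n") for line in islice(handle, max_lines)]
```

To use it, call preview_report with one of the paths listed above, substituting your own user or project ID for the placeholder, and print the returned lines.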

Writing HPSS files is no longer possible, but users can perform most common metadata operations on their HPSS holdings, including deleting, renaming, and moving files. Those who have not already done so should begin moving their data to alternative storage systems and deleting files that are no longer needed.

Documentation and training are available on recommended processes for identifying and organizing HPSS holdings; copying files that need to be preserved to another storage resource; and deleting files that are no longer needed. 

Please contact CISL for advice on individual workflows and storage options.

June 19, 2020

No scheduled downtime: Cheyenne, Casper, GLADE, Campaign Storage, Object Store, or HPSS

June 18, 2020

Acknowledging the support of NCAR and CISL computing when you publish research results helps ensure continued support from the National Science Foundation and other sources of funding for future high-performance computing (HPC) systems. It is also one of the requirements of receiving an allocation, as was noted in your award letter.

The reporting requirements and recommended wording of acknowledgments can be found on this CISL web page. The content of citations and acknowledgments varies depending on the type of allocation that was awarded.

June 15, 2020

No scheduled downtime: Cheyenne, Casper, Campaign Storage, GLADE, HPSS, Stratus

June 15, 2020

CISL will present an online tutorial at 10 a.m. MDT on Wednesday, June 24, for users who want to know more about the Casper data analysis and visualization cluster. There are no prerequisites.

A 30-minute presentation by Shiquan Su of the CISL Consulting Services Group will provide attendees with a basic understanding of how to run jobs on the Casper cluster. The presentation will be followed by additional time for questions. Topics to be covered include:

  • Node types for different tasks
  • Accessing Casper resources through the Slurm job scheduler
  • Interactive jobs and virtual remote desktops
  • Using GPUs on Casper

Please use this link to register.

June 11, 2020

Users are reminded that the Cheyenne HPC system will be unavailable for one week in mid-July to allow CISL staff and external contractors to perform power infrastructure repairs at the NCAR-Wyoming Supercomputing Center (NWSC). Some other systems will be unavailable for one day.

The maintenance period will begin Monday, July 13, and Cheyenne is expected to be returned to service by Saturday, July 18. The Casper cluster, GLADE and Campaign Storage file systems, data-access nodes, and HPSS will remain available throughout the week except for Tuesday, July 14. Some maintenance on those systems has been scheduled since our previous announcement. Watch for more details and updates in the Daily Bulletin.

June 8, 2020

New versions of major compilers and MPI libraries have been installed on both Cheyenne and Casper. These installations include Intel 19.1.1 (the 2020 release of the Parallel Studio), PGI 20.4, and GCC 10.1.0, as well as MPT MPI 2.22 and Open MPI 4.0.3. We have also installed Python 3.7.5 and have built new NCAR Package Library virtual environments.

On Monday, June 15, we will upgrade the default module environment to the following modules:

  • ncarenv/1.3
  • intel/19.0.5
  • ncarcompilers/0.5.0
  • netcdf/4.7.3
  • mpt/2.22 (Cheyenne), openmpi/4.0.3 (Casper)

Default versions of PGI, GCC, and Python will be upgraded to 19.9, 9.1.0, and 3.7.5, respectively. The default CUDA version on Casper will be set to 10.1. Older versions of these software installations will remain available, but CISL recommends migrating to newer versions at your earliest convenience.

June 8, 2020

CISL is seeking a director for its Technology Development Division (TDD), the lab's applied research and technology development program. The search was launched following the announcement that current TDD director Rich Loft will retire at the end of 2021 after 25 years of service to NCAR. In addition to guiding and managing TDD, Loft is one of CISL's resident experts on HPC in the Earth system sciences, takes a lead role in CISL business development activities, and created and fostered the SIParCS program over many years.

The TDD director will be responsible for leading CISL's efforts in evaluating HPC technology trends and the impact they will have on NCAR’s modeling enterprise. The position will have a lead role in preparing NCAR applications for exascale, which requires a broad, detailed understanding of HPC architectures, algorithmic developments, and optimization concepts in the domain of atmospheric and related sciences. TDD comprises six groups and roughly 35 staff. The groups encompass HPC application scalability and performance, machine learning, data assimilation, workflows, visualization, and GPU porting and other special projects.

See the UCAR job posting, Technology Development Division Director, for additional information or to apply. Daily Bulletin readers are encouraged to share this opportunity with potential candidates.