system news

Migrating away from /pfs/nobackup at HPC2N

  • Posted on: 28 January 2021
  • By: bbrydsoe

Migrating away from /pfs/nobackup at HPC2N

Dear PIs and users at HPC2N.

As you hopefully already know our new storage system is in full production since November.

There is still some work to be done by You, the user, to make the transition to Project Storage complete.
All data in your /pfs/nobackup$HOME space must be moved to a Project Storage directory or to your $HOME space depending on the type and amount of data.

2020-01-13 CVMFS issues affects local software and modules *Resolved*

  • Posted on: 13 January 2021
  • By: nikke

We are currently having issues with the CVMFS subsystem on all HPC2N machines. This affects local software and modules, amongst other things.

Fixing is in progress, but might take a while before everything is sorted out.

We apologize for the inconvenience.

 

2021-01-13 11:59

The problem has been resolved

 

 

 

Bus error on Kebnekaise

  • Posted on: 26 December 2020
  • By: zao

As a side effect of the recent file system upgrade we are observing a small set of user-installed programs crashing with a bus error when loading dynamic libraries from the PFS file system. We are working with the vendor to find the root cause of these and are running some bulk operations on the file system to mitigate the problem.

The problem is elusive and may only affect a particular set of nodes and may disappear when hashing or otherwise fully reading the affected files on that node, or reinstalling the affected files in another location.

Maintenance on storage file system, batch queues stopped 2020-12-01 08:30 *FINISHED*

  • Posted on: 1 December 2020
  • By: ake

We're currently doing some minor maintenance on the storage file system.

The batch queues on Kebnekaise are stopped, therefor no new jobs will be allowed to start.

Already running jobs will continue to run.

The file system should be available most of the time but may at times access may be stalled for shorter periods.

Logins may be slower than normal.

 

* UPDATE 2020-12-01 11:30 *

Maintenance done and batch queues now running again.

New storage system for users and project storage at HPC2N, maintenance window starting 2020-11-12 08:00 *FINISHED*

  • Posted on: 9 November 2020
  • By: ake

We have a maintenance window starting 2020-11-12 08:00 for migrating to our new storage system.

This affects batch queues and login nodes for all users.

Background

We have acquired a new storage system which will replace our old center storage.
The new storage is twice as large and will provide better performance overall for both user and project storage.

The old storage will be decommissioned and the data must therefore be migrated to the new system.

Name resolution outage 2020-10-10 (resolved)

  • Posted on: 10 October 2020
  • By: zao

Early Saturday morning on October 10th we observed an outage in name resolution (DNS) for HPC2N hosts.

This prevented external and internal access to batch nodes, the website and more.

2020-10-10 11:00
We have restored DNS functionality and HPC2N should once again be reachable from the outside. It may take a little while for the changes to propagate out to internet providers.

Batch jobs that were running may not have run to completion and any queued jobs may have failed on startup.

Urgent repairs to city cooling network affects HPC2N compute resources (2020-10-01 - 2020-02-10)

  • Posted on: 29 September 2020
  • By: nikke

UPDATE 2020-10-02 07:30 Umeå Energi will perform urgent repairs on the city cooling network between Thursday 2020-10-01 18:00 and Friday 2020-10-02 12:00.

Due to unforeseen events, the end time for Umeå Energi maintenance have been moved to Friday 2020-10-02 12:00.

 

Initial information

Umeå Energi will perform urgent repairs on the city cooling network between Thursday 2020-10-01 18:00 and Friday 2020-10-02 04:00.

Pages

Updated: 2024-12-12, 13:22