system news

Problems with home-directories and project storage (2021-04-01) *SOLVED 2021-04-06*

  • Posted on: 1 April 2021
  • By: roger

We are noticing intermittent file server crashes causing problems. This causes problems with the file systems for $HOME and project storage. As a user it is mostly noticed by logins getting stuck after authentication and/or really slow filesystem access (simple ls might takes minutes).

The exact cause is not known but we are doing active debugging together with our vendor. During the debugging we might block the starting of new batch jobs. You can still submit jobs but they will not start.


Cluster maintenance at HPC2N 2021-03-22 - 2021-03-25, *FINISHED*

  • Posted on: 5 March 2021
  • By: ake

Dear users,

During this maintenance, 2021-03-22 - 2021-03-25, we’re going to do some upgrades on the parallel file system, where home directories and project storage is located, along with other upgrades on Kebnekaise itself.

Since this maintenance affects the parallel file system we have to drain the batch nodes from running jobs. Login sessions will be disabled and active sessions will be terminated, during that period.

Migrating away from /pfs/nobackup at HPC2N

  • Posted on: 28 January 2021
  • By: bbrydsoe

Migrating away from /pfs/nobackup at HPC2N

Dear PIs and users at HPC2N.

As you hopefully already know our new storage system is in full production since November.

There is still some work to be done by You, the user, to make the transition to Project Storage complete.
All data in your /pfs/nobackup$HOME space must be moved to a Project Storage directory or to your $HOME space depending on the type and amount of data.

2020-01-13 CVMFS issues affects local software and modules *Resolved*

  • Posted on: 13 January 2021
  • By: nikke

We are currently having issues with the CVMFS subsystem on all HPC2N machines. This affects local software and modules, amongst other things.

Fixing is in progress, but might take a while before everything is sorted out.

We apologize for the inconvenience.


2021-01-13 11:59

The problem has been resolved




Bus error on Kebnekaise

  • Posted on: 26 December 2020
  • By: zao

As a side effect of the recent file system upgrade we are observing a small set of user-installed programs crashing with a bus error when loading dynamic libraries from the PFS file system. We are working with the vendor to find the root cause of these and are running some bulk operations on the file system to mitigate the problem.

The problem is elusive and may only affect a particular set of nodes and may disappear when hashing or otherwise fully reading the affected files on that node, or reinstalling the affected files in another location.

Maintenance on storage file system, batch queues stopped 2020-12-01 08:30 *FINISHED*

  • Posted on: 1 December 2020
  • By: ake

We're currently doing some minor maintenance on the storage file system.

The batch queues on Kebnekaise are stopped, therefor no new jobs will be allowed to start.

Already running jobs will continue to run.

The file system should be available most of the time but may at times access may be stalled for shorter periods.

Logins may be slower than normal.


* UPDATE 2020-12-01 11:30 *

Maintenance done and batch queues now running again.

New storage system for users and project storage at HPC2N, maintenance window starting 2020-11-12 08:00 *FINISHED*

  • Posted on: 9 November 2020
  • By: ake

We have a maintenance window starting 2020-11-12 08:00 for migrating to our new storage system.

This affects batch queues and login nodes for all users.


We have acquired a new storage system which will replace our old center storage.
The new storage is twice as large and will provide better performance overall for both user and project storage.

The old storage will be decommissioned and the data must therefore be migrated to the new system.

Name resolution outage 2020-10-10 (resolved)

  • Posted on: 10 October 2020
  • By: zao

Early Saturday morning on October 10th we observed an outage in name resolution (DNS) for HPC2N hosts.

This prevented external and internal access to batch nodes, the website and more.

2020-10-10 11:00
We have restored DNS functionality and HPC2N should once again be reachable from the outside. It may take a little while for the changes to propagate out to internet providers.

Batch jobs that were running may not have run to completion and any queued jobs may have failed on startup.


Updated: 2023-03-07, 11:24