lustre

Problems with home-directories and project storage (2021-04-01) *SOLVED 2021-04-06*

  • Posted on: 1 April 2021
  • By: roger

We are noticing intermittent file server crashes causing problems. This causes problems with the file systems for $HOME and project storage. As a user it is mostly noticed by logins getting stuck after authentication and/or really slow filesystem access (simple ls might takes minutes).

The exact cause is not known but we are doing active debugging together with our vendor. During the debugging we might block the starting of new batch jobs. You can still submit jobs but they will not start.

 

Cluster maintenance at HPC2N 2021-03-22 - 2021-03-25, *FINISHED*

  • Posted on: 5 March 2021
  • By: ake

Dear users,

During this maintenance, 2021-03-22 - 2021-03-25, we’re going to do some upgrades on the parallel file system, where home directories and project storage is located, along with other upgrades on Kebnekaise itself.

Since this maintenance affects the parallel file system we have to drain the batch nodes from running jobs. Login sessions will be disabled and active sessions will be terminated, during that period.

Updated: 2024-03-21, 12:31