kebnekaise

Kebnekaise and Cluster maintenance at HPC2N 2021-11-15 06:00 - 2021-11-17 17:00

  • Posted on: 8 November 2021
  • By: brorerik

Dear users,

During this maintenance, 2021-11-15 06:00 - 2021-11-17 17:00, we’re going to do some upgrades on the parallel file system, where home directories and project storage are located, along with other upgrades on Kebnekaise itself.

Since this maintenance affects the parallel file system, we have to drain the batch nodes of running jobs. Login sessions will be disabled and active sessions will be terminated during that period.

Storage system issues cause Kebnekaise login and file access problems 2021-10-28 12:45 (CLOSED 15:15)

  • Posted on: 28 October 2021
  • By: brorerik

Storage system issues cause Kebnekaise login and file access problems.

The batch queues on Kebnekaise are stopped; therefore, no new jobs will be allowed to start.

We are working with our vendor to resolve the problem and will post updates here when we have more information.

UPDATED 15:15: The issues are now solved and the system is working normally again.

Storage system issues cause Kebnekaise login and file access problems 2021-09-08 08:00 (Resolved 13:00)

  • Posted on: 8 September 2021
  • By: brorerik

Storage system issues cause Kebnekaise login and file access problems.

The batch queues on Kebnekaise are stopped; therefore, no new jobs will be allowed to start.

We are working with our vendor to resolve the problem and will post updates here when we have more information.

 

** Update ** The problem with the storage system was resolved at 13:00 and the cluster is back up again, with access nodes available and queues enabled.

Problems with project storage (2021-06-11) *SOLVED 2021-06-12 01:17*

  • Posted on: 11 June 2021
  • By: brorerik

Problems with project storage: mkdir fails and new directories cannot be created.

The exact cause is not known, but we are actively debugging together with our vendor. During the debugging we have blocked the starting of new batch jobs. You can still submit jobs, but they will not start until the problem is solved.

 

* SOLVED 2021-06-12 01:17 *

The problem was finally identified and a workaround was put in place.

The system is now back in production.

Problems with home-directories and project storage (2021-04-01) *SOLVED 2021-04-06*

  • Posted on: 1 April 2021
  • By: roger

We are seeing intermittent file server crashes. These cause problems with the file systems for $HOME and project storage. Users will mostly notice this as logins getting stuck after authentication and/or very slow file system access (a simple ls might take minutes).

The exact cause is not known, but we are actively debugging together with our vendor. During the debugging we might block the starting of new batch jobs. You can still submit jobs, but they will not start while the block is in place.

 

Cluster maintenance at HPC2N 2021-03-22 - 2021-03-25, *FINISHED*

  • Posted on: 5 March 2021
  • By: ake

Dear users,

During this maintenance, 2021-03-22 - 2021-03-25, we’re going to do some upgrades on the parallel file system, where home directories and project storage are located, along with other upgrades on Kebnekaise itself.

Since this maintenance affects the parallel file system, we have to drain the batch nodes of running jobs. Login sessions will be disabled and active sessions will be terminated during that period.

Updated: 2021-11-11, 13:50