system news

2023-06-09 A mishap with Slurm caused a loss of the job accounting data for Kebnekaise jobs today between 00:00 and 16:40

  • Posted on: 9 June 2023
  • By: brorerik

We can see no other effect on running jobs, and the job queues are now open again after having been DOWN for 1 hour.

If you see any other negative effects, send us a support case and we'll help solve the issue.

Sorry for the inconvenience that this may have caused.

Best regards,

/Support

2023-01-30 07:00 Planned maintenance of the cooling systems and central file system (FINISHED 2023-02-02 20:30)

  • Posted on: 20 January 2023
  • By: brorerik

Akademiska hus has planned maintenance of the cooling systems for the HPC2N infrastructure computer hall on 2023-02-01.

We'll coordinate an upgrade of the central file system around their maintenance to minimize the time the cluster is draining jobs.

The combined maintenance window will therefore start on 2023-01-30 07:00 and, according to our planning, end on 2023-02-03 16:00.

All Kebnekaise nodes, central storage and the login nodes will be unavailable during this time.

2022-12-05 File system down, login not working (SOLVED 23:58)

  • Posted on: 5 December 2022
  • By: brorerik

We are currently experiencing file system server problems.

This is blocking logins and is also affecting running jobs.

We're working to get it back online but currently have no ETA for this.

UPDATE 23:58

The issues have now been resolved; all systems are working normally and the job queues are active.

UPDATE 17:30

Work on the file system verification continues. The job queues will not be up until late this evening or around 09:00 tomorrow.

UPDATE 13:10

2022-10-06 File system down, login not working (SOLVED 2022-10-06 21:40)

  • Posted on: 6 October 2022
  • By: ake

Today at around 08:30 the file system started to have problems.

This is blocking logins and is also affecting running jobs.

We're working to get it back online.

UPDATE 2022-10-06 21:40

The problem with the file system has been solved and Kebnekaise is now up and working normally again.

Due to today's problems, some running jobs might have failed and will need to be restarted by the user.

We are sorry for the problems that this has caused.

Regards,

/Support

Problems with the file system - affecting login, etc. (2022-09-26) ** FIXED **

  • Posted on: 26 September 2022
  • By: bbrydsoe

We are currently experiencing some problems with the file system.

Because of this there are problems accessing the file system, logging in, etc.

We are working on finding the issue and getting it back online, but there is currently no ETA for this.

This news will be updated when we have more information.

** UPDATE, 13:00 **

The file system is up and running, but we are still looking into the problem and there is a risk of further issues.

** UPDATE, 16:40 **

The partitions and queues are back up and the jobs are running again.

Updated: 2024-06-25, 16:43