system news

External HPC2N connectivity intermittently down, 2024-02-26, 18-22

Posted on: 20 February 2024
By: bbrydsoe

Due to technical service on a router for UmU, external HPC2N connectivity is affected intermittently during 2004-02-26, 18:00-22:00.

Jobs that are already submitted should continue to run, and jobs in the queue should start, but anything that needs external connectivity (possibly some of the license servers) may be affected.

Thinlinc login to Kebnekaise currently unavailable, 2024-02-08 09:30, DONE

Posted on: 8 February 2024
By: ake

The Thinlinc login to Kebnekaise is currently unavailable due to ongoing upgrade.

It should be finished after lunch

* UPDATE 2024-02-08 14:00 *

Thinlinc login to Kebnekaise is available again.

Main HPC2N router rebooting 2024-01-03 14:30

Posted on: 3 January 2024
By: ake

Our main router to HPC2N will be rebooting 2024-01-03 13:30 which will cause some network interruptions.

While this is being done no new jobs will be starting.

It is expected to be back online ~15:45

Unexpected cooling failure 2023-12-04 00:30 Kebnekaise down, RESOLVED 20231204 1114

Posted on: 4 December 2023
By: ake

Due to an unexpected cooling failure around 00:30 on 2023-12-04 Kebnekaise is currently down.

We are currently investigating why this happened and will bring Kebnekaise back online once we have established the cause.

*UPDATE 20231204 11:14*

Kebnekaise is now up again.

Maintenance on Kebnekaise file system 2023-11-15 - 2023-11-17 FINISHED

Posted on: 3 November 2023
By: ake

We are going to upgrade the file system for projects and home directories, /proj/nobackup and /home.

There is a maintenance window for this from 2023-11-15 08:00 - 2023-11-15 17:00.

No jobs will be allowed to run during that period and login nodes will also be disabled.

This is synchronized with electrical and cooling maintenance work on 2023-11-16 to minimize total downtime.

*** UPDATE 2023-11-17 11:25 ***

The maintenance is now finished and batch system and login nodes are back online.

Maintenance on Kebnekaise 2023-10-12 - 2023-10-13

Posted on: 4 October 2023
By: ake

We will have a maintenance window on Kebnekaise 2023-10-12 - 2023-10-13 to upgrade the batch system. It is due to an important security update of SLURM (the workload manager/batch scheduler).

From 2023-10-12 08:00 no jobs will be allowed to run, this means that jobs will not be allowed to start if their requested time limit reaches into this service window.

The maintenance window is from 2023-10-12 08:00 to 2023-10-13 17:00, but we hope to be finished before that.

New hardware available on Kebnekaise including new GPUs

Posted on: 20 June 2023
By: ake

There are three new nodes available on Kebnekaise.

Two of the new nodes have dual NVIDIA A100 GPUs and one is a many-core CPU node.

There are some notable differences with these nodes.

2023-06-09 A mishap with Slurm caused a loss of the job accounting data for Kebnekaise jobs today between 00:00 and 16:40

Posted on: 9 June 2023
By: brorerik

2023-06-09 A mishap with Slurm caused a loss of the job accounting data for Kebnekaise jobs today between 00:00 and 16:40

We can see no other effect on running jobs and the job queue are now open again after having been DOWN for 1 hour

If you see some other negative effect send us a support case and we'll help solving the issue

Sorry for the inconvenience that this may have caused.

Best regards,

/Support

Password changing/resetting temporarily disabled 2023-04-12

Posted on: 12 April 2023
By: ake

We're currently moving the password handling to another host and during the move chaing or resetting passwords are not possible.

The service will be back up later today.

2023-01-24 07:37 File system down, login not working (RESOLVED 09:40)

Posted on: 24 January 2023
By: brorerik

We are currently experiencing file system server problems.

This is blocking logins and is also affecting running jobs.

We're working to get it back online but currently have no ETA for this.

UPDATE 09:40
The problem with the file server has been solved and everything should work as normal now

External HPC2N connectivity intermittently down, 2024-02-26, 18-22

Thinlinc login to Kebnekaise currently unavailable, 2024-02-08 09:30, DONE

Main HPC2N router rebooting 2024-01-03 14:30

Unexpected cooling failure 2023-12-04 00:30 Kebnekaise down, RESOLVED 20231204 1114

Maintenance on Kebnekaise file system 2023-11-15 - 2023-11-17 FINISHED

Maintenance on Kebnekaise 2023-10-12 - 2023-10-13

New hardware available on Kebnekaise including new GPUs

2023-06-09 A mishap with Slurm caused a loss of the job accounting data for Kebnekaise jobs today between 00:00 and 16:40

Password changing/resetting temporarily disabled 2023-04-12

2023-01-24 07:37 File system down, login not working (RESOLVED 09:40)

Pages

HPC2N footer

Search form

You are here

system news

Pages

HPC2N footer