system news

New center storage system

  • Posted on: 28 April 2016
  • By: admin
Dear Users,
 
We apologize for the late notification. However we have some really good news.
 
HPC2N has during the summer and early autumn procured, tested and deployed a new center storage system which will replace the old, aging GPFS based system. The new system is a DDN SFA 12KX and Exascaler solution using Lustre as the underlying filesystem. The new storage system consists of 1PB storage and will be up to 25 times faster, depending on I/O pattern, than the old one. 
 

Problems with batchjobs on Abisko and Akka clusters.

  • Posted on: 28 April 2016
  • By: admin

Due to a recent security update of the 'bash' shell there is a high probability that jobs that were submitted before the nodes received the update may fail to use the 'module' functionality once they start running.

As the login nodes are now updated, any future job submissions should work as intended as soon as you log out and in again of any long-running sessions, as you need to be on the new version of the shell.

We are very sorry about this unforeseen problem.

Downtime on Abisko for large scale testing of our new center storage

  • Posted on: 28 April 2016
  • By: admin

Thursday Aug 28th we will performe some large scale tests of our new center storage.

To do this we have reserved ALL nodes of Abisko starting 09:30 CEST
The tests are expected to take a couple of hours after which the system will be back for normal use.

We recommend submitting jobs with shorter runtimes which will then fit nicely into the slots that become available due to draining the system.

UPDATE 2014-08-28 20:30 CEST

Power outage - Abisko and Akka affected

  • Posted on: 28 April 2016
  • By: admin

We just had a power outage this morning. All jobs running on both Akka and Abisko were interrupted. We are currently in the process of restoring the systems.

We hope to have all Akka compute nodes back in service by the end of today. 

Abisko nodes may take longer. It is possible that we can get them back up today, but they may not be available until Thursday morning, due to the scheduled file system update tomorrow.

Tue, 2014-08-19 11:40 | Daniel Petersen

Downtime due to installation of new cluster file system

  • Posted on: 28 April 2016
  • By: admin

On Wednesday Aug 20th we will do the initial installation of our new cluster file system.

To do this we need to do some recabling on the interconnect of Abisko.
We will therefore drain the cluster of jobs so that it will be empty at 08:00

The system is estimated to be back in production on Thursday morning at the latest.

More information on the new file system will be annouced later.

Wed, 2014-08-13 10:39 | Åke Sandgren

Power failure

  • Posted on: 28 April 2016
  • By: admin

Today (2014-07-19) we suffered a city-wide power failure around 15:27 CEST. This caused all of the Abisko and Akka nodes to lose power and shut down.

The clusters should be back in production again at approximately 21:00 CEST.

Sat, 2014-07-19 22:50 | Björn Torkelsson

Pages

Updated: 2024-03-21, 12:31