Problems with batch system
We have some disk problems on our batch server. This means that no jobs can be submitted. We are hoping it will not affect running jobs.
Fri, 2014-11-14 11:47 | Roger Oskarsson
We have some disk problems on our batch server. This means that no jobs can be submitted. We are hoping it will not affect running jobs.
Fri, 2014-11-14 11:47 | Roger Oskarsson
Friday afternoon (around 14:30) we had a power outage which affected both our cluster (Abisko and Akka). Our power company is blaming the weather.
Power should be back now and we are in the process of restarting everything.
Fri, 2014-10-24 14:52 | Roger Oskarsson
The new centre storage is now in production.
The /pfs/nobackup file system is now larger and faster, ... finally.
Almost all users have been synchronized to new new file system.
The few remaining users (those affected will get a separate mail) have been blocked from logging in and their jobs put on hold until the transfer is complete for each user.
Jobs are running again and login has been opened (see exception above).
If you notice anything strange please notify support@hpc2n.umu.se.
On Tuesday October 14 06:30-13:00 the HPC2N compute resources will be unavailable due to maintenance work being performed on the cooling/ventilation system in the cluster machine room.
Note that jobs that would extend into this downtime window won't start until the downtime has passed.
Wed, 2014-10-01 16:09 | Niklas Edmundsson
Due to a recent security update of the 'bash' shell there is a high probability that jobs that were submitted before the nodes received the update may fail to use the 'module' functionality once they start running.
As the login nodes are now updated, any future job submissions should work as intended as soon as you log out and in again of any long-running sessions, as you need to be on the new version of the shell.
We are very sorry about this unforeseen problem.
On Tuesday (2014-09-16) we will start to do more large scale tests of our new center storage.
We have made a reservation of the system between Tuesday and Thursday for this.
Abisko will therefor be drained of jobs during this time.
Wed, 2014-09-10 16:23 | Åke Sandgren
Thursday Aug 28th we will performe some large scale tests of our new center storage.
To do this we have reserved ALL nodes of Abisko starting 09:30 CEST
The tests are expected to take a couple of hours after which the system will be back for normal use.
We recommend submitting jobs with shorter runtimes which will then fit nicely into the slots that become available due to draining the system.
UPDATE 2014-08-28 20:30 CEST
We just had a power outage this morning. All jobs running on both Akka and Abisko were interrupted. We are currently in the process of restoring the systems.
We hope to have all Akka compute nodes back in service by the end of today.
Abisko nodes may take longer. It is possible that we can get them back up today, but they may not be available until Thursday morning, due to the scheduled file system update tomorrow.
Tue, 2014-08-19 11:40 | Daniel Petersen
Quantum Espresso 5.1 has now been installed on both Abisko and Akka.
Thu, 2014-08-14 08:03 | Åke Sandgren