system news

2016-09-06: Maintenance on the /pfs/nobackup filesystem, Abisko will be stopped.

  • Posted on: 12 September 2016
  • By: bbrydsoe

On September 6th between 08:00 and 17:00 we will perform some necessary software updates of the servers that deliver the /pfs/nobackup file system.

This means that the batch nodes of Abisko must be drained of jobs and the file system unmounted on all nodes, including the login node.

During days leading up to the maintenance window, the system will only accept to run shorter and shorter jobs. It is therefore benficial to submit jobs with short runtimes. 

AFS problems

  • Posted on: 18 August 2016
  • By: admin

We are experiencing some problems with one of the AFS-servers. We are working on it and hopefully it should be back online soon. 

As a security measurment we have stopped the batch queue (no new jobs will start). The queue will be restored as soon as the AFS server is back online.

Abisko down due to major power failure in Umeå (power and /pfs/nobackup are now back)

  • Posted on: 18 August 2016
  • By: admin

The batch nodes of Abisko is currently down due to a major power failure in Umeå.

The lightning looks to have caused a short-circuit in one of the circuit breakers feeding Abisko.

(To give the electricans a better chance to solve the problem, all power feeds are kept off for the time being, so Abisko is currently completely without power.)

We are currently waiting for electricians to arrive on site.

Pfs problems (solved)

  • Posted on: 10 May 2016
  • By: admin

We are experiencing some problem with /pfs. We are working on it and hope to get it back online soon.

During this problem we have stopped the batch queue (no new jobs will start). As soon as pfs is back we will start up the queue again.

*UPDATE 2016-05-09 16:30*
Pfs back online. Batch queue started again.

Mon, 2016-05-09 13:35 | Roger Oskarsson

Abisko down due to upgrade of the Lustre filesystem (/pfs/nobackup) 2016-05-02 - 2016-05-04 (at least)

  • Posted on: 10 May 2016
  • By: admin

Our long awaited upgrade of the Lustre file system to double the size and performance is finally approaching.

On May 2:nd (2016-05-02) Abisko will be down (no jobs running) and the login node will be taken offline.
This is necessary to be able to add the new hardware and do the required recabling/rearranging of the system.

The down time will be at least two days long.

Mon, 2016-04-25 15:21 | Åke Sandgren

Abisko down due to maintenance of the cooling system 20160413-14 (*FINISHED*)

  • Posted on: 29 April 2016
  • By: admin

Abisko will be down due to maintenance on the cooling system 20160413 and 20160414.

During that time we will also perform some maintenance on the /pfs/nobackup file system.

This means that /pfs/nobackup will be unavailable both of those days.

No jobs will be allowed to run during this maintenance window.

*UPDATE 2016-04-14 19:00*
The system is now back online again.

Mon, 2016-04-04 08:39 | Åke Sandgren

Abisko now back online after the maintenance

  • Posted on: 29 April 2016
  • By: admin

Abisko is now back online after a two day maintenance window.

We have found some discrepancies regarding the quota information that are not yet fixed.

Some users have large differences between what the quota system thinks and what is actually there in the file system. This may cause some problems with hitting the file quota limit. If this happens please send a mail to support@hpc2n.umu.se and we will deal with it.

We will try to fix this problem during the next maintenance window in May.

Pages

Updated: 2017-12-06, 15:21