Welcome to the COSmology MAchine (COSMA)! Here, you will find some information about the COSMA5, COSMA6, COSMA7 and COSMA8 HPC facilities.
COSMA has been in existence since July 2001 and is now in its 8th generation.
COSMA5 is a Durham system for ICC users and collaborators.
COSMA6 and COSMA7 are the current DiRAC facilities hosted by Durham. COSMA6 was introduced on 1st April 2017 as the DiRAC-2.5 Data Centric service. COSMA7 was introduced as the DiRAC-2.5x system in May 2018, and expanded by DiRAC-2.5y in January 2019, and as DiRAC-2.5z in April 2019.
COSMA8 is a new DiRAC-3 system which will become operational in September 2021. The DiRAC-3 prototype system (COSMA-8 prototype) became operational in October 2020, and was available for a number of months before integration with the full DiRAC-3 system began.
The DINE experimental system is also part of COSMA.
DiRAC is the UK's integrated supercomputing facility for theoretical modelling and HPC-based research in particle physics, astronomy and cosmology. For more information about DiRAC please visit the DiRAC web pages http://www.dirac.ac.uk.
The actual load (taken every 15 minutes) on the DiRAC systems can be seen on the SAFE pages: http://dirac-ops.epcc.ed.ac.uk/
The COSMA systems run security hardened DiRAC Linux.
The login details are:
cosma5: login.cosma.dur.ac.uk or login5.cosma.dur.ac.uk (round-robin to login5a and login5b)
cosma6: login6.cosma.dur.ac.uk (single node, login6a)
cosma7: login7.cosma.dur.ac.uk (round-robin to login7a, login7b and login7c)
cosma8: login8.cosma.dur.ac.uk (round-robin to login8a, login8b)
All systems use a global slurm batch system. Submission to any queue (cosma, cosma6, cosma7, cosma8) can be done from any login node.
COSMA5 is no longer a DiRAC facility. Therefore, DiRAC users should use COSMA6, 7 or 8 as appropriate for their project.
COSMA has 3 periods of scheduled downtime per year, lasting up to a week, though typically the affected period is shorter. Current scheduled periods are:
7-11th June 2021
4-8th October 2021
7-11th February 2022
Ogden Centre for Fundamental Physics - West,
Department of Physics,
Durham DH1 3LE
20/4/21: AMD Milan node (with 128 cores, 1TB RAM and MI100 GPU) goes live
10/4/21: COSMA8 power-on
1/4/21: COSMA8 integration starts
1/3/21: COSMA becomes a Dell Centre of Excellence
4/2/21: COSMA back from downtime
28/1/21: COSMA network link severed between Durham and Leeds
26/1/21: Updated OneAPI module installed
16/12/20: DiRAC-3 system ordered
5/10/20: COSMA downtime starts
21/9/20: COSMA8 Compute nodes powered up, with novel on-chip cooling
7/8/20: COSMA8 service nodes brought into production
26/6/20: COSMA5 back in operation
24/6/20: COSMA5 down - failure of machine room CIS cooling equipment
3/6/20: COSMA downtime completed
15/5/20: COSMA seems to have survived (so far) the world-wide HPC attacks
14/5/20: All users must regenerate SSH keys and upload to SAFE
17/4/20: GCC 9.3 and Intel 2020 (update 1) compilers now available for use
1/4/20: BlueField cluster available for users (first 4 nodes)
16/3/20: x2go installed on login nodes to aid remote working during COVID-19
2/3/20: 16-node BlueField delivered and racked (awaiting power cables)
28/2/20: New database server for virgodb delivered.
5/2/20: Permanent host for V100 GPU cards identified.
5/2/20: New COSMA5 storage online - from nearly 30kW down to 1.5kW
5/2/20: COSMA is alive again!
3/2/20: COSMA in downtime... back soon
25/11/19: New COSMA6 storage in service across all of COSMA
19/11/19: New COSMA6 storage in service on COSMA6 nodes
11/11/19: New COSMA6 storage migration ongoing