AMD GPU systems
COSMA has a MI100 GPU in server ga004.
THere are also six AMD MI50 GPUs hosted in the ga003 server. These are interlinked with a 4x and 2x InfinityFabric link between the GPUs, to enable direct data transfer. However, unless you specifically need this, please use the newer MI100 system instead.
Relevant software is in /opt/rocm* and the AMD AOCC compiler is available as a module (module load aocc)
The rocm_smi.py command will provide information about the GPUs.
These systems are available for general use. If you use them, feedback would be welcome, in particular around performance, and the software environment.