Skip to content

Home

The AMD Accelerator Cloud (AAC) is a private hosted platform that offers users access to the latest AMD hardware resources, tools, and ready-to-use application software. This platform facilitates the rapid and cost-effective development of solutions using AMD GPUs.

On this website, you will find comprehensive documentation on how to use the AMD Accelerator Cloud.

Users

The Users Section outlines the process of gaining access to AAC, changing or resetting a password, viewing the teams and queues assigned, and checking the list of workloads launched by the user.

User Files

User Files Section describes how users can upload files if required for their workload, as well as how to download or delete these files as needed

Applications

Applications section describes how users can create applications on AAC. This section can be referred if user has Developer role. Without developer role, applications cannot be created

Workloads

This section describes how to run, monitor, cancel, or rerun a workload. It also explains how to check the logs and metrics associated with a workload. Additionally, it provides instructions on how to run a workload using your own Docker

Slurm Cluster

This section provides a step-by-step guide for users on how to connect to the AAC environment using MobaXterm and PuTTY with a private key. This is relevant for users who have direct SSH access, i.e., those who have shared their public key with the AAC team. Note that this is not related to the AAC website.

The section includes guides on how to access the cluster, use Podman, run Megaron, NanoGPT, PyTorch multinode, RCCL tests, maintain long SSH sessions with Tmux and Screen, and set up a Conda environment.

FAQs

This section includes a list of frequently asked questions (FAQs) that can serve as a useful reference for users. It provides quick answers to common queries and can help clarify any doubts or issues that may arise during usage.