Frequently Asked Questions
How do I launch or run a workload?
How do I change or increase workload run time?
You cannot change or increase the total run time after the workload is launched. Set it before launching.
In the Select Resources step, use Maximum allowed runtime to set how long the workload can run.
The default is 1 hour. If you set 1 hour, the workload runs for 1 hour and then stops automatically. Set Maximum allowed runtime according to how long you need the workload to run.
Example: Setting 2 hours and 30 minutes allows the workload to run for up to 2 hr 30 min.
See How to run a workload for full steps.
Why can't I see any queue while launching a workload?
Check the following:
-
Team assignment: On your user profile page, check the Teams section. If you are not assigned to a team, contact your sponsor. See Check team assigned.
-
Queues: On the queues page, confirm that queues are visible. If not, contact your sponsor to get queues assigned to your team. See Check queues assigned.
-
GPU count: Queue availability depends on the number of GPUs you select in Select Resources (must be ≤ 8). If queues still do not appear, contact your sponsor.
Why is my workload or job in pending state?
A workload stays in Pending until resources are available. When you launch a workload you select a queue; each queue maps to a set of nodes. If those nodes are in use, your workload waits. When the required nodes are free, the workload starts.
Note: A workload can run for up to one week. In rare cases, if all nodes are busy with long-running workloads, yours may stay pending until capacity frees up.
Why can't I rerun a workload?
Possible reasons:
- Team cannot use the application – Application is not on your team. Request access.
- You do not own all input files – You must own the job's input files.
- Application is disabled – Enable the application.
- You do not have permission – Application is not on your team. Request access.
- Application was deleted – It can no longer be used.
- Container not validated – The application's container has not been validated.
Why can't I see the performance tab after the workload is completed?
The Performance tab appears only if Telemetry enabled was turned on in the Select Resources step when you launched the workload.
See the Select Resources step in How to run a workload.
Why can't I see the Application configuration step while launching a workload?
Interactive applications (e.g. PyTorch, TensorFlow, Ubuntu ROCm) do not have an Application Configuration step, so you will not see it.
How do I SSH into the container for Ubuntu ROCm applications?
Launch an Ubuntu ROCm application. When Interactive endpoints appear on the workload overview page, open STDOUT and copy the username and password.
Click Connect, use the SSH URL shown, replace <USER> with the username from STDOUT, and enter the password when prompted in your terminal or SSH client (e.g. PuTTY, MobaXterm).
See Launch Ubuntu ROCm application for full steps.
How do I connect to JupyterLab for AI/ML applications (PyTorch, TensorFlow, etc.)?
Launch a PyTorch (or similar) application. When Interactive endpoints are available, click Connect.
Copy the token from the STDOUT tab and use it to connect.
Alternatively, click Connect and when asked for a token or password, use the value from the Secret key field.
See Launch PyTorch application for full steps.
What does "Invalid account or account/partition combination specified" mean?
It means you do not have access to the nodes behind the queue you specified. For example:
salloc: error: Job submit/allocate failed: Invalid account or account/partition combination specified.sbatch: error: Batch job submission failed: Invalid account or account/partition combination specified.
Copy-pasting commands can sometimes introduce invalid characters. Try typing the commands manually.
What does "invalid partition specified" mean?
The queue name you used does not exist. Run sinfo -o "%P%" to list valid queues.
How do I fix “rocminfo: command not found” or “Command 'rocminfo' not found … Please ask your administrator.”?
Load the ROCm Environment “module load rocm-6.1.2” and retry.
How do I check my role?
Open your user profile page. Your role is shown in the Organization panel.







