Skip to content

RocHPCG Singularity Application

This document shows how to run rochpcg singularity application.

How To Run RocHPCG Singularity Application

Step 1: To start creating a workload, sign into the AAC web platform.

Select Application

Click on Applications. Select HPCG.

Select Container

Select the desired RocHPCG version with desired container type as docker/singularity.

Note: In this case, we have selected rocHPCG 3.1.0_97 version and container as singularity.

New Workload

Click on New Workload button available on the top right corner.

Select Team

In case the user is assigned to more than one team, a pop window will required to select one of the customer teams to which the user is part of. If there is just one team assigned to the user, it will not be required.

Note: In this case, we have selected Team as AMD Internal.

Click Start new workload button.

Click Next button to continue.

Select Resources

Click Next to continue with default script.

Note: Default script is for 8 GPUs

mpirun --mca pml ucx -np 8 hpcg 280 280 280 1860

Select the allowed run time. The number of GPUs should be 8. Here, we have selected the run time as 1 hour, number of GPU’s as 8 and telemetry is enabled.Click on Next button.

Note: The time for which workload is allowed to run should be specified in the Maximum allowed runtime field. By default 1 hour will be selected.

If maximum allowed run time is 1 hour, it implies, workload will run for 1 hour and then it will be automatically stopped after 1 hour as it will not be allowed to exceed Maximum allowed runtime.

Based on the time required for workload, user should change the Maximum allowed runtime. Once workload is launched, user cannot change the total workload time. It has to be configured in the current step.

Select Compute

Select the cluster and desired queue to run the job. In this case 1CN96C8G1H_4IB_MI250_Ubuntu22 was selected. Click on Next.

Review Workload Submission

Review all the configurations selected and click on Run Workload.

Monitor Workload

Once the application execution is successful then the status gets changed into Completed the user can check the STDOUT and STDERR logs by clicking on respective tabs.The Performace tab shows the telemetry collected.

The final score in the STDOUT tab is the performance of RocHPCG application.