How to Monitor Workload
Workload Status and Details
After submitting a workload, Workload Status and details of workload can be seen in Workloads page.
Types Of Workloads
Different types of Workload Status
Each workload goes through several different states after it is submitted
Created – The workload has been created in the system
Sent – The workload has been sent to the queue that user selected in the workload submission process
Pending – The workload is in a waiting state in the queue
Running – The workload has started running in the selected queue
Completed – The workload has successfully finished processing
Failed – A problem has occurred which has prevented the workload from completing successfully
Cancelled – The workload has been canceled by the user and stopped running
Finished - The interactive job that has been finished by user.
Workload Overview Page
Workload Overview page has the following tabs.
1. Overview tab
The details in overview tab can be viewed while the workload is running. It has details like the node on which the workload is running, real time cpu utilization, memory etc.
Once the workload completes, Overview tab will be disabled.
2. Parameters tab
This tab has the details entered while submitting the workload like application launched, queue selected, app configuration, number of GPUs, CPUs, memory etc.
3. SYSLOG tab
The system logs can be seen in SYSLOG tab while workload is running. Once workload finishes, syslog can be downloaded by clicking 'Show historical logs' button.
4. STDOUT tab
The output logs can be seen in STDOUT tab while workload is running. Once workload finishes, stdout log can be downloaded by clicking 'Download logs' button.
5. STDERR tab
The error logs can be seen in STDERR-tab while workload is running. Once workload finishes, stderr log can be downloaded by clicking 'Download logs' button.
6. Performance tab
Performance tab shows the telemetry