GPU Monitoring with ControlUp 8.7

    Introduction

    With ControlUp, you can monitor GPU performance in real time at the machine, session, or process level.

    Before version 8.7, ControlUp collected GPU metrics only for NVIDIA GPUs. From version 8.7, ControlUp collects data for all GPU models by using the built-in Windows GPU metrics. This means that the Real-Time Console and Solve display the same GPU metrics that you would see in Task Manager on the machine with the GPU installed.

    System Prerequisites

    Version 8.7 and later supports a broad range of graphics card manufacturers. The monitored machine with the GPU must meet one of the following requirements:

    • Non-NVIDIA GPU models require the Windows Display Driver Model (WDDM). To support non-NVIDIA graphics cards, the machine must run one of the following operating systems:
      • Windows Server 2019 or later
      • Windows 10 1709 or later
    • Physical Windows machine or virtual Windows machine with passthrough devices.
    • Any other Windows / VM configuration where GPU data is visible in Task Manager.
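
The operating system requirement for non-NVIDIA GPUs can be expressed as a build-number comparison. The following is a minimal sketch, not part of ControlUp: the function name is hypothetical, and it assumes the publicly documented build numbers 16299 (Windows 10 version 1709) and 17763 (Windows Server 2019).

```python
# Minimum Windows build numbers for the operating systems listed above.
WIN10_1709_BUILD = 16299   # Windows 10 version 1709
SERVER_2019_BUILD = 17763  # Windows Server 2019


def meets_non_nvidia_os_requirement(build: int, is_server: bool) -> bool:
    """Return True if the OS build satisfies the non-NVIDIA prerequisite.

    Hypothetical helper for illustration only; ControlUp performs its
    own checks and does not expose this function.
    """
    minimum = SERVER_2019_BUILD if is_server else WIN10_1709_BUILD
    return build >= minimum
```

On a Windows machine, the build number could be read with `sys.getwindowsversion().build` from the Python standard library. The check covers only the operating system version; it does not verify that a WDDM driver is installed.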

    ControlUp Prerequisites

    To use the new GPU feature, you need to:

    1. Deploy the ControlUp Agent on each machine that runs the GPU.
    2. On the same machines, set the following registry values:

    Path: HKEY_LOCAL_MACHINE\SOFTWARE\Smart-X\ControlUp\Agent\GPU\
    Value: IsGPUDisabled
    Type: REG_DWORD
    Value Data: 0

    Path: HKEY_LOCAL_MACHINE\SOFTWARE\Smart-X\ControlUp\Agent\GPU\
    Value: EnableNvidiaGPUCollection
    Type: REG_DWORD
    Value Data: 0
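
One way to prepare these values for import is to generate a .reg file. The sketch below is an illustration under stated assumptions rather than a ControlUp tool: the `build_reg_file` helper is hypothetical, and only the key path and value names are taken from this article.

```python
# Registry key from this article (no trailing backslash in .reg syntax).
GPU_KEY = r"HKEY_LOCAL_MACHINE\SOFTWARE\Smart-X\ControlUp\Agent\GPU"


def build_reg_file(values: dict[str, int], key: str = GPU_KEY) -> str:
    """Render REG_DWORD values as the text of a .reg file.

    Hypothetical helper for illustration; it only builds text and does
    not touch the registry.
    """
    lines = ["Windows Registry Editor Version 5.00", "", f"[{key}]"]
    for name, data in values.items():
        if not 0 <= data <= 0xFFFFFFFF:
            raise ValueError(f"{name}: value does not fit in a DWORD")
        # .reg files encode a DWORD as eight hexadecimal digits.
        lines.append(f'"{name}"=dword:{data:08x}')
    return "\n".join(lines) + "\n"


# The two values required by the prerequisites above.
reg_text = build_reg_file({"IsGPUDisabled": 0, "EnableNvidiaGPUCollection": 0})
```

The resulting text can be saved to a file with a .reg extension and imported on the agent machine, for example with `reg import`. Test it on a single machine before distributing it more widely.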

    Tip for adding registry values on multiple machines

    By using the Controllers pane, you can set the registry values on multiple machines simultaneously.

    GPU Metrics

    ControlUp provides a number of GPU metrics that help you monitor the GPUs of your machines. The table below lists all GPU-related metrics implemented in our products and the tab in which each metric appears:

    Metric Name | Description | Tab
    Average GPU Frame Buffer Usage | Average frame buffer usage for all GPUs | Folders
    Average GPU Usage | Average usage of all GPUs | Folders
    GPU Architecture | GPU architecture | Machine
    GPU Available Memory | GPU available memory in megabytes (MB) | Machine
    GPU CPU Utilization | GPU CPU utilization | Session
    GPU Decoder Utilization | GPU decoder utilization | Machine
    GPU Driver version | Current version of the installed GPU driver | Machine
    GPU Encoder Utilization | GPU encoder utilization | Machine
    GPU Frame Buffer Memory Utilization | GPU frame buffer memory utilization | Session
    GPU Frame Buffer Size | Size of memory assigned to the GPU | Machine
    GPU Frame Buffer Usage | Used size of the physical graphics card frame buffer memory in percent | Machine
    GPU License Port | Primary license server port | Machine
    GPU License Server | Primary license server name | Machine
    GPU Memory Usage | GPU memory usage in percent | Machine
    GPU Model | GPU name or GRID GPU profile type | Machine
    GPU Number of Cores | Number of CUDA cores | Machine
    GPU Usage | GPU usage in percent | Machine
    GPU Utilization | GPU utilization | Process
    GPU Video Decoder Usage | GPU video decoder usage in percent | Session
    GPU Video Encoder Usage | GPU video encoder usage in percent | Session
    Machines with GPU | Number of machines with GPUs | Folders

    Collecting Data from NVIDIA API (Optional)

    Windows is the default data source for collecting GPU metrics in version 8.7. To switch back to collecting data from the NVIDIA API, set the following registry value on the machine that has the NVIDIA GPU installed:

    Path: HKEY_LOCAL_MACHINE\SOFTWARE\Smart-X\ControlUp\Agent\GPU\
    Value: EnableNvidiaGPUCollection
    Type: REG_DWORD
    Value Data: 1
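
The combined effect of the two registry values described in this article can be summarized as a small decision function. This is a sketch of our reading of the settings, not the agent's actual logic: the function name and the returned labels are hypothetical, and treating any non-zero IsGPUDisabled as "collection off" is an assumption based on the value's name.

```python
def gpu_data_source(is_gpu_disabled: int, enable_nvidia_collection: int) -> str:
    """Map the two registry values to the data source this article describes.

    Hypothetical helper; the labels returned here are illustrative and
    are not ControlUp identifiers.
    """
    if is_gpu_disabled != 0:
        # Assumption: a non-zero IsGPUDisabled turns GPU collection off.
        return "disabled"
    if enable_nvidia_collection == 1:
        # Opt-in switch back to the NVIDIA API.
        return "nvidia-api"
    # Default in version 8.7: Windows built-in GPU metrics.
    return "windows"
```

For example, `gpu_data_source(0, 0)` corresponds to the configuration in the ControlUp prerequisites, and `gpu_data_source(0, 1)` to the optional NVIDIA API configuration.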

    Known Issue: GPU Metrics

    Note

    This issue was fixed in version 8.8. We recommend that you upgrade to 8.8 to avoid it.

    In version 8.7, GPU metrics are displayed even when no GPU is enabled.

    Workaround
    Set the following registry value on each agent machine with no GPU enabled:

    Path: HKEY_LOCAL_MACHINE\SOFTWARE\Smart-X\ControlUp\Agent\GPU\
    Value: EnableNvidiaGPUCollection
    Type: REG_DWORD
    Value Data: 0

    If you need to apply the registry changes on multiple machines, use the Controllers pane.

