Use Azure Batch to run large-scale parallel and high-performance computing (HPC) batch jobs efficiently in Azure.
Azure Batch does the following:
- Creates and manages a pool of compute nodes (virtual machines).
- Installs the applications you want to run.
- Schedules jobs to run on the nodes.
There is no cluster or job scheduler software to install, manage, or scale.
Use the Azure public cloud integration to discover and collect metrics against the Azure service.
External reference
Setup
To set up the Azure integration and discover the Azure service, go to Azure Integration Discovery Profile and select Batch Accounts
.
Event support
- Supported
- Configurable in OpsRamp Azure Integration Discovery Profile.
Supported metrics
OpsRamp Metric | Metric Display Name | Unit | Aggregation Type |
---|---|---|---|
azure_core_count Total number of dedicated cores in the batch account. | Dedicated Core Count | Count | Total |
azure_total_node_count Total number of dedicated nodes in the batch account. | Dedicated Node Count | Count | Total |
azure_low_priority_core_count Total number of low priority cores in the batch account. | LowPriority Core Count | Count | Total |
azure_total_low_priority_node_count Total number of low priority nodes in the batch account. | Low Priority Node Count | Count | Total |
azure_creating_node_count Number of nodes being created. | Creating Node Count | Count | Total |
azure_starting_node_count Number of nodes starting. | Starting Node Count | Count | Total |
azure_waiting_for_start_task_node_count Number of nodes waiting for the Start Task to complete. | Waitng For Start Task Node Count | Count | Total |
azure_start_task_failed_node_count Number of nodes where the Start Task has failed. | Start Task Failed Node Count | Count | Total |
azure_idle_node_count Number of idle nodes. | Idle Node Count | Count | Total |
azure_offline_node_count Number of offline nodes. | Offline Node Count | Count | Total |
azure_rebooting_node_count Number of rebooting nodes. | Rebooting Node Count | Count | Total |
azure_remaining_node_count Number of reimaging nodes. | Reimaging Node Count | Count | Total |
azure_running_node_count Number of running nodes. | Running Node Count | Count | Total |
azure_leaving_pool_node_count Number of nodes leaving the pool. | Leaving Pool Node Count | Count | Total |
azure_unusable_node_count Number of unusable nodes. | Unusable Node Count | Count | Total |
azure_preempted_node_count Number of preempted nodes. | Preempted Node Count | Count | Total |
azure_task_start_event Number of tasks that have started. | Task Start Events | Count | Total |
azure_task_complete_event Total number of tasks that have completed. | Task Complete Events | Count | Total |
azure_task_fail_event Total number of tasks that have completed in a failed state. | Task Fail Events | Count | Total |
azure_pool_create_event Total number of pools created. | Pool Create Events | Count | Total |
azure_pool_resize_start_event Total number of pool resizes that have started. | Pool Resize Start Events | Count | Total |
azure_pool_resize_complete_event Total number of pool resizes that have completed. | Pool Resize Complete Events | Count | Total |
azure_pool_delete_start_event Total number of pool deletes that have started. | Pool Delete Start Events | Count | Total |
azure_pool_delete_complete_event Total number of pool deletes that have completed. | Pool Delete Complete Events | Count | Total |
azure_batchaccount_job_delete_complete_event Total number of jobs successfully deleted. | Job Delete Complete Events | Count | Total |
azure_batchaccount_job_delete_start_event Total number of jobs requested to be deleted. | Job Delete Start Events | Count | Total |
azure_batchaccount_job_disable_complete_event Total number of jobs successfully disabled. | Job Disable Complete Events | Count | Total |
azure_batchaccount_job_disable_start_event Total number of jobs requested to be disabled. | Job Disable Start Events | Count | Total |
azure_batchaccount_job_start_event Total number of jobs successfully started. | Job Start Events | Count | Total |
azure_batchaccount_job_terminate_complete_event Total number of jobs successfully terminated. | Job Terminate Complete Events | Count | Total |
azure_batchaccount_job_terminate_start_event Total number of jobs requested to be terminated. | Job Terminate Start Events | Count | Total |
cloud_instance_state Status/State | Status/State | n/a | n/a |