Collector Type: Agent
Category: Application Monitors
Application Name: MesosMaster
Global Template Name: Mesos Master Monitoring Template
Parameters
| Name | Description | Default Value |
|---|---|---|
| Host IP Address | The host on which the Mesos Master is running. | 127.0.0.1 |
| Port | The port on which Mesos is running. | 8080 |
| Username | The username of the server, if authentication is enabled. | NA |
| Password | The password of the server, if authentication is enabled. | NA |
Note: All fields are mandatory. Use the default values wherever applicable.
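To check the parameter values before applying the template, it can help to build the request they imply and try it by hand. The sketch below is illustrative only: the function name `build_probe_request`, the `/metrics/snapshot` path, the use of plain HTTP, and the use of HTTP Basic authentication are assumptions, not documented behavior of the monitor. It treats the default value `NA` as "no credentials".

```python
import base64
from urllib.request import Request


def build_probe_request(
    host: str = "127.0.0.1",
    port: int = 8080,
    username: str = "NA",
    password: str = "NA",
    path: str = "/metrics/snapshot",  # assumed endpoint, adjust for your setup
) -> Request:
    """Build (but do not send) an HTTP request from the template parameters."""
    req = Request(f"http://{host}:{port}{path}")
    # "NA" is the template default and is treated here as "no authentication".
    if username != "NA" and password != "NA":
        raw = f"{username}:{password}".encode("utf-8")
        token = base64.b64encode(raw).decode("ascii")
        # Assumption: the server accepts HTTP Basic authentication.
        req.add_header("Authorization", f"Basic {token}")
    return req


if __name__ == "__main__":
    probe = build_probe_request()
    print(probe.full_url)
```

Passing the returned object to `urllib.request.urlopen(probe, timeout=5)` sends the request; a connection error or an HTTP 401 response at that point suggests the host, port, or credentials need adjusting.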
Collected Metrics
| Metric Name | Display Name |
|---|---|
| mesos.framework.cpu | Mesos Framework CPU |
| mesos.framework.mem | Mesos Framework Memory |
| mesos.framework.disk | Mesos Framework Disk |
| marathon.apps | Marathon Applications Count |
| marathon.deployments | Marathon Deployments |
| marathon.backoffFactor | Marathon Backoff Factor |
| marathon.backoffSeconds | Marathon Backoff Seconds |
| marathon.cpus | Marathon CPUs |
| marathon.disk | Marathon Disk |
| marathon.instances | Marathon Instances |
| marathon.mem | Marathon Memory |
| marathon.taskRateLimit | Marathon Task Rate Limit |
| marathon.tasksRunning | Marathon Tasks Running |
| marathon.tasksStaged | Marathon Tasks Staged |
| marathon.tasksHealthy | Marathon Tasks Healthy |
| marathon.tasksUnhealthy | Marathon Tasks Unhealthy |
| marathon.queue.size | Marathon Queue Size |
| marathon.queue.count | Marathon Queue Count |
| marathon.queue.delay | Marathon Queue Delay |
| marathon.queue.offers.processed | Marathon Queue Offers Processed |
| marathon.queue.offers.unused | Marathon Queue Offers Unused |
| marathon.queue.offers.reject.last | Marathon Queue Offers Reject Last |
| marathon.queue.offers.reject.launch | Marathon Queue Offers Reject Launch |
| mesos.registrar.registry_size_bytes | Mesos Registrar registry_size_bytes |
| mesos.registrar.state_store_ms.p90 | Registrar state_store_ms.p90 |
| mesos.registrar.state_store_ms.p99 | Registrar state_store_ms.p99 |
| mesos.registrar.queued_operations | Mesos Registrar queued_operations |
| mesos.registrar.state_store_ms.p999 | Registrar state_store_ms.p999 |
| mesos.registrar.state_store_ms.p95 | Registrar state_store_ms.p95 |
| mesos.registrar.state_store_ms.p9999 | Registrar state_store_ms.p9999 |
| mesos.invalid_status_update_acknowledgements | Number of invalid status update acknowledgements |
| mesos.registrar.state_store_ms.p50 | Registrar state_store_ms.p50 |
| mesos.stats.elected | Elected as master |
| mesos.registrar.log.recovered | Registrar log recovered |
| mesos.master.count | Registry write count |
| mesos.role.disk | Mesos Role Disk |
| mesos.role.cpu | Mesos Role CPU |
| mesos.role.mem | Mesos Role Memory |
| mesos.cluster.slave_registrations | Slave registrations |
| mesos.cluster.mem_percent | Allocated memory percent |
| mesos.cluster.tasks_error | Invalid tasks |
| mesos.cluster.disk_total | Disk space total |
| mesos.cluster.tasks_finished | Tasks finished |
| mesos.cluster.tasks_killed | Tasks killed |
| mesos.cluster.slave_shutdowns_scheduled | Slave shutdowns scheduled |
| mesos.cluster.frameworks_active | Frameworks active |
| mesos.cluster.frameworks_connected | Frameworks connected |
| mesos.cluster.slaves_inactive | Agents inactive |
| mesos.cluster.slaves_unreachable | Agents unreachable |
| mesos.cluster.gpus_used | Number of GPUs used |
| mesos.cluster.mem_total | Memory total |
| mesos.cluster.frameworks_inactive | Frameworks inactive |
| mesos.cluster.event_queue_http_requests | Event queue HTTP requests |
| mesos.cluster.tasks_starting | Tasks starting |
| mesos.cluster.slave_removals | Slave removals |
| mesos.cluster.cpus_total | CPUs total |
| mesos.cluster.tasks_staging | Tasks staging |
| mesos.cluster.mem_used | Allocated memory |
| mesos.cluster.slaves_active | Agents active |
| mesos.cluster.gpus_total | GPUs total |
| mesos.cluster.disk_percent | Allocated disk space percent |
| mesos.cluster.frameworks_disconnected | Frameworks disconnected |
| mesos.cluster.invalid_status_updates | Invalid status updates |
| mesos.cluster.valid_framework_to_executor_messages | Valid framework to executor messages |
| mesos.cluster.tasks_failed | Failed tasks |
| mesos.cluster.tasks_lost | Tasks lost |
| mesos.cluster.event_queue_messages | Event queue messages |
| mesos.cluster.slave_reregistrations | Slave reregistrations |
| mesos.cluster.slaves_connected | Agents connected |
| mesos.cluster.valid_status_update_acknowledgements | Valid status update acknowledgement messages |
| mesos.cluster.slave_shutdowns_canceled | Slave shutdowns canceled |
| mesos.cluster.slaves_disconnected | Agents disconnected |
| mesos.cluster.cpus_used | Number of CPUs used |
| mesos.cluster.outstanding_offers | Outstanding resource offers |
| mesos.cluster.disk_used | Allocated disk space |
| mesos.cluster.dropped_messages | Dropped messages |
| mesos.cluster.invalid_framework_to_executor_messages | Invalid framework to executor messages |
| mesos.cluster.gpus_percent | Allocated GPUs percent |
| mesos.cluster.tasks_running | Tasks running |
| mesos.cluster.cpus_percent | Allocated CPUs percent |
| mesos.cluster.event_queue_dispatches | Dispatches in the event queue |
| mesos.cluster.valid_status_updates | Valid status updates |
| dcos.health.admin.router.agent | Admin router agent service health |
| dcos.health.log.agent | Agent Log service health |
| dcos.health.marathon | Marathon service health |
| dcos.health.telegraf | Telegraf service health |
| dcos.health.admin.router.master | Admin router master service health |
| dcos.health.checks.api.socket | Checks API socket health |
| dcos.health.checks.timer | Checks Timer service health |
| dcos.health.history | History service health |
| dcos.health.log.master.socket | Master Log socket health |
| dcos.health.net.watchdog | Net Watchdog service health |
| dcos.health.gc | Docker GC |
| dcos.health.resolv.timer | Generate resolv.conf Timer service health |
| dcos.health.mesos.master | Mesos Master service health |
| dcos.health.authentication | Authentication service health |
| dcos.health.gc.timer | Docker GC Timer |
| dcos.health.diagnostics.agent | Diagnostics Agent service health |
| dcos.health.jobs | Jobs service health |
| dcos.health.net | Net service health |
| dcos.health.rexray | REX-Ray service health |
| dcos.health.diagnostics.agent.socket | Diagnostics Agent socket health |
| dcos.health.logrotate.agent | Agent Logrotate service health |
| dcos.health.logrotate.master | Master Logrotate service health |
| dcos.health.logrotate.master.timer | Master Logrotate Timer service health |
| dcos.health.mesos.agent.public | Mesos Public Agent service health |
| dcos.health.poststart.checks | Poststart Checks service health |
| dcos.health.signal | Signal service health |
| dcos.health.signal.timer | Signal Timer service health |
| dcos.health.resolv | Generate resolv.conf service health |
| dcos.health.telegraf.socket | Telegraf socket health |
| dcos.health.log.master | Master Log service health |
| dcos.health.exhibitor | Exhibitor service health |
| dcos.health.checks.api | Checks API service health |
| dcos.health.component.package.manager | Component Package Manager (Pkgpanda) service health |
| dcos.health.log.agent.socket | Agent Log socket health |
| dcos.health.package.manager | Package Manager service health |
| dcos.health.logrotate.agent.timer | Agent Logrotate Timer service health |
| dcos.health.mesos.agent | Mesos Agent service health |
| dcos.health.mesos.dns | Mesos DNS service health |
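Many of the `mesos.cluster.*` and `mesos.registrar.*` names in the table resemble keys that a Mesos master reports in its metrics snapshot JSON, such as `master/tasks_running` or `registrar/state_store_ms/p90`. The sketch below shows one way such keys could be translated into the dotted names used here. The mapping rule and the function names `to_template_name` and `rename_snapshot` are assumptions made for illustration, not the monitor's documented logic, and names such as `mesos.stats.elected` do not follow this pattern. The sample values are invented.

```python
from typing import Dict, Optional

# Assumed correspondence between snapshot key prefixes and template prefixes.
PREFIX_MAP = {
    "master": "mesos.cluster",
    "registrar": "mesos.registrar",
}


def to_template_name(snapshot_key: str) -> Optional[str]:
    """Translate e.g. 'master/tasks_running' to 'mesos.cluster.tasks_running'.

    Returns None for keys whose prefix is not in PREFIX_MAP.
    """
    prefix, _, rest = snapshot_key.partition("/")
    if not rest or prefix not in PREFIX_MAP:
        return None
    return PREFIX_MAP[prefix] + "." + rest.replace("/", ".")


def rename_snapshot(snapshot: Dict[str, float]) -> Dict[str, float]:
    """Keep only the keys that can be translated, under their dotted names."""
    renamed = {}
    for key, value in snapshot.items():
        name = to_template_name(key)
        if name is not None:
            renamed[name] = value
    return renamed


if __name__ == "__main__":
    # Invented sample values, for illustration only.
    sample = {
        "master/tasks_running": 12.0,
        "registrar/state_store_ms/p90": 4.5,
        "system/load_1min": 0.7,
    }
    for name, value in sorted(rename_snapshot(sample).items()):
        print(name, value)
```

A lookup table keeps the prefix rule in one place, so a prefix that turns out to map differently in a given deployment can be corrected without touching the translation function.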