Collector Type: Agent
Category: Application Monitors
Application Name: MesosMaster
Global Template Name: Mesos Master Monitoring Template
Parameters
Names | Description | Default Value |
---|---|---|
Host IP Address | The host on which Monitd is running. | 127.0.0.1 |
Port | The port on which Mesos is running. | 8080 |
Username | The username of the server, if authentication is enabled. | NA |
Password | The password of the server, if authentication is enabled. | NA |
Note: All field attributes are mandatory. Use default values wherever applicable.
Collected Metrics
Metric Name | Display Name |
---|---|
mesos.framework.cpu | Mesos Framework CPU |
mesos.framework.mem | Mesos Framework Memory |
mesos.framework.disk | Mesos Framework Disk |
marathon.apps | Marathon Applications Count |
marathon.deployments | Marathon Deployments |
marathon.backoffFactor | Marathon Backoff Factor |
marathon.backoffSeconds | Marathon Backoff Seconds |
marathon.cpus | Marathon CPUs |
marathon.disk | Marathon DISK |
marathon.instances | Marathon Instances |
marathon.mem | Marathon Memory |
marathon.taskRateLimit | Marathon Task Rate Limit |
marathon.tasksRunning | Marathon Task Running |
marathon.tasksStaged | Marathon Task Staged |
marathon.tasksHealthy | Marathon Tasks Healthy |
marathon.tasksUnhealthy | Marathon Tasks Unhealthy |
marathon.queue.size | Marathon Queue Size |
marathon.queue.count | Marathon Queue Count |
marathon.queue.delay | Marathon Queue Delay |
marathon.queue.offers.processed | Marathon Queue Offer Processed |
marathon.queue.offers.unused | Marathon Queue Offers Unused |
marathon.queue.offers.reject.last | Marathon Queue Offers Reject Last |
marathon.queue.offers.reject.launch | Marathon Queue Offers Reject Launch |
mesos.registrar.registry_size_bytes | Mesos Registrar registry_size_bytes |
mesos.registrar.state_store_ms.p90 | Registrar state_store_ms.p90 |
mesos.registrar.state_store_ms.p99 | Registrar state_store_ms.p99 |
mesos.registrar.queued_operations | Mesos Registrar queued_operations |
mesos.registrar.state_store_ms.p999 | Registrar state_store_ms.p999 |
mesos.registrar.state_store_ms.p95 | Registrar state_store_ms.p95 |
mesos.registrar.state_store_ms.p9999 | Registrar state_store_ms.p9999 |
mesos.invalid_status_update_acknowledgements | Number of invalid status update acknowledgements |
mesos.registrar.state_store_ms.p50 | Registrar state_store_ms.p50 |
mesos.stats.elected | Elected as master |
mesos.registrar.log.recovered | Registrar log recovered |
mesos.master.count | Registry write count |
mesos.role.disk | Mesos Role Disk |
mesos.role.cpu | Mesos Role CPU |
mesos.role.mem | Mesos Role Memory |
mesos.cluster.slave_registrations | Slave registrations |
mesos.cluster.mem_percent | Allocated memory percent |
mesos.cluster.tasks_error | Invalid tasks |
mesos.cluster.disk_total | Disk space total |
mesos.cluster.tasks_finished | Tasks finished |
mesos.cluster.tasks_killed | Tasks killed |
mesos.cluster.slave_shutdowns_scheduled | Slave shutdowns scheduled |
mesos.cluster.frameworks_active | Frameworks active |
mesos.cluster.frameworks_connected | Frameworks connected |
mesos.cluster.slaves_inactive | Agents inactive |
mesos.cluster.slaves_unreachable | Agents unreachable |
mesos.cluster.gpus_used | Number of GPUs used |
mesos.cluster.mem_total | Memory total |
mesos.cluster.frameworks_inactive | Frameworks inactive |
mesos.cluster.event_queue_http_requests | Event queue HTTP requests |
mesos.cluster.tasks_starting | Tasks starting |
mesos.cluster.slave_removals | Slave removals |
mesos.cluster.cpus_total | CPUs total |
mesos.cluster.tasks_staging | Tasks staging |
mesos.cluster.mem_used | Allocated memory |
mesos.cluster.slaves_active | Agents active |
mesos.cluster.gpus_total | GPUs total |
mesos.cluster.disk_percent | Allocated disk space percent |
mesos.cluster.frameworks_disconnected | Frameworks disconnected |
mesos.cluster.invalid_status_updates | Invalid status updates |
mesos.cluster.valid_framework_to_executor_messages | Valid framework to executor messages |
mesos.cluster.tasks_failed | Failed tasks |
mesos.cluster.tasks_lost | Tasks lost |
mesos.cluster.event_queue_messages | Event queue messages |
mesos.cluster.slave_reregistrations | Slave reregistrations |
mesos.cluster.slaves_connected | Agents connected |
mesos.cluster.valid_status_update_acknowledgements | Valid status update acknowledgement messages |
mesos.cluster.slave_shutdowns_canceled | Slave shutdowns canceled |
mesos.cluster.slaves_disconnected | Agents disconnected |
mesos.cluster.cpus_used | Number of CPUs used |
mesos.cluster.outstanding_offers | Outstanding resource offers |
mesos.cluster.disk_used | Allocated disk space |
mesos.cluster.dropped_messages | Dropped messages |
mesos.cluster.invalid_framework_to_executor_messages | Invalid framework to executor messages |
mesos.cluster.gpus_percent | Allocated GPUs percent |
mesos.cluster.tasks_running | Tasks running |
mesos.cluster.cpus_percent | Allocated CPUs percent |
mesos.cluster.event_queue_dispatches | Dispatches in the event queue |
mesos.cluster.valid_status_updates | Valid status updates |
dcos.health.admin.router.agent | Admin router agent service health |
dcos.health.log.agent | Agent Log service health |
dcos.health.marathon | Marathon service health |
dcos.health.telegraf | Telegraf service health |
dcos.health.admin.router.master | Admin router master service health |
dcos.health.checks.api.socket | Checks API socket health |
dcos.health.checks.timer | Checks Timer service health |
dcos.health.history | History service health |
mdcos.health.log.master.socket | Master Log socket health |
dcos.health.net.watchdog | Net Watchdog service health |
dcos.health.gc | Docker GC |
dcos.health.resolv.timer | Generate resolv.conf Timer service health |
dcos.health.mesos.master | Mesos Master service health |
dcos.health.authentication | Authentication service health |
dcos.health.gc.timer | Docker GC Timer |
dcos.health.diagnostics.agent | Diagnostics Agent service health |
dcos.health.jobs | Jobs service health |
dcos.health.net | Net service health |
dcos.health.rexray | REX_Ray service health |
dcos.health.diagnostics.agent.socket | Diagnostics Agent socket health |
dcos.health.logrotate.agent | Agent Logrotate service health |
dcos.health.logrotate.master | Master Logrotate service health |
dcos.health.logrotate.master.timer | Logrotate Timer |
dcos.health.mesos.agent.public | Mesos Public Agent service health |
dcos.health.poststart.checks | Poststart Checks service health |
dcos.health.signal | Signal service health |
dcos.health.signal.timer | Signal Timer service health |
dcos.health.resolv | Generate resolv.conf service health |
dcos.health.telegraf.socket | Telegraf socket health |
dcos.health.log.master | Master Log service health |
dcos.health.exhibitor | Exhibitor service health |
dcos.health.checks.api | Checks API service health |
dcos.health.component.package.manager | Component Package Manager (Pkgpanda) service health |
dcos.health.log.agent.socket | Agent Log socket health |
dcos.health.package.manager | Package Manager service health |
dcos.health.logrotate.agent.timer | Logrotate Timer |
dcos.health.mesos.agent | Mesos Agent service health |
dcos.health.mesos.dns | Mesos DNS service health |