InfluxDB by HTTP
Macros used
| Name | Value |
|---|---|
| {$INFLUXDB.API.TOKEN} | - |
| {$INFLUXDB.ORG_NAME.MATCHES} | .* |
| {$INFLUXDB.ORG_NAME.NOT_MATCHES} | CHANGE_IF_NEEDED |
| {$INFLUXDB.REQ.FAIL.MAX.WARN} | 2 |
| {$INFLUXDB.TASK.RUN.FAIL.MAX.WARN} | 2 |
| {$INFLUXDB.URL} | http://localhost:8086 |
Items collected
| Name | Description | Type | Interval | Key and additional info |
|---|---|---|---|---|
| InfluxDB: Get instance metrics | - | HTTP_AGENT | - | influx.get_metrics |
| InfluxDB: Instance status | Get the health of an instance. | HTTP_AGENT | - | influx.healthcheck |
| InfluxDB: Boltdb reads, rate | Total number of boltdb reads per second. | DEPENDENT | - | influxdb.boltdb_reads.rate |
| InfluxDB: Boltdb writes, rate | Total number of boltdb writes per second. | DEPENDENT | - | influxdb.boltdb_writes.rate |
| InfluxDB: Buckets, total | Number of total buckets on the server. | DEPENDENT | - | influxdb.buckets.total |
| InfluxDB: Dashboards, total | Number of total dashboards on the server. | DEPENDENT | - | influxdb.dashboards.total |
| InfluxDB: Organizations, total | Number of total organizations on the server. | DEPENDENT | - | influxdb.organizations.total |
| InfluxDB: Scrapers, total | Number of total scrapers on the server. | DEPENDENT | - | influxdb.scrapers.total |
| InfluxDB: Task runs failed, rate | Total number of failure runs across all tasks. | DEPENDENT | - | influxdb.task_executor_complete.failed.rate |
| InfluxDB: Task runs successful, rate | Total number of runs successful completed across all tasks. | DEPENDENT | - | influxdb.task_executor_complete.successful.rate |
| InfluxDB: Workers currently running | Total number of workers currently running tasks. | DEPENDENT | - | influxdb.task_executor_runs_active.total |
| InfluxDB: Workers busy, pct | Percent of total available workers that are currently busy. | DEPENDENT | - | influxdb.task_executor_workers_busy.pct |
| InfluxDB: Telegrafs, total | Number of total telegraf configurations on the server. | DEPENDENT | - | influxdb.telegrafs.total |
| InfluxDB: Telegraf plugins, total | Number of individual telegraf plugins configured. | DEPENDENT | - | influxdb.telegraf_plugins.total |
| InfluxDB: Tokens, total | Number of total tokens on the server. | DEPENDENT | - | influxdb.tokens.total |
| InfluxDB: Uptime | InfluxDB process uptime in seconds. | DEPENDENT | - | influxdb.uptime |
| InfluxDB: Users, total | Number of total users on the server. | DEPENDENT | - | influxdb.users.total |
| InfluxDB: Version | Version of the InfluxDB instance. | DEPENDENT | - | influxdb.version |
Triggers
| Name | Description | Expression | Priority | Dependencies |
|---|---|---|---|---|
| InfluxDB: Health check was failed | The InfluxDB instance is not available or unhealthy. | last(/InfluxDB by HTTP/influx.healthcheck)=0 | HIGH ⛔ | InfluxDB: Instance status |
| InfluxDB: Too many tasks failure runs | "Number of failure runs completed across all tasks is too high." | min(/InfluxDB by HTTP/influxdb.task_executor_complete.failed.rate,5m)>{$INFLUXDB.TASK.RUN.FAIL.MAX.WARN} | WARNING 📢 | InfluxDB: Task runs failed, rate |
| InfluxDB: has been restarted | Uptime is less than 10 minutes. | last(/InfluxDB by HTTP/influxdb.uptime)<10m | INFO 🔔 | InfluxDB: Uptime |
| InfluxDB: Version has changed | InfluxDB version has changed. Ack to close. | last(/InfluxDB by HTTP/influxdb.version,#1)<>last(/InfluxDB by HTTP/influxdb.version,#2) and length(last(/InfluxDB by HTTP/influxdb.version))>0 | INFO 🔔 | InfluxDB: Version |
Discovery rule №1
| Name | Description | Type | Interval | Key and additional info |
|---|---|---|---|---|
| Organizations discovery | Discovery of organizations metrics. | HTTP_AGENT | 10m | influxdb.orgs.discovery |
Item prototypes
| Name | Description | Type | Interval | Key and additional info |
|---|---|---|---|---|
| InfluxDB: [{#ORG_NAME}] Query response bytes, failed | Count of bytes returned with status not 200 per second. | DEPENDENT | - | influxdb.org.http_query_response_bytes.failed.rate["{#ORG_NAME}"] |
| InfluxDB: [{#ORG_NAME}] Query response bytes, success | Count of bytes returned with status 200 per second. | DEPENDENT | - | influxdb.org.http_query_response_bytes.success.rate["{#ORG_NAME}"] |
| InfluxDB: [{#ORG_NAME}] Query requests, failed | Total number of query requests with status not 200 per second. | DEPENDENT | - | influxdb.org.query_request.failed.rate["{#ORG_NAME}"] |
| InfluxDB: [{#ORG_NAME}] Query requests, success | Total number of query requests with status 200 per second. | DEPENDENT | - | influxdb.org.query_request.success.rate["{#ORG_NAME}"] |
| InfluxDB: [{#ORG_NAME}] Query requests bytes, failed | Count of bytes received with status not 200 per second. | DEPENDENT | - | influxdb.org.query_request_bytes.failed.rate["{#ORG_NAME}"] |
| InfluxDB: [{#ORG_NAME}] Query requests bytes, success | Count of bytes received with status 200 per second. | DEPENDENT | - | influxdb.org.query_request_bytes.success.rate["{#ORG_NAME}"] |
Trigger prototypes
| Name | Description | Expression | Priority | Dependencies |
|---|---|---|---|---|
| InfluxDB: [{#ORG_NAME}]: Too many requests failures | Too many query requests failed. | min(/InfluxDB by HTTP/influxdb.org.query_request.failed.rate["{#ORG_NAME}"],5m)>{$INFLUXDB.REQ.FAIL.MAX.WARN} | WARNING 📢 | InfluxDB: [{#ORG_NAME}] Query requests, failed |