CockroachDB by HTTP

Macros used

Name | Value
{$COCKROACHDB.API.PORT} | 8080
{$COCKROACHDB.API.SCHEME} | http
{$COCKROACHDB.CERT.CA.EXPIRY.WARN} | 90
{$COCKROACHDB.CERT.NODE.EXPIRY.WARN} | 30
{$COCKROACHDB.CLOCK.OFFSET.MAX.WARN} | 300
{$COCKROACHDB.OPEN.FDS.MAX.WARN} | 80
{$COCKROACHDB.STATEMENTS.ERRORS.MAX.WARN} | 2
{$COCKROACHDB.STORE.USED.MIN.CRIT} | 10
{$COCKROACHDB.STORE.USED.MIN.WARN} | 20
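
As an illustration of how the scheme and port macros might combine with the host address, the sketch below builds the two health URLs named in the item list (`/health` and `/health?ready=1`). The function name `build_urls` and the sample address are hypothetical; the defaults mirror the macro values above.

```python
from urllib.parse import urlunsplit


def build_urls(host: str, scheme: str = "http", port: int = 8080) -> dict:
    """Build the health and readiness URLs for one node.

    Defaults mirror {$COCKROACHDB.API.SCHEME} and {$COCKROACHDB.API.PORT}.
    """
    netloc = f"{host}:{port}"
    return {
        "health": urlunsplit((scheme, netloc, "/health", "", "")),
        "readiness": urlunsplit((scheme, netloc, "/health", "ready=1", "")),
    }


# 10.0.0.5 is a placeholder for {HOST.CONN}.
urls = build_urls("10.0.0.5")
print(urls["health"])
print(urls["readiness"])
```

Overriding `scheme` with `https` corresponds to changing {$COCKROACHDB.API.SCHEME} on a secured cluster.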

Items collected

Name | Description | Type | Interval | Key and additional info
CockroachDB: CA certificate expiration date | CA certificate expires at that date. | DEPENDENT | - | cockroachdb.cert.expire_date.ca
CockroachDB: Node certificate expiration date | Node certificate expires at that date. | DEPENDENT | - | cockroachdb.cert.expire_date.node
CockroachDB: Clock offset | Mean clock offset of the node against the rest of the cluster. | DEPENDENT | - | cockroachdb.clock.offset
CockroachDB: CPU: System time | System CPU time. | DEPENDENT | - | cockroachdb.cpu.system_time
CockroachDB: CPU: User time | User CPU time. | DEPENDENT | - | cockroachdb.cpu.user_time
CockroachDB: CPU: Utilization | CPU utilization in %. | DEPENDENT | - | cockroachdb.cpu.util
CockroachDB: File descriptors: Limit | Open file descriptors soft limit of the process. | DEPENDENT | - | cockroachdb.descriptors.limit
CockroachDB: File descriptors: Open | The number of open file descriptors. | DEPENDENT | - | cockroachdb.descriptors.open
CockroachDB: Disk: IOPS in progress, rate | Number of disk IO operations currently in progress on this host. | DEPENDENT | - | cockroachdb.disk.iops.in_progress.rate
CockroachDB: Disk: Read IOPS, rate | Number of disk read operations per second across all disks since this process started. | DEPENDENT | - | cockroachdb.disk.iops.read.rate
CockroachDB: Disk: Write IOPS, rate | Disk write operations per second across all disks since this process started. | DEPENDENT | - | cockroachdb.disk.iops.write.rate
CockroachDB: Disk: Reads, rate | Bytes read from all disks per second since this process started. | DEPENDENT | - | cockroachdb.disk.read.rate
CockroachDB: Disk: Writes, rate | Bytes written to all disks per second since this process started. | DEPENDENT | - | cockroachdb.disk.write.rate
CockroachDB: GC: Pause time | The amount of processor time used by Go's garbage collector across all nodes. During garbage collection, application code execution is paused. | DEPENDENT | - | cockroachdb.gc.pause_time
CockroachDB: GC: Runs, rate | The number of times that Go's garbage collector was invoked per second across all nodes. | DEPENDENT | - | cockroachdb.gc.runs.rate
CockroachDB: Get health | Get node /health endpoint. | HTTP_AGENT | - | cockroachdb.get_health
CockroachDB: Get metrics | Get raw metrics from the Prometheus endpoint. | HTTP_AGENT | - | cockroachdb.get_metrics
CockroachDB: Get readiness | Get node /health?ready=1 endpoint. | HTTP_AGENT | - | cockroachdb.get_readiness
CockroachDB: Go: Goroutines count | Current number of Goroutines. This count should rise and fall based on load. | DEPENDENT | - | cockroachdb.go.goroutines.count
CockroachDB: Liveness heartbeats, rate | Number of successful node liveness heartbeats per second from this node. | DEPENDENT | - | cockroachdb.heartbeaths.success.rate
CockroachDB: KV transactions: Aborted, rate | Number of aborted KV transactions per second. | DEPENDENT | - | cockroachdb.kv.transactions.aborted.rate
CockroachDB: KV transactions: Committed, rate | Number of KV transactions (including 1PC) committed per second. | DEPENDENT | - | cockroachdb.kv.transactions.committed.rate
CockroachDB: Live nodes count | The number of live nodes in the cluster (will be 0 if this node is not itself live). | DEPENDENT | - | cockroachdb.live_count
CockroachDB: Memory: Allocated by Cgo | Current bytes of memory allocated by the C layer. | DEPENDENT | - | cockroachdb.memory.cgo.allocated
CockroachDB: Memory: Managed by Cgo | Total bytes of memory managed by the C layer. | DEPENDENT | - | cockroachdb.memory.cgo.managed
CockroachDB: Memory: Allocated by Go | Current bytes of memory allocated by the Go layer. | DEPENDENT | - | cockroachdb.memory.go.allocated
CockroachDB: Memory: Managed by Go | Total bytes of memory managed by the Go layer. | DEPENDENT | - | cockroachdb.memory.go.managed
CockroachDB: Memory: Allocated by SQL | Current SQL statement memory usage for root. | DEPENDENT | - | cockroachdb.memory.sql
CockroachDB: Memory: Total usage | Resident set size (RSS) of memory in use by the node. | DEPENDENT | - | cockroachdb.memory.total
CockroachDB: Network: Bytes received, rate | Bytes received per second on all network interfaces since this process started. | DEPENDENT | - | cockroachdb.network.bytes.received.rate
CockroachDB: Network: Bytes sent, rate | Bytes sent per second on all network interfaces since this process started. | DEPENDENT | - | cockroachdb.network.bytes.sent.rate
CockroachDB: Slow requests: DistSender RPCs | Number of RPCs stuck or retrying for a long time. | DEPENDENT | - | cockroachdb.slow_requests.rpc
CockroachDB: SQL: Bytes received, rate | Total amount of incoming SQL client network traffic in bytes per second. | DEPENDENT | - | cockroachdb.sql.bytes.received.rate
CockroachDB: SQL: Bytes sent, rate | Total amount of outgoing SQL client network traffic in bytes per second. | DEPENDENT | - | cockroachdb.sql.bytes.sent.rate
CockroachDB: SQL: Schema changes, rate | Total number of SQL DDL statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.schema_changes.rate
CockroachDB: SQL sessions: Open | Total number of open SQL sessions. | DEPENDENT | - | cockroachdb.sql.sessions
CockroachDB: SQL statements: Active | Total number of SQL statements currently active. | DEPENDENT | - | cockroachdb.sql.statements.active
CockroachDB: SQL statements: Contention, rate | Total number of SQL statements that experienced contention per second. | DEPENDENT | - | cockroachdb.sql.statements.contention.rate
CockroachDB: SQL statements: DELETE, rate | A moving average of the number of DELETE statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.statements.delete.rate
CockroachDB: SQL statements: Denials, rate | The number of statements denied per second by a feature flag. | DEPENDENT | - | cockroachdb.sql.statements.denials.rate
CockroachDB: SQL statements: Errors, rate | Total number of statements which returned a planning or runtime error per second. | DEPENDENT | - | cockroachdb.sql.statements.errors.rate
CockroachDB: SQL statements: Executed, rate | Number of SQL queries executed per second. | DEPENDENT | - | cockroachdb.sql.statements.executed.rate
CockroachDB: SQL statements: Active flows distributed, rate | The number of distributed SQL flows currently active per second. | DEPENDENT | - | cockroachdb.sql.statements.flows.active.rate
CockroachDB: SQL statements: INSERT, rate | A moving average of the number of INSERT statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.statements.insert.rate
CockroachDB: SQL statements: SELECT, rate | A moving average of the number of SELECT statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.statements.select.rate
CockroachDB: SQL statements: UPDATE, rate | A moving average of the number of UPDATE statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.statements.update.rate
CockroachDB: SQL transactions: Aborted, rate | Total number of SQL transaction abort errors per second. | DEPENDENT | - | cockroachdb.sql.transactions.aborted.rate
CockroachDB: SQL transactions: Committed, rate | Total number of SQL transaction COMMIT statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.transactions.committed.rate
CockroachDB: SQL transactions: Initiated, rate | Total number of SQL transaction BEGIN statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.transactions.initiated.rate
CockroachDB: SQL transactions: Open | Total number of currently open SQL transactions. | DEPENDENT | - | cockroachdb.sql.transactions.open
CockroachDB: SQL transactions: Rolled back, rate | Total number of SQL transaction ROLLBACK statements successfully executed per second. | DEPENDENT | - | cockroachdb.sql.transactions.rollbacks.rate
CockroachDB: Time series: Sample errors, rate | The number of errors encountered while attempting to write metrics to disk, per second. | DEPENDENT | - | cockroachdb.ts.samples.errors.rate
CockroachDB: Time series: Samples written, rate | The number of successfully written metric samples per second. | DEPENDENT | - | cockroachdb.ts.samples.written.rate
CockroachDB: Uptime | Process uptime. | DEPENDENT | - | cockroachdb.uptime
CockroachDB: Version | Build information. | DEPENDENT | - | cockroachdb.version
CockroachDB: Service ping | Check if HTTP/HTTPS service accepts TCP connections. | SIMPLE | - | net.tcp.service["{$COCKROACHDB.API.SCHEME}","{HOST.CONN}","{$COCKROACHDB.API.PORT}"]
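
Most items above are DEPENDENT: they take their values from the raw text fetched by the "Get metrics" item rather than polling the node themselves. The sketch below shows one way such values could be pulled out of Prometheus text exposition output. It is a simplified illustration, not the template's actual preprocessing: the parser ignores optional timestamps, and the metric names in `SAMPLE` are illustrative placeholders.

```python
def parse_prometheus(text: str) -> dict:
    """Map each 'name{labels}' series to its float value.

    Simplification: assumes every sample line is '<series> <value>'
    with no trailing timestamp.
    """
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and '# HELP' / '# TYPE' comment lines.
        if not line or line.startswith("#"):
            continue
        series, _, value = line.rpartition(" ")
        samples[series] = float(value)
    return samples


# Illustrative payload; the metric names are assumptions.
SAMPLE = """\
# HELP sys_uptime Process uptime
# TYPE sys_uptime gauge
sys_uptime 3600
sql_conns 12
capacity_available{store="1"} 5e+10
"""

samples = parse_prometheus(SAMPLE)
print(samples["sys_uptime"])
print(samples['capacity_available{store="1"}'])
```

Fetching the payload once and deriving many values from it keeps the load on the node to a single HTTP request per polling cycle.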

Triggers

Name | Description | Expression | Priority | Dependencies
CockroachDB: CA certificate expires soon | CA certificate expires soon. | (last(/CockroachDB by HTTP/cockroachdb.cert.expire_date.ca) - now()) / 86400 < {$COCKROACHDB.CERT.CA.EXPIRY.WARN} | WARNING 📢 | CockroachDB: CA certificate expiration date
CockroachDB: Node certificate expires soon | Node certificate expires soon. | (last(/CockroachDB by HTTP/cockroachdb.cert.expire_date.node) - now()) / 86400 < {$COCKROACHDB.CERT.NODE.EXPIRY.WARN} | WARNING 📢 | CockroachDB: Node certificate expiration date
CockroachDB: Clock offset is too high | Cockroach-measured clock offset is nearing limit (by default, servers kill themselves at 400ms from the mean). | min(/CockroachDB by HTTP/cockroachdb.clock.offset,5m) > {$COCKROACHDB.CLOCK.OFFSET.MAX.WARN} * 0.001 | WARNING 📢 | CockroachDB: Clock offset
CockroachDB: Node is unhealthy | Node's /health endpoint has returned HTTP 500 Internal Server Error which indicates unhealthy mode. | last(/CockroachDB by HTTP/cockroachdb.get_health) = 500 | AVERAGE ⚠ | CockroachDB: Get health
CockroachDB: SQL statements errors rate is too high | - | min(/CockroachDB by HTTP/cockroachdb.sql.statements.errors.rate,5m) > {$COCKROACHDB.STATEMENTS.ERRORS.MAX.WARN} | WARNING 📢 | CockroachDB: SQL statements: Errors, rate
CockroachDB: Failed to fetch node data | Zabbix has not received data for items for the last 5 minutes. | nodata(/CockroachDB by HTTP/cockroachdb.uptime,5m) = 1 | WARNING 📢 | CockroachDB: Uptime
CockroachDB: Node has been restarted | Uptime is less than 10 minutes. | last(/CockroachDB by HTTP/cockroachdb.uptime) < 10m | INFO 🔔 | CockroachDB: Uptime
CockroachDB: Version has changed | - | last(/CockroachDB by HTTP/cockroachdb.version) <> last(/CockroachDB by HTTP/cockroachdb.version,#2) and length(last(/CockroachDB by HTTP/cockroachdb.version)) > 0 | INFO 🔔 | CockroachDB: Version
CockroachDB: Service is down | - | last(/CockroachDB by HTTP/net.tcp.service["{$COCKROACHDB.API.SCHEME}","{HOST.CONN}","{$COCKROACHDB.API.PORT}"]) = 0 | AVERAGE ⚠ | CockroachDB: Service ping
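
The arithmetic in the certificate and clock offset expressions can be sketched in Python as follows. This is an illustration of the expressions as written, under two assumptions: the expiration item stores a Unix timestamp, and the clock offset item is reported in seconds (which is what multiplying the millisecond macro by 0.001 suggests). The function names are hypothetical.

```python
import time


def cert_expires_soon(expire_ts: float, warn_days: float, now: float = None) -> bool:
    """Mirror (last(expire_date) - now()) / 86400 < WARN; 86400 seconds = 1 day."""
    now = time.time() if now is None else now
    return (expire_ts - now) / 86400 < warn_days


def clock_offset_too_high(offsets_sec: list, max_warn_ms: float = 300) -> bool:
    """Mirror min(offset, 5m) > WARN * 0.001.

    offsets_sec holds the samples from the last 5 minutes, in seconds
    (assumed unit). Using min() means every sample must exceed the limit.
    """
    return min(offsets_sec) > max_warn_ms * 0.001


now = 1_700_000_000  # fixed reference time for a reproducible example
print(cert_expires_soon(now + 10 * 86400, warn_days=30, now=now))   # 10 days left
print(cert_expires_soon(now + 100 * 86400, warn_days=90, now=now))  # 100 days left
print(clock_offset_too_high([0.35, 0.31, 0.40]))
print(clock_offset_too_high([0.35, 0.10, 0.40]))
```

Because the trigger uses the minimum over the window, a single sample below the threshold is enough to keep it from firing, which filters out short spikes.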

Discovery rule №1

Name | Description | Type | Interval | Key and additional info
Storage metrics discovery | Discover per store metrics. | DEPENDENT | 0 | cockroachdb.store.discovery

Item prototypes

Name | Description | Type | Interval | Key and additional info
CockroachDB: Storage [{#STORE}]: Queue processing failures: Consistency, rate | Number of replicas which failed processing in the consistency checker queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.consistency.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Queue processing failures: GC, rate | Number of replicas which failed processing in the GC queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.gc.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Queue processing failures: Replica GC, rate | Number of replicas which failed processing in the replica GC queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.gc_replica.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Queue processing failures: Raft log, rate | Number of replicas which failed processing in the Raft log queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.raftlog.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Queue processing failures: Raft snapshot, rate | Number of replicas which failed processing in the Raft repair queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.raftsnapshot.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Queue processing failures: Replicate, rate | Number of replicas which failed processing in the replicate queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.replicate.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Queue processing failures: Split, rate | Number of replicas which failed processing in the split queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.split.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Queue processing failures: Time series maintenance, rate | Number of replicas which failed processing in the time series maintenance queue per second. | DEPENDENT | - | cockroachdb.queue.processing_failures.tsmaintenance.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Ranges count | Number of ranges. | DEPENDENT | - | cockroachdb.ranges.[{#STORE},count]
CockroachDB: Storage [{#STORE}]: Ranges unavailable | Number of ranges with fewer live replicas than needed for quorum. | DEPENDENT | - | cockroachdb.ranges.[{#STORE},unavailable]
CockroachDB: Storage [{#STORE}]: Ranges underreplicated | Number of ranges with fewer live replicas than the replication target. | DEPENDENT | - | cockroachdb.ranges.[{#STORE},underreplicated]
CockroachDB: Storage [{#STORE}]: Rebalancing: Average queries, rate | Number of kv-level requests received per second by the store, averaged over a large time period as used in rebalancing decisions. | DEPENDENT | - | cockroachdb.rebalancing.queries.average.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Rebalancing: Average writes, rate | Number of keys written (i.e. applied by raft) per second to the store, averaged over a large time period as used in rebalancing decisions. | DEPENDENT | - | cockroachdb.rebalancing.writes.average.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: Replication: Replicas | Number of replicas. | DEPENDENT | - | cockroachdb.replication.replicas.[{#STORE},count]
CockroachDB: Storage [{#STORE}]: Replication: Replicas quiesced | Number of quiesced replicas. | DEPENDENT | - | cockroachdb.replication.replicas.[{#STORE},quiesced]
CockroachDB: Storage [{#STORE}]: Replication: Lease holders | Number of lease holders. | DEPENDENT | - | cockroachdb.replication.[{#STORE},lease_holders]
CockroachDB: Storage [{#STORE}]: RocksDB cache hits, rate | Count of block cache hits per second. | DEPENDENT | - | cockroachdb.rocksdb.cache.hits.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: RocksDB cache misses, rate | Count of block cache misses per second. | DEPENDENT | - | cockroachdb.rocksdb.cache.misses.[{#STORE},rate]
CockroachDB: Storage [{#STORE}]: RocksDB cache hit ratio | Block cache hit ratio in %. | CALCULATED | - | cockroachdb.rocksdb.cache.[{#STORE},hit_ratio]
CockroachDB: Storage [{#STORE}]: RocksDB read amplification | The average number of real read operations executed per logical read operation. | DEPENDENT | - | cockroachdb.rocksdb.[{#STORE},read_amp]
CockroachDB: Storage [{#STORE}]: RocksDB SSTables | The number of SSTables in use. | DEPENDENT | - | cockroachdb.rocksdb.[{#STORE},sstables]
CockroachDB: Storage [{#STORE}]: Slow requests: Latch acquisitions | Number of requests that have been stuck for a long time acquiring latches. | DEPENDENT | - | cockroachdb.slow_requests.[{#STORE},latch_acquisitions]
CockroachDB: Storage [{#STORE}]: Slow requests: Lease acquisitions | Number of requests that have been stuck for a long time acquiring a lease. | DEPENDENT | - | cockroachdb.slow_requests.[{#STORE},lease_acquisitions]
CockroachDB: Storage [{#STORE}]: Slow requests: Raft proposals | Number of requests that have been stuck for a long time in raft. | DEPENDENT | - | cockroachdb.slow_requests.[{#STORE},raft_proposals]
CockroachDB: Storage [{#STORE}]: Bytes: Live | Number of logical bytes stored in live key-value pairs on this node. Live data excludes historical and deleted data. | DEPENDENT | - | cockroachdb.storage.bytes.[{#STORE},live]
CockroachDB: Storage [{#STORE}]: Bytes: Logical | Number of logical bytes stored in key-value pairs on this node. This includes historical and deleted data. | DEPENDENT | - | cockroachdb.storage.bytes.[{#STORE},logical]
CockroachDB: Storage [{#STORE}]: Bytes: System | Number of physical bytes stored in system key-value pairs. | DEPENDENT | - | cockroachdb.storage.bytes.[{#STORE},system]
CockroachDB: Storage [{#STORE}]: Capacity available | Available storage capacity. | DEPENDENT | - | cockroachdb.storage.capacity.[{#STORE},available]
CockroachDB: Storage [{#STORE}]: Capacity available in % | Available storage capacity in %. | CALCULATED | - | cockroachdb.storage.capacity.[{#STORE},available_percent]
CockroachDB: Storage [{#STORE}]: Capacity total | Total storage capacity. This value may be explicitly set using --store. If a store size has not been set, this metric displays the actual disk capacity. | DEPENDENT | - | cockroachdb.storage.capacity.[{#STORE},total]
CockroachDB: Storage [{#STORE}]: Capacity used | Disk space in use by CockroachDB data on this node. This excludes the Cockroach binary, operating system, and other system files. | DEPENDENT | - | cockroachdb.storage.capacity.[{#STORE},used]
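
The two CALCULATED prototypes above derive a percentage from other per-store items. This page does not show the formulas the template uses, so the sketch below is only a plausible reading: hit ratio as hits over hits plus misses, and available capacity as available over total. Both formulas and both function names are assumptions.

```python
def cache_hit_ratio(hits_rate: float, misses_rate: float) -> float:
    """Block cache hit ratio in %, assuming hits / (hits + misses)."""
    total = hits_rate + misses_rate
    # With no cache traffic the ratio is undefined; report 0 here.
    return 0.0 if total == 0 else 100.0 * hits_rate / total


def available_percent(available_bytes: float, total_bytes: float) -> float:
    """Available capacity in %, assuming available / total."""
    return 0.0 if total_bytes == 0 else 100.0 * available_bytes / total_bytes


print(cache_hit_ratio(90, 10))
print(available_percent(25, 200))
```

Guarding against a zero denominator matters for a freshly discovered store, where the rate items may still be zero.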

Trigger prototypes

Name | Description | Expression | Priority | Dependencies
CockroachDB: Storage [{#STORE}]: Available storage capacity is critically low | Storage is running critically low on free space (less than {$COCKROACHDB.STORE.USED.MIN.CRIT}% available). | max(/CockroachDB by HTTP/cockroachdb.storage.capacity.[{#STORE},available_percent],5m) < {$COCKROACHDB.STORE.USED.MIN.CRIT} | AVERAGE ⚠ | CockroachDB: Storage [{#STORE}]: Capacity available in %
CockroachDB: Storage [{#STORE}]: Available storage capacity is low | Storage is running low on free space (less than {$COCKROACHDB.STORE.USED.MIN.WARN}% available). | max(/CockroachDB by HTTP/cockroachdb.storage.capacity.[{#STORE},available_percent],5m) < {$COCKROACHDB.STORE.USED.MIN.WARN} | WARNING 📢 | CockroachDB: Storage [{#STORE}]: Capacity available in %
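
Both prototypes compare the maximum of the last 5 minutes of samples against a threshold, so they fire only when every sample in the window is below it. A minimal sketch of that logic, with a hypothetical function name and defaults taken from the macro values listed earlier:

```python
def fired_triggers(available_pct_5m: list, warn: float = 20, crit: float = 10) -> list:
    """Return the priorities whose condition max(window) < threshold holds.

    warn and crit mirror {$COCKROACHDB.STORE.USED.MIN.WARN} and
    {$COCKROACHDB.STORE.USED.MIN.CRIT}.
    """
    peak = max(available_pct_5m)
    fired = []
    if peak < crit:
        fired.append("AVERAGE")
    if peak < warn:
        fired.append("WARNING")
    return fired


print(fired_triggers([8.0, 9.5, 7.2]))    # all samples below both thresholds
print(fired_triggers([15.0, 18.0, 12.0])) # below the warning threshold only
print(fired_triggers([15.0, 25.0, 12.0])) # one sample at 25% blocks both
```

The sketch evaluates each expression independently; how the two triggers are shown together when both conditions hold depends on the template's dependency configuration.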