mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2024-11-10 04:27:25 +01:00
## `smartmon` collector

```json
  "smartmon": {
    "use_sudo": true,
    "exclude_devices": [
      "/dev/sda"
    ]
  }
```

The `smartmon` collector retrieves S.M.A.R.T. data from disks by running the `smartctl` command. With `use_sudo` enabled, `smartctl` is invoked through `sudo`; devices listed in `exclude_devices` are skipped.
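
As an illustration of how the two options could shape the `smartctl` call, here is a minimal sketch in Go. It is not the collector's actual code: `Config`, `parseConfig`, and `smartctlArgs` are hypothetical names, and the choice of `-j` (JSON output) and `-a` (all device information) is an assumption about which `smartctl` flags are used.

```go
package main

import "encoding/json"

// Config mirrors the inner object of the "smartmon" entry shown above.
// The type name is hypothetical; only the JSON keys come from the README.
type Config struct {
	UseSudo        bool     `json:"use_sudo"`
	ExcludeDevices []string `json:"exclude_devices"`
}

// parseConfig decodes the inner "smartmon" object (without the outer key).
func parseConfig(raw []byte) (Config, error) {
	var cfg Config
	err := json.Unmarshal(raw, &cfg)
	return cfg, err
}

// smartctlArgs returns the command line for querying one device,
// or nil if the device is listed in exclude_devices.
// Assumption: the device is queried with `smartctl -j -a <device>`.
func smartctlArgs(cfg Config, device string) []string {
	for _, excluded := range cfg.ExcludeDevices {
		if excluded == device {
			return nil
		}
	}
	args := []string{"smartctl", "-j", "-a", device}
	if cfg.UseSudo {
		// Prefix the command with sudo when use_sudo is set.
		args = append([]string{"sudo"}, args...)
	}
	return args
}
```

A non-nil result could then be passed to `exec.Command(args[0], args[1:]...)` from the `os/exec` package.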

Metrics:
* `smartmon_temp`: Temperature of the device (`unit=degC`)
* `smartmon_avail_spare`: Remaining spare capacity (`unit=percent`)
* `smartmon_percent_used`: Estimated percentage of the device's life used (`unit=percent`)
* `smartmon_data_units_read`: Data units read
* `smartmon_data_units_write`: Data units written
* `smartmon_host_reads`: Number of read operations
* `smartmon_host_writes`: Number of write operations
* `smartmon_power_cycles`: Number of power cycles
* `smartmon_power_on`: Time the device has been powered on (`unit=seconds`)
* `smartmon_unsafe_shutdowns`: Number of unsafe shutdowns
* `smartmon_media_errors`: Number of media errors on the device
* `smartmon_errlog_entries`: Number of error log entries
* `smartmon_warn_temp_time`: Time spent above the warning temperature threshold
* `smartmon_crit_temp_time`: Time spent above the critical temperature threshold
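
To make the mapping from `smartctl` output to metric names more concrete, the following Go sketch decodes a few fields of the NVMe health log from `smartctl -j -a` output. It rests on assumptions: the JSON key names (`nvme_smart_health_information_log`, `temperature`, and so on) reflect what `smartctl` typically emits for NVMe devices, and the pairing with the `smartmon_*` names is inferred from the list above rather than taken from the collector's source. The names `healthLog`, `smartctlOutput`, and `metricsFromJSON` are hypothetical, and only a subset of the metrics is covered.

```go
package main

import "encoding/json"

// healthLog holds a subset of the NVMe health fields.
// Assumption: these keys match smartctl's JSON output for NVMe devices.
type healthLog struct {
	Temperature     int64 `json:"temperature"`
	AvailableSpare  int64 `json:"available_spare"`
	PercentageUsed  int64 `json:"percentage_used"`
	PowerCycles     int64 `json:"power_cycles"`
	UnsafeShutdowns int64 `json:"unsafe_shutdowns"`
	MediaErrors     int64 `json:"media_errors"`
}

// smartctlOutput picks only the health log out of the full JSON document.
type smartctlOutput struct {
	HealthLog healthLog `json:"nvme_smart_health_information_log"`
}

// metricsFromJSON maps the decoded fields to the metric names listed above.
// The name pairing is an inference from the README, not verified code.
func metricsFromJSON(raw []byte) (map[string]int64, error) {
	var out smartctlOutput
	if err := json.Unmarshal(raw, &out); err != nil {
		return nil, err
	}
	h := out.HealthLog
	return map[string]int64{
		"smartmon_temp":             h.Temperature,
		"smartmon_avail_spare":      h.AvailableSpare,
		"smartmon_percent_used":     h.PercentageUsed,
		"smartmon_power_cycles":     h.PowerCycles,
		"smartmon_unsafe_shutdowns": h.UnsafeShutdowns,
		"smartmon_media_errors":     h.MediaErrors,
	}, nil
}
```

Fields missing from the input decode to zero, so a real implementation would likely need to distinguish "absent" from "0", for example for non-NVMe devices.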