mirror of
https://github.com/ClusterCockpit/cc-metric-collector.git
synced 2026-03-10 18:47:30 +01:00
1.6 KiB
1.6 KiB
smartmon collector
"smartmon": {
"use_sudo": true,
"exclude_devices": [
"/dev/sda"
],
"excludeMetrics": [
"smartmon_warn_temp_time",
"smartmon_crit_comp_time"
],
"devices": [
{
"name": "/dev/nvme0",
"type": "nvme"
}
]
}
The smartmon collector retrieves S.M.A.R.T data from NVMEs via command smartctl.
Available NVMEs can be either automatically detected by a device scan or manually added with the "devices" config option.
Metrics:
smartmon_temp: Temperature of the device (unit=degC)smartmon_avail_spare: Amount of spare left (unit=percent)smartmon_percent_used: Percentage of the device is used (unit=percent)smartmon_data_units_read: Read data unitssmartmon_data_units_write: Written data unitssmartmon_host_reads: Read operationssmartmon_host_writes: Write operationssmartmon_power_cycles: Number of power cyclessmartmon_power_on: Seconds the device is powered on (unit=seconds)smartmon_unsafe_shutdowns: Count of unsafe shutdownssmartmon_media_errors: Media errors of the devicesmartmon_errlog_entries: Error log entriessmartmon_warn_temp_time: Time above the warning temperature thresholdsmartmon_crit_comp_time: Time above the critical composite temperature threshold