Skip to main content

io-errors

Overview

Detects NVMe storage device errors by searching dmesg for I/O error messages. Extracts unique NVMe device names experiencing errors and triggers CRITICAL state if any I/O errors are detected.

Command-Line Options

OptionTypeDefaultDescription
--timeoutInteger300Command execution timeout in seconds
--sinkStringdo_nothingTelemetry sink destination
--sink-optsMultiple-Sink-specific configuration
--verbose-outFlagFalseDisplay detailed output
--log-levelChoiceINFODEBUG, INFO, WARNING, ERROR, CRITICAL
--log-folderString/var/log/fb-monitoringLog directory
--heterogeneous-cluster-v1FlagFalseEnable heterogeneous cluster support

Exit Conditions

Exit CodeCondition
OK (0)Feature flag disabled (killswitch active)
OK (0)No I/O errors detected
WARN (1)Command execution failed
CRITICAL (2)NVMe I/O errors detected

Usage Examples

io-errors - Basic Check

health_checks check-syslogs io-errors [CLUSTER] app

io-errors - Debug Mode

health_checks check-syslogs io-errors \
--log-level DEBUG \
--verbose-out \
[CLUSTER] \
app