Skip to main content

check_zombie

Overview

Detects zombie processes (defunct processes in state 'Z') that have persisted beyond a configurable threshold. Zombie processes indicate parent processes failing to reap child processes and may signal application bugs or system issues.

Command-Line Options

OptionTypeDefaultDescription
--elapsedInteger300Age threshold in seconds for zombie processes
--timeoutInteger300Command execution timeout in seconds
--sinkStringdo_nothingTelemetry sink destination
--sink-optsMultiple-Sink-specific configuration
--verbose-outFlagFalseDisplay detailed output
--log-levelChoiceINFODEBUG, INFO, WARNING, ERROR, CRITICAL
--log-folderString/var/log/fb-monitoringLog directory
--heterogeneous-cluster-v1FlagFalseEnable heterogeneous cluster support

Exit Conditions

Exit CodeCondition
OK (0)Feature flag disabled (killswitch active)
OK (0)No zombie processes exceed threshold
WARN (1)Zombie processes exceed age threshold
UNKNOWN (3)Command execution failed

Usage Examples

check-zombie - Basic Detection

health_checks check-process check-zombie \
--sink stdout \
[CLUSTER] \
app

check-zombie - Custom Threshold

health_checks check-process check-zombie \
--elapsed 600 \
--sink otel \
--sink-opts "log_resource_attributes={'attr_1': 'value1'}" \
[CLUSTER] \
app

check-zombie - With Telemetry

health_checks check-process check-zombie \
--elapsed 300 \
--timeout 60 \
--sink file --sink-opts filepath=/var/log/zombie_check.json \
[CLUSTER] \
app

check-zombie - Debug Mode

health_checks check-process check-zombie \
--elapsed 120 \
--log-level DEBUG \
--verbose-out \
--sink stdout \
[CLUSTER] \
app