Skip to main content

uptime

Overview

Verifies that the compute node has been running long enough to be considered stable. Warns if the node recently rebooted, which may indicate maintenance or instability.

Command-Line Options

OptionTypeDefaultDescription
--uptime-thresholdInteger600 secondsMinimum uptime before WARN condition (default: 10 minutes)
--timeoutInteger300Command execution timeout in seconds
--sinkStringdo_nothingTelemetry sink destination
--sink-optsMultiple-Sink-specific configuration
--verbose-outFlagFalseDisplay detailed output
--log-levelChoiceINFODEBUG, INFO, WARNING, ERROR, CRITICAL
--log-folderString/var/log/fb-monitoringLog directory
--heterogeneous-cluster-v1FlagFalseEnable heterogeneous cluster support

Exit Conditions

Exit CodeCondition
OK (0)Feature flag disabled (killswitch active)
OK (0)Uptime exceeds threshold
WARN (1)Uptime below threshold
WARN (1)Command execution failed

Usage Examples

Basic Uptime Check

# 5 minute threshold (300 seconds)
health_checks check-node uptime \
--uptime-threshold 300 \
[CLUSTER] \
app

With Telemetry

health_checks check-node uptime \
--sink otel \
--sink-opts "log_resource_attributes={'attr_1': 'value1'}" \
--verbose-out \
[CLUSTER] \
app

Debug Mode

health_checks check-node uptime \
--log-level DEBUG \
--verbose-out \
[CLUSTER] \
app