Commit 9dbc3f4
committed
fix(alarms): exclude systemd-coredump@* transient units from FailedUnits count
systemd-coredump@<uid>-<pid>-<n>.service units are one-shot transient units
that systemd spawns to handle a coredump and then leaves in 'failed' state
after exit. They are not real service failures but they inflate the
FailedUnits metric and cause loki-failed-units to fire on any box that
has recently dumped a core.
Patch the documented health-check command to grep them out. (Live fix also
applied to /usr/local/bin/loki-health-check.sh on the current instance.)1 parent 124c966 commit 9dbc3f4
1 file changed
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
163 | 163 | | |
164 | 164 | | |
165 | 165 | | |
166 | | - | |
| 166 | + | |
167 | 167 | | |
168 | 168 | | |
169 | 169 | | |
| |||
0 commit comments