Page MenuHomePhabricator

SystemdUnitDown The systemd unit remove_dangling_cinder_snapshots.service on node cloudbackup1002-dev has been failing for more than two hours.
Closed, ResolvedPublic

Description

Common information

  • alertname: SystemdUnitDown
  • cluster: wmcs
  • instance: cloudbackup1002-dev:9100
  • job: node
  • name: remove_dangling_cinder_snapshots.service
  • prometheus: ops
  • severity: critical
  • site: eqiad
  • source: prometheus
  • state: failed
  • team: wmcs
  • type: oneshot

Firing alerts


Event Timeline

Andrew triaged this task as Medium priority.

this seems to be working now

  NODES
HOME 1
Note 1
os 5
server 2