
# Alerting and Monitoring

Each machine runs its own Prometheus and Alertmanager instances, which monitor the services running on that machine.

HealthChecks.io acts as a dead man's switch for failures that Prometheus and Alertmanager cannot report themselves: when either of them is down, or when the entire machine is down or unresponsive. A cron script (`~/healthchecks.sh`) sends a heartbeat ping every 5 minutes, with a 10-minute grace period before an alert is raised.
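The timing of the heartbeat check can be illustrated with a minimal sketch. It is not the contents of `~/healthchecks.sh`; it only models how a ping interval and a grace period combine, assuming the check's expected period on HealthChecks.io is set to the same 5 minutes as the cron interval. The function name `check_status` and the status labels are hypothetical, chosen for illustration.

```python
from datetime import datetime, timedelta

# The 5-minute interval and 10-minute grace period come from the text above.
# Assumption: the check's configured period equals the cron interval.
PERIOD = timedelta(minutes=5)
GRACE = timedelta(minutes=10)


def check_status(last_ping: datetime, now: datetime,
                 period: timedelta = PERIOD,
                 grace: timedelta = GRACE) -> str:
    """Classify a heartbeat check as 'up', 'late', or 'down' (hypothetical labels)."""
    elapsed = now - last_ping
    if elapsed <= period:
        return "up"    # the next ping is not yet overdue
    if elapsed <= period + grace:
        return "late"  # overdue, but still inside the grace period
    return "down"      # grace period exhausted, so an alert would be raised


last_ping = datetime(2024, 1, 1, 12, 0)
for minutes in (4, 12, 16):
    now = last_ping + timedelta(minutes=minutes)
    print(minutes, check_status(last_ping, now))
```

Under these assumptions, an alert would fire roughly 15 minutes after the last successful ping (5-minute period plus 10-minute grace), so a single missed ping does not page anyone.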

All services are integrated with PagerDuty for alerts.

{% content-ref url="/pages/ud8BcY3LqFgG7wrR9MNH" %}
[Prometheus](/ethereum-dev/infrastructure/alerting-and-monitoring/prometheus.md)
{% endcontent-ref %}

{% content-ref url="/pages/fOljlY8Mdp94TYLZMrg2" %}
[HealthChecks.io](/ethereum-dev/infrastructure/alerting-and-monitoring/healthchecks.io.md)
{% endcontent-ref %}

{% content-ref url="/pages/tuIrcOP19fAC24doLzOB" %}
[PagerDuty](/ethereum-dev/infrastructure/alerting-and-monitoring/pagerduty.md)
{% endcontent-ref %}