<aside> ⚠️ Since the migration to Docker, not all of these monitoring services are working as intended. See Improved Monitoring.

</aside>

Reporting

Devices are monitored primarily with Telegraf. The Telegraf agent is installed on servers and reports statistics to an InfluxDB database. A dedicated Telegraf process also collects information from devices that report over SNMP, such as the Ubiquiti router/switch and Synology NAS.

Proxmox additionally logs its own statistics to InfluxDB.
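
For reference, InfluxDB ingests data points as text in its line protocol, which is the format Telegraf's InfluxDB output writes. The sketch below is only an illustration of that format, not the configuration used in this lab: the helper names, the host tag `pve1`, and the sample values are all hypothetical, and the escaping covers only the common cases (no NaN or infinite floats).

```python
from typing import Mapping, Union

FieldValue = Union[float, int, bool, str]


def _escape(text: str, special: str) -> str:
    """Backslash-escape each character in `special` that appears in `text`."""
    for ch in special:
        text = text.replace(ch, "\\" + ch)
    return text


def _format_field(value: FieldValue) -> str:
    """Render a field value using line protocol type conventions."""
    # bool must be checked before int, because bool is a subclass of int.
    if isinstance(value, bool):
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"  # integers carry a trailing "i"
    if isinstance(value, float):
        return repr(value)
    # Strings are double-quoted; backslashes and quotes are escaped.
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_line_protocol(
    measurement: str,
    tags: Mapping[str, str],
    fields: Mapping[str, FieldValue],
    timestamp_ns: int,
) -> str:
    """Build one line: measurement,tag=value field=value timestamp."""
    if not fields:
        raise ValueError("line protocol requires at least one field")
    head = _escape(measurement, ", ")
    for key in sorted(tags):  # sorted tag keys are recommended for write performance
        head += f",{_escape(key, ',= ')}={_escape(tags[key], ',= ')}"
    body = ",".join(
        f"{_escape(key, ',= ')}={_format_field(value)}"
        for key, value in fields.items()
    )
    return f"{head} {body} {timestamp_ns}"


# Hypothetical sample point: idle CPU percentage reported by a host named "pve1".
line = to_line_protocol(
    "cpu", {"host": "pve1"}, {"usage_idle": 97.5}, 1700000000000000000
)
print(line)  # cpu,host=pve1 usage_idle=97.5 1700000000000000000
```

Tags (such as the host name) are indexed and are what a Grafana query would typically filter or group by, while fields hold the measured values.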

Visualising

Grafana is used to chart and visualise the data stored in the InfluxDB database.

Logging

Logs are currently not centralised. See Improved Monitoring on the ideas board.

Power

Some of the lab is connected to a cheap, basic power monitoring socket. A UPS would be the ideal future upgrade.

Updown

Updown.io is used to monitor the status of public sites and services.

The Updown status page can be viewed here.

If you’d like to try Updown, I have a referral link where we both get free stuff: ‣