ARGO currently uses Nagios to check the availability of services and produces great availability reports. From a service management point of view, I interpret this as fulfilling the Service Availability and Continuity Management role: if the service goes down, someone gets informed, and you can write a report saying how long the service has been down compared to the agreed level.
I would like to be able to do something similar, but for Capacity. From what I’ve seen so far, ARGO does not seem to provide any metrics related to utilisation. I want to be able to answer questions such as:
- how much of the resource is available to consume? (if it’s a fixed resource, like storage space or memory)
- how long are service requests taking?
I read the ARGO probe guide, which says that there is room for adding performance metrics to the probes, as explained in the Nagios documentation, but I wonder how and where those metrics would be displayed.
Would it be feasible to have it in ARGO?
Would it be feasible to consume Nagios data from ARGO and feed it into a dashboard, for example with Grafana?
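
To make the question concrete, here is a minimal sketch of the kind of output I imagine a capacity probe producing. It assumes the standard Nagios plugin convention of appending performance data after a `|` in the form `label=value[UOM];warn;crit;min;max`. The function names, metric labels and thresholds are hypothetical and are not taken from ARGO or any existing probe.

```python
def perfdata(label, value, uom="", warn="", crit="", minimum="", maximum=""):
    """Format one metric as Nagios performance data.

    Trailing empty fields are dropped, so a bare metric becomes "label=valueUOM".
    Labels are assumed to contain no spaces (otherwise Nagios expects quoting).
    """
    fields = [f"{label}={value}{uom}", str(warn), str(crit), str(minimum), str(maximum)]
    return ";".join(fields).rstrip(";")


def build_capacity_output(used_bytes, total_bytes, response_seconds,
                          warn_pct=80, crit_pct=90):
    """Build the single output line a capacity probe could print.

    warn_pct and crit_pct are illustrative thresholds, not agreed service levels.
    """
    used_pct = round(100 * used_bytes / total_bytes, 1)
    if used_pct >= crit_pct:
        status = "CRITICAL"
    elif used_pct >= warn_pct:
        status = "WARNING"
    else:
        status = "OK"
    metrics = [
        # How much of the fixed resource is consumed.
        perfdata("storage_used", used_pct, "%", warn_pct, crit_pct, 0, 100),
        # How long a service request took.
        perfdata("response_time", round(response_seconds, 3), "s"),
    ]
    # Human-readable text before the pipe, machine-readable metrics after it.
    return f"{status} - {used_pct}% of storage used | {' '.join(metrics)}"
```

A real probe would also print this line and exit with the matching Nagios status code (0 for OK, 1 for WARNING, 2 for CRITICAL). My question is really about what happens to the part after the `|` once ARGO has run the probe.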