Generate dashboard and alerts for CF component metrics from Indicator Protocol
The monitoring-indicator-protocol (see https://github.com/pivotal/monitoring-indicator-protocol/wiki) is a format proposed by the Cloud Foundry/Pivotal teams that lets core CF components complement their metrics with expected normal value ranges, as well as abnormal ranges associated with alert definitions.
This enables CF component teams, who know best the target SLIs for their component, to maintain such business dashboards and alerts (e.g. #226 for the CAPI team). According to @jtuchscherer (who presented the indicator protocol at the CF Summit Philly 2019 unconference), adoption in CF components is in progress: the loggregator team has implemented it and the diego team is starting. See https://github.com/search?q=indicators.yml+in%3Apath+user%3Acloudfoundry&type=Code
The indicator-format command can generate a Grafana JSON dashboard and Prometheus alerting rules from an indicator protocol yml file.
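To make the idea concrete, here is a minimal sketch of how an indicator with thresholds could be turned into Prometheus alerting rules. This is not how indicator-format is implemented. The indicator field names (`name`, `promql`, `thresholds` with `level`/`operator`/`value`), the metric and the helper functions are all assumptions for illustration and may not match the real indicator protocol schema. Only the output shape (`groups` / `rules` / `alert` / `expr` / `labels`) follows the Prometheus rule file layout.

```python
import json

# Hypothetical, simplified indicator. Field names and values are assumptions
# for illustration, not the actual indicator protocol schema.
EXAMPLE_INDICATOR = {
    "name": "router_latency_ms",
    "promql": "avg_over_time(latency[5m])",
    "thresholds": [
        {"level": "warning", "operator": "gt", "value": 100},
        {"level": "critical", "operator": "gt", "value": 500},
    ],
}

# Map symbolic operators (assumed names) to PromQL comparison operators.
OPERATORS = {"lt": "<", "lte": "<=", "eq": "==", "neq": "!=", "gte": ">=", "gt": ">"}


def to_alert_rules(indicator):
    """Build one Prometheus alerting rule (as a dict) per threshold."""
    rules = []
    for threshold in indicator["thresholds"]:
        comparison = OPERATORS[threshold["operator"]]
        rules.append({
            "alert": f"{indicator['name']}_{threshold['level']}",
            "expr": f"({indicator['promql']}) {comparison} {threshold['value']}",
            "labels": {"severity": threshold["level"]},
        })
    return rules


def to_rule_group(group_name, indicators):
    """Wrap the rules of several indicators in a single rule group."""
    rules = []
    for indicator in indicators:
        rules.extend(to_alert_rules(indicator))
    return {"groups": [{"name": group_name, "rules": rules}]}


if __name__ == "__main__":
    # Printed as JSON for inspection; a real rule file would usually be YAML.
    print(json.dumps(to_rule_group("example", [EXAMPLE_INDICATOR]), indent=2))
```

The point of the sketch is that each threshold maps to one alert expression, so a component team only has to maintain the indicator definition and the rules can be regenerated from it.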
The monitoring-indicator-protocol repo provides a mechanism to distribute indicator protocol files shipped within CF component BOSH releases so that they are available to Prometheus; see indicator-registration-agent and indicator-registry.
It would be great to leverage this mechanism to generate dashboards and alerts for CF component metrics from the Indicator Protocol.
/CC @psycofdj
/cc @joerodriguez
@gberche-orange Happy to pair on this, how would you like to proceed?
If you don't mind me asking, did this fizzle out because of lack of interest, or because a hindrance was discovered that would prevent this from proceeding (or otherwise being a huge pain)?
Sorry for the late reply on this. @joerodriguez I unfortunately lack the golang skills to pair on this.
However, @aegershman, I understand from the cf-for-k8s Metrics Guide that CF4K8S components include dashboards automatically generated from the indicator protocol, and possibly tailored dashboards per component.
It might be worth looking for progress in this direction. Please keep this issue updated if you find any related progress.
Closed due to inactivity. Please feel free to re-open if the issue still persists.