solana/metrics/metrics-main
Will Hickey 3293ca4a52
Add canaries to metrics config (#31517)
2023-05-10 12:12:50 -05:00
..
README.md update influx enterprise scripts (#31117) 2023-04-10 09:10:54 -05:00
alertmanager-discord.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
alertmanager.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
alertmanager.yml
chronograf.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
chronograf_8889.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
first_rules.yml
grafana-metrics.solana.com.ini
grafana.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
host.sh
kapacitor.conf
kapacitor.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
prometheus.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
prometheus.yml Add canaries to metrics config (#31517) 2023-05-10 12:12:50 -05:00
start.sh increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
status.sh update metrics status scripts (#31037) 2023-04-04 09:03:57 -05:00

README.md

image

Services:

  1. Prometheus
  2. AlertManager
  3. Chronograf (on port 8888)
  4. Chronograf_8889 (on port 8889)
  5. Grafana (on port 3000)
  6. AlertManager_Discord
  7. Kapacitor

To install all the services on the metrics-main server you need to run the start.sh script.

Install the Buildkite-agent to run the status.sh script to periodically check for the status of the containers.

If any of the containers is not in running state or in exited state then it will try to redeploy the container, if it fails to do so an alert will be triggered to Discord and PagerDuty.

Note: If you deleted or removed any of containers manually you need to run the start.sh script.