solana/metrics/metrics-main
joeaba eedb92a6c0
refactor container status check (#30998)
* refactor container status check

* remove blank line at EOF

* add pagerduty integration

Co-authored-by: axleiro <83293196+axleiro@users.noreply.github.com>

* fix discord webhook reference

* remove webhook references

---------

Co-authored-by: axleiro <83293196+axleiro@users.noreply.github.com>
2023-03-30 22:35:21 -05:00
..
README.md
alertmanager-discord.sh
alertmanager.sh refactor container status check (#30998) 2023-03-30 22:35:21 -05:00
alertmanager.yml
chronograf.sh
chronograf_8889.sh
first_rules.yml
grafana-metrics.solana.com.ini
grafana.sh
host.sh
kapacitor.conf
kapacitor.sh
prometheus.sh
prometheus.yml
start.sh
status.sh refactor container status check (#30998) 2023-03-30 22:35:21 -05:00

README.md

image

Services:

  1. Prometheus
  2. AlertManager
  3. Chronograf2 (on port 8888)
  4. Chronograf_8889 (on port 8889)
  5. Grafana (on port 3000)
  6. Grafana2 (on port 3001)
  7. Kapacitor

To install all the services on the metrics-internal server, you need to run the ./start.sh script.

Install the Buildkite-agent to run the pipeline to get the status of the container.

If any of the containers is not in running state or in exited state then it will redeploy the container as per the specific container status.

Note: If you delete or remove the container manually then you can also run the script to redeploy it again.