From 313fa1dbac610432b7f4c47f656b05d1a81d787c Mon Sep 17 00:00:00 2001 From: Ubbo Veentjer <veentjer@sub.uni-goettingen.de> Date: Fri, 8 May 2020 17:52:51 +0200 Subject: [PATCH] some notes on metrics. belongs to #13 --- rdd-technical-reference.md | 19 ++++++++++++++++--- 1 file changed, 16 insertions(+), 3 deletions(-) diff --git a/rdd-technical-reference.md b/rdd-technical-reference.md index 947aa07..d6e415a 100644 --- a/rdd-technical-reference.md +++ b/rdd-technical-reference.md @@ -251,13 +251,26 @@ There is a [full and documented example](https://gitlab.gwdg.de/SADE/SADE/blob/d For server configuration and setup we use puppet for most servers. The main puppet code is provided in GitLab <https://gitlab.gwdg.de/dariah-de-puppet>. The DARIAH-DE and TextGrid Repository module (dhrep) is contained in Github <https://github.com/DARIAH-DE/puppetmodule-dhrep>. -### Monitoring +## Monitoring - Icinga probes for DARIAH-DE services <https://icinga.de.dariah.eu/icinga> -- Metrics for Sever specific monitoring <https://metrics.gwdg.de> +### Metrics for Sever specific monitoring <https://metrics.gwdg.de> -### Release Management +To view real time metrics from our servers or applications we use [Grafana](https://grafana.com/), which is available at <https://metrics.gwdg.de> inside GoeNet. +Grafana retrieves its data from influxdb. [Telegraf](https://github.com/influxdata/telegraf)can be used to store data from the servers in that database. It is easy to enable telegraf on puppet configured servers. Telegraf stores metrics from the server in the influxdb. + +Some system stats monitored by telegraf in our current puppet setup: + +* CPU +* Memory +* Space +* Apache +* ... + +A list of stats telegraf collects stats with the input plugin, a list is [available](https://github.com/influxdata/telegraf#input-plugins). + +## Release Management # Code quality level for RDD -- GitLab