Devops. Monitoring

Overview

https://en.wikipedia.org/wiki/Monitoring

Simple Monitoring with Linux cmd

free -m

https://www.linuxatemyram.com/ To see how much ram your applications could use without swapping, run free -m and look at the "available" column (On installations from before 2016, look at "free" column in the "-/+ buffers/cache" row instead.) When should I start to worry?

A healthy Linux system with more than enough memory will, after running for a while, show the following expected and harmless behavior:

Warning signs of a genuine low memory situation that you may want to look into:

df -h

docker system df

docker stats

With the docker system df command you would get a summary of your Docker usage including things like:

du -hs /var/lib/docker
du -hs /var/lib/docker/containers
du -hs /var/lib/docker/overlay2
du -hs /var/lib/docker/volumes

For example:

du -hs /var/lib/docker/volumes/elk_elasticsearch
du -hs /var/lib/docker/volumes/monitoring_loki-data

https://docs.docker.com/config/containers/resource_constraints/#understand-the-risks-of-running-out-of-memory

Monitoring system with Prometheus and Grafana

Prometheus

https://prometheus.io/

Prometheus is a free software application used for event monitoring and alerting. It records metrics in a time series database (allowing for high dimensionality) built using an HTTP pull model, with flexible queries and real-time alerting. The project is written in Go and licensed under the Apache 2 License, with source code available on GitHub, and is a graduated project of the Cloud Native Computing Foundation, along with Kubernetes and Envoy. https://en.wikipedia.org/wiki/Prometheus_(software)

https://github.com/prometheus/client_java examples for java

https://hub.docker.com/r/prom/prometheus/ official docker image

https://docs.docker.com/config/daemon/prometheus/ Collect Docker metrics with Prometheus

Exporters (for Prometheus)

https://prometheus.io/docs/instrumenting/exporters/

Some of the most popular exporters:

Exporter for windows

https://www.devopsschool.com/blog/how-to-install-windows-exporter-for-prometheus/

You can view metrics on http://yourhost:9182/metrics

Configuration in Prometheus (example for two hosts 192.168.1.74 and 192.168.1.75) :

Dashboard for Grafana

https://grafana.com/grafana/dashboards/14694-windows-exporter-dashboard/ ID: 14694

Node exporter

https://github.com/prometheus/node_exporter

Installation for Linux https://www.fosstechnix.com/install-prometheus-node-exporter-on-linux/

cAdvisor

https://github.com/google/cadvisor cAdvisor (Container Advisor) provides container users an understanding of the resource usage and performance characteristics of their running containers. It is a running daemon that collects, aggregates, processes, and exports information about running containers. Specifically, for each container it keeps resource isolation parameters, historical resource usage, histograms of complete historical resource usage and network statistics. This data is exported by container and machine-wide.

docker run \
  --volume=/:/rootfs:ro \
  --volume=/var/run:/var/run:ro \
  --volume=/sys:/sys:ro \
  --volume=/var/lib/docker/:/var/lib/docker:ro \
  --volume=/dev/disk/:/dev/disk:ro \
  --publish=8080:8080 \
  --detach=true \
  --name=cadvisor \
  google/cadvisor:latest

blackbox_exporter

https://github.com/prometheus/blackbox_exporter/blob/master/README.md

https://medium.com/the-telegraph-engineering/how-prometheus-and-the-blackbox-exporter-makes-monitoring-microservice-endpoints-easy-and-free-of-a986078912ee

https://gitlab.com/gitlab-com/runbooks/blob/master/alerts/test-alert.yml

Micrometer

Micrometer provides a simple facade over the instrumentation clients for a number of popular monitoring systems. Currently, it supports the following monitoring systems: Atlas, Datadog, Graphite, Ganglia, Influx, JMX, and Prometheus. https://www.baeldung.com/micrometer

Example urls:

http://localhost:8080/actuator/metrics main metrics

http://localhost:8080/actuator/health probes

http://localhost:8080/actuator/health/readiness

http://localhost:8080/actuator/health/liveness

http://localhost:8080/actuator/info information

Configuration (example) in Prometheus

- job_name: "spring"
  metrics_path: '/actuator/prometheus'
  static_configs:
    - targets: ['192.168.1.149:8080']

Grafana

Grafana is a multi-platform open source analytics and interactive visualization web application. It provides charts, graphs, and alerts for the web when connected to supported data sources. End users can create complex monitoring dashboards using interactive query builders. Grafana is divided into a front end and back end, written in TypeScript and Go, respectively.As a visualization tool, Grafana is a popular component in monitoring stacks, often used in combination with time series databases such as InfluxDB, Prometheus and Graphite; monitoring platforms such as Sensu, Icinga, Checkmk, Zabbix, Netdata, and PRTG; SIEMs such as Elasticsearchand Splunk; and other data sources. The Grafana user interface was originally based on version 3 of Kibana. https://en.wikipedia.org/wiki/Grafana

Userful dashboards (grafana)

https://grafana.com/grafana/dashboards/1860-node-exporter-full/ ( https://grafana.com/docs/grafana-cloud/quickstart/docker-compose-linux/ )

https://grafana.com/grafana/dashboards/14282-cadvisor-exporter/

{#id33} https://grafana.com/grafana/dashboards/4701-jvm-micrometer/ - 4701

https://grafana.com/grafana/dashboards/5373-micrometer-spring-throughput/ - 5373 {#id74}

https://grafana.com/dashboards/1860 node exporter full

https://grafana.com/dashboards/893 cadvisor + System

https://grafana.com/dashboards/193 cadvisor Importing pre-built dashboards from Grafana.com

Grafana.com maintains a collection of shared dashboards which can be downloaded and used with standalone instances of Grafana. Use the Grafana.com "Filter" option to browse dashboards for the "Prometheus" data source only. You must currently manually edit the downloaded JSON files and correct the datasource: entries to reflect the Grafana data source name which you chose for your Prometheus server. Use the "Dashboards" → "Home" → "Import" option to import the edited dashboard file into your Grafana install.

VictoriaMetrics {#importing-pre-built-dashboards-from-grafana-com0}

https://victoriametrics.com/

VictoriaMetrics is a powerful time-series database designed for high-performance monitoring and analytics. It is similar to Prometheus in many ways, but it offers several advantages, including better performance, scalability, and data compression. VictoriaMetrics is also open-source and free to use. VictoriaMetrics is a monitoring system that is fully compatible with Prometheus. It offers many of the same features as Prometheus, but with some additional benefits, such as higher performance and scalability. https://medium.com/@seifeddinerajhi/victoriametrics-a-comprehensive-guide-comparing-it-to-prometheus-and-implementing-kubernetes-03eb8feb0cc2

victoria-metrics-ui

Spring Boot Admin

https://github.com/codecentric/spring-boot-admin

Documentation: https://docs.spring-boot-admin.com/current/getting-started.html

https://www.baeldung.com/spring-boot-admin