github.com/nginxinc/kubernetes-ingress@v1.12.5/grafana/README.md

github.com/nginxinc/kubernetes-ingress@v1.12.5/grafana/README.md (about)

1 # Grafana Dashboard
2 We provide the official Grafana dashboard that visualizes the metrics exposed by the NGINX Ingress Controller. The dashboard allows you to filter metrics per Ingress Controller replica or see the metrics from all replicas.
3
4 ## Prerequisites
5
6 The dashboard has been tested with the following software versions:
7
8 * Grafana >= v7.2.0
9 * Prometheus >= v2.18.1
10
11 A Prometheus data source needs to be [added](https://prometheus.io/docs/visualization/grafana/#using) before installing the dashboard.
12
13 ## Installing the Dashboard
14
15 In the Grafana UI complete the following steps:
16
17 1. Use the *New Dashboard* button and click *Import*.
18 2. Upload `NGINXPlusICDashboard.json` or copy and paste the contents of the file in the textbox and click *Load*.
19 3. Set the Prometheus data source and click *Import*.
20 4. The dashboard will appear. Note how you filter metrics per namespace, per replica name and also per NGINX Plus server zone, server and upstream server (top left corner).
21
22 ![dashboard](./dashboard.png)
23
24 ## Graphs
25
26 The dashboard comes with 4 rows with the following graphs (the corresponding Prometheus metrics are in parentheses):
27
28 * Environment Metrics
29 * NGINX Plus Reload (`nginx_last_reload_status`). This graph shows the state of the last NGINX Plus reload, `Successful`/`Failed`.
30 * Last Reload Time (`nginx_last_reload_milliseconds`) graph shows duration of the last reload in milliseconds.
31 * Reloads ( `nginx_reloads_total`). This graph shows the total times NGINX Plus has reloaded.
32 * Reload Errors (`nginx_reload_errors_total`) graph shows the total number of times NGINX Plus has failed to reload.
33 * Network I/O (`nginxplus_server_zone_received` and `nginxplus_server_zone_sent`). This graphs shows the traffic sent and received by the Ingress Controller.
34 * Ingress Metrics
35 * HTTP Request Volume (`nginxplus_http_requests_total`). This graphs shows the total number of HTTP requests received by the Ingress Controller.
36 * Success Rate Over Time (`nginxplus_server_zone_responses`). This graph shows all the `1XX`, `2XX` and `3XX` status code responses the Ingress Controller has returned over time.
37 * Error Rate Over Time (`nginxplus_server_zone_responses`). This graph shows all the `4XX` and `5XX` status code responses the Ingress Controller has returned over time.
38 * Ingress Count (`ingress_resources_total`). The total number of Ingress resources the Ingress Controller is handling at that time.
39 * Handled Ingress Resource (`ingress_resources_total`). This shows the different types of Ingress resources the Ingress Controller is handling, `Master`, `Minion` or `Regular`.
40 * Ingress State (`nginxplus_up`). This shows the current state of the Ingress Controller, `up`/`down`.
41 * Request Success Rate (`nginxplus_server_zone_responses`). This graph shows all the non-`4XX` and `5XX` status code responses divided by the total status code responses.
42 * Zone Request Volume (`nginxplus_server_zone_requests`). This graph shows the total HTTP request across all server zones over the last 5 minutes.
43 * Upstream Metrics
44 * Upstream Success Rate (`nginxplus_upstream_server_responses`). This graph shows all the `1XX`, `2XX` and `3XX` status code responses of upstreams over time.
45 * Upstream Error Rate (`nginxplus_upstream_server_responses`). This graph shows all the `4XX` and `5XX` status code responses of upstreams over time.
46 * Upstream Server Count (`nginxplus_upstream_server_state`). This graph shows the count for the total number of upstream servers.
47 * Upstream Server State (`nginxplus_upstream_server_state`). This graph shows the current state of the upstream servers.
48 * SSL
49 * SSL Performance (`nginxplus_ssl_handshakes`, `nginxplus_ssl_handshakes_failed`). The graph shows the total number of SSL handshakes and the total number of SSL handshakes that failed.