|
| 1 | +.. _tcm_cluster_monitoring: |
| 2 | + |
| 3 | +Cluster monitoring |
| 4 | +================== |
| 5 | + |
| 6 | +|tcm_full_name| provides means for monitoring various aspects of connected clusters, |
| 7 | +such as: |
| 8 | + |
| 9 | +* topology |
| 10 | +* instance state |
| 11 | +* memory usage |
| 12 | +* data distribution |
| 13 | +* Tarantool versions |
| 14 | + |
| 15 | +Cluster monitoring tools are available on the **Cluster** > **Stateboard** page. |
| 16 | + |
| 17 | +.. _tcm_cluster_monitoring_topology: |
| 18 | + |
| 19 | +Cluster topology |
| 20 | +---------------- |
| 21 | + |
| 22 | +The cluster topology is displayed on the **Stateboard** page in one of two forms: |
| 23 | +a list or a graph. |
| 24 | + |
| 25 | +.. _tcm_cluster_monitoring_topology_list: |
| 26 | + |
| 27 | +List view |
| 28 | +~~~~~~~~~ |
| 29 | + |
| 30 | +The list view of the cluster topology is used by default. In this view, each row contains |
| 31 | +the general information about an instance: its current state, memory usage and limit, |
| 32 | +and other parameters. |
| 33 | + |
| 34 | +In the list view, |tcm| additionally displays the Tarantool version information |
| 35 | +and instance states on circle diagrams. You can click the sectors of these diagrams |
| 36 | +to filter the instances with the selected versions and states. |
| 37 | + |
| 38 | +To switch to the list view, click the list button on the right of the search bar on the **Stateboard** page. |
| 39 | + |
| 40 | +.. _tcm_cluster_monitoring_topology_graph: |
| 41 | + |
| 42 | +Graph view |
| 43 | +~~~~~~~~~~ |
| 44 | + |
| 45 | +The graph view of the cluster topology is shown in a tree-like structure where |
| 46 | +leafs are the cluster's instances. Each instance's state is shown by its color. |
| 47 | +You can move the graph vertexes to arrange them as you like, and zoom in and out, |
| 48 | +which is helpful for bigger clusters. |
| 49 | + |
| 50 | +To switch to the graph view, click the graph button on the right of the search bar on the **Stateboard** page. |
| 51 | + |
| 52 | +.. _tcm_cluster_monitoring_topology_group: |
| 53 | + |
| 54 | +Instance grouping |
| 55 | +~~~~~~~~~~~~~~~~~ |
| 56 | + |
| 57 | +By default, the cluster topology is shown hierarchically as it's defined in the configuration: |
| 58 | +instances are grouped by their replica set, and replica sets are grouped by |
| 59 | +their configuration group. |
| 60 | + |
| 61 | +For better navigation across the cluster, you can adjust the instance grouping. |
| 62 | +For example, you can group instances by their roles or custom tags defined in the configuration. |
| 63 | +A typical case for such tags is adding a geographical markers to instances. In this case, |
| 64 | +you see if issues happen in a specific data center or server. |
| 65 | + |
| 66 | +To change the instance grouping, click **Group by** in the **Actions** menu on the **Stateboard** page. |
| 67 | +Then add or remove grouping criteria. |
| 68 | + |
| 69 | +.. _tcm_cluster_monitoring_topology_filters: |
| 70 | + |
| 71 | +Filtering |
| 72 | +~~~~~~~~~ |
| 73 | + |
| 74 | +You can filter the instances shown on the **Stateboard** page using the search bar |
| 75 | +at the top. It has predefined filters that select: |
| 76 | + |
| 77 | +* instances with errors or warnings |
| 78 | +* leader or read-only instances |
| 79 | +* instances with no issues |
| 80 | +* stale instances |
| 81 | + |
| 82 | +To display all instances, delete the filter applied in the search bar. |
| 83 | + |
| 84 | +.. _tcm_cluster_monitoring_instance_details: |
| 85 | + |
| 86 | +Instance details |
| 87 | +---------------- |
| 88 | + |
| 89 | +The general information about the state of cluster instances is shown in the |
| 90 | +list view of the cluster topology. Each row contains the information about the instance |
| 91 | +status, used and available memory, read-only status, and virtual buckets for sharded |
| 92 | +clusters. |
| 93 | + |
| 94 | +To view the detailed information about an instance or connect to it, click the corresponding |
| 95 | +row in the instances list or a vertex of the graph. On the instance page, you can |
| 96 | +find: |
| 97 | + |
| 98 | +* its configuration overview |
| 99 | +* current state (with warning and error messages if any) |
| 100 | +* the detailed Tarantool information returned by the instance introspection functions |
| 101 | + from :ref:`box.info <box_introspection-box_info>`, :ref:`box.stat <box_introspection-box_stat>`, |
| 102 | + and other built-in modules. |
| 103 | + |
| 104 | +Additionally, on the instance details page there is a terminal in which you can execute |
| 105 | +arbitrary Lua code on the instance. |
| 106 | + |
| 107 | +.. _tcm_cluster_monitoring_urls: |
| 108 | + |
| 109 | +Additional monitoring services |
| 110 | +------------------------------ |
| 111 | + |
| 112 | +When you :ref:`connect a cluster <tcm_connect_clusters>` to |tcm|, you can specify |
| 113 | +URLs of external services linked to this cluster. For example, this can be a Grafana |
| 114 | +server that monitors the cluster metrics. |
| 115 | + |
| 116 | +All the URLs added for a cluster are available for quick access in the **Actions** |
| 117 | +menu on the **Stateboard** page. |
0 commit comments