github.com/muhammadn/cortex@v1.9.1-0.20220510110439-46bb7000d03d/docs/guides/gossip-ring-getting-started.md

github.com/muhammadn/cortex@v1.9.1-0.20220510110439-46bb7000d03d/docs/guides/gossip-ring-getting-started.md (about)

     1  ---
     2  title: "Getting started with gossiped ring"
     3  linkTitle: "Getting started with a gossip ring cluster"
     4  weight: 4
     5  slug: getting-started-with-gossiped-ring
     6  ---
     7  
     8  Cortex requires Key-Value (KV) store to store the ring. It can use traditional KV stores like Consul or Etcd,
     9  but it can also build its own KV store on top of memberlist library using a gossip algorithm.
    10  
    11  This short guide shows how to start Cortex in [single-binary mode](../architecture.md) with memberlist-based ring.
    12  To reduce number of required dependencies in this guide, it will use [blocks storage](../blocks-storage/_index.md) with no shipping to external stores.
    13  Storage engine and external storage configuration are not dependant on the ring configuration.
    14  
    15  ## Single-binary, two Cortex instances
    16  
    17  For simplicity and to get started, we'll run it as a two instances of Cortex on local computer.
    18  We will use prepared configuration files ([file 1](../../configuration/single-process-config-blocks-gossip-1.yaml), [file 2](../../configuration/single-process-config-blocks-gossip-2.yaml)), with no external
    19  dependencies.
    20  
    21  Build Cortex first:
    22  ```sh
    23  $ go build ./cmd/cortex
    24  ```
    25  
    26  Run two instances of Cortex, each one with its own dedicated config file:
    27  ```
    28  $ ./cortex -config.file docs/configuration/single-process-config-blocks-gossip-1.yaml
    29  $ ./cortex -config.file docs/configuration/single-process-config-blocks-gossip-2.yaml
    30  ```
    31  
    32  Download Prometheus and configure it to use our first Cortex instance for remote writes.
    33  
    34  ```yaml
    35  remote_write:
    36  - url: http://localhost:9109/api/v1/push
    37  ```
    38  
    39  After starting Prometheus, it will now start pushing data to Cortex. Distributor component in Cortex will
    40  distribute incoming samples between the two instances.
    41  
    42  To query that data, you can configure your Grafana instance to use http://localhost:9109/prometheus (first Cortex) as a Prometheus data source.
    43  
    44  ## How it works
    45  
    46  The two instances we started earlier should be able to find each other via memberlist configuration (already present in the config files):
    47  
    48  ```yaml
    49  memberlist:
    50    # defaults to hostname
    51    node_name: "Ingester 1"
    52    bind_port: 7946
    53    join_members:
    54      - localhost:7947
    55    abort_if_cluster_join_fails: false
    56  ```
    57  
    58  This tells memberlist to listen on port 7946, and connect to localhost:7947, which is the second instance.
    59  Port numbers are reversed in the second configuration file.
    60  We also need to configure `node_name` and also ingester ID (`ingester.lifecycler.id` field), because default to hostname,
    61  but we are running both Cortex instances on the same host.
    62  
    63  To make sure that both ingesters generate unique tokens, we configure `join_after` and `observe_period` to 10 seconds.
    64  First option tells Cortex to wait 10 seconds before joining the ring.  This option is normally used to tell Cortex ingester
    65  how long to wait for a potential tokens and data transfer from leaving ingester, but we also use it here to increase
    66  the chance of finding other gossip peers. When Cortex joins the ring, it generates tokens and writes them to the ring.
    67  If multiple Cortex instances do this at the same time, they can generate conflicting tokens. This can be a problem
    68  when using gossiped ring (instances may simply not see each other yet), so we use `observe_period` to watch the ring for token conflicts.
    69  If conflict is detected, new tokens are generated instead of conflicting tokens, and observe period is restarted.
    70  If no conflict is detected within the observe period, ingester switches to ACTIVE state.
    71  
    72  We are able to observe ring state on [http://localhost:9109/ring](http://localhost:9109/ring) and [http://localhost:9209/ring](http://localhost:9209/ring).
    73  The two instances may see slightly different views (eg. different timestamps), but should converge to a common state soon, with both instances
    74  being ACTIVE and ready to receive samples.
    75  
    76  ## How to add another instance?
    77  
    78  To add another Cortex to the small cluster, copy `docs/configuration/single-process-config-blocks-gossip-1.yaml` to a new file,
    79  and make following modifications. We assume that third Cortex will run on the same machine again, so we change node name and ingester ID as well. Here
    80  is annotated diff:
    81  
    82  ```diff
    83  ...
    84  
    85   server:
    86  +  # These ports need to be unique.
    87  -  http_listen_port: 9109
    88  -  grpc_listen_port: 9195
    89  +  http_listen_port: 9309
    90  +  grpc_listen_port: 9395
    91  
    92  ...
    93  
    94   ingester:
    95     lifecycler:
    96       # Defaults to hostname, but we run both ingesters in this demonstration on the same machine.
    97  -    id: "Ingester 1"
    98  +    id: "Ingester 3"
    99  
   100  ...
   101  
   102   memberlist:
   103      # defaults to hostname
   104  -   node_name: "Ingester 1"
   105  +   node_name: "Ingester 3"
   106  
   107      # bind_port needs to be unique
   108  -   bind_port: 7946
   109  +   bind_port: 7948
   110  
   111  ...
   112  
   113  +# Directory names in the `blocks_storage` > `tsdb` config ending with `...1` to end with `...3`. This is to avoid different instances
   114  +# writing in-progress data to the same directories.
   115   blocks_storage:
   116     tsdb:
   117  -    dir: /tmp/cortex/tsdb-ing1
   118  +    dir: /tmp/cortex/tsdb-ing3
   119      bucket_store:
   120  -     sync_dir: /tmp/cortex/tsdb-sync-querier1
   121  +     sync_dir: /tmp/cortex/tsdb-sync-querier3
   122  
   123  ...
   124  ```
   125  
   126  We don't need to change or add `memberlist.join_members` list. This new instance will simply join to the second one (listening on port 7947), and
   127  will discover other peers through it. When using kubernetes, suggested setup is to have a headless service pointing to all pods
   128  that want to be part of gossip cluster, and then point `join_members` to this headless service.
   129  
   130  We also don't need to change `/tmp/cortex/storage` directory in `blocks_storage.filesystem.dir` field. This is directory where all ingesters will
   131  "upload" finished blocks. This can also be an S3 or GCP storage, but for simplicity, we use local filesystem in this example.
   132  
   133  After these changes, we can start another Cortex instance using the modified configuration file. This instance will join the ring
   134  and will start receiving samples after it enters into ACTIVE state.