github.com/muhammadn/cortex@v1.9.1-0.20220510110439-46bb7000d03d/CHANGELOG.md

github.com/muhammadn/cortex@v1.9.1-0.20220510110439-46bb7000d03d/CHANGELOG.md (about)

     1  # Changelog
     2  
     3  ## master / unreleased
     4  
     5  * [CHANGE] Changed default for `-ingester.min-ready-duration` from 1 minute to 15 seconds. #4539
     6  * [ENHANCEMENT] Added new ring related config `-ingester.readiness-check-ring-health` when enabled the readiness probe will succeed only after all instances are ACTIVE and healthy in the ring, this is enabled by default. #4539
     7  * [ENHANCEMENT] Added new ring related config `-distributor.excluded-zones` when set this will exclude the comma-separated zones from the ring, default is "". #4539
     8  * [ENHANCEMENT] Upgraded Docker base images to `alpine:3.14`. #4514
     9  * [ENHANCEMENT] Updated Prometheus to latest. Includes changes from prometheus#9239, adding 15 new functions. Multiple TSDB bugfixes prometheus#9438 & prometheus#9381. #4524
    10  
    11  ## 1.11.0-rc.0 in progress
    12  
    13  * [CHANGE] Memberlist: Expose default configuration values to the command line options. Note that setting these explicitly to zero will no longer cause the default to be used. If the default is desired, then do set the option. The following are affected: #4276
    14    - `-memberlist.stream-timeout`
    15    - `-memberlist.retransmit-factor`
    16    - `-memberlist.pull-push-interval`
    17    - `-memberlist.gossip-interval`
    18    - `-memberlist.gossip-nodes`
    19    - `-memberlist.gossip-to-dead-nodes-time`
    20    - `-memberlist.dead-node-reclaim-time`
    21  * [CHANGE] `-querier.max-fetched-chunks-per-query` previously applied to chunks from ingesters and store separately; now the two combined should not exceed the limit. #4260
    22  * [CHANGE] Memberlist: the metric `memberlist_kv_store_value_bytes` has been removed due to values no longer being stored in-memory as encoded bytes. #4345
    23  * [CHANGE] Some files and directories created by Cortex components on local disk now have stricter permissions, and are only readable by owner, but not group or others. #4394
    24  * [CHANGE] The metric `cortex_deprecated_flags_inuse_total` has been renamed to `deprecated_flags_inuse_total` as part of using grafana/dskit functionality. #4443
    25  * [FEATURE] Ruler: Add new `-ruler.query-stats-enabled` which when enabled will report the `cortex_ruler_query_seconds_total` as a per-user metric that tracks the sum of the wall time of executing queries in the ruler in seconds. #4317
    26  * [FEATURE] Query Frontend: Add `cortex_query_fetched_series_total` and `cortex_query_fetched_chunks_bytes_total` per-user counters to expose the number of series and bytes fetched as part of queries. These metrics can be enabled with the `-frontend.query-stats-enabled` flag (or its respective YAML config option `query_stats_enabled`). #4343
    27  * [FEATURE] AlertManager: Add support for SNS Receiver. #4382
    28  * [FEATURE] Distributor: Add label `status` to metric `cortex_distributor_ingester_append_failures_total` #4442
    29  * [FEATURE] Queries: Added `present_over_time` PromQL function, also some TSDB optimisations. #4505
    30  * [ENHANCEMENT] Add timeout for waiting on compactor to become ACTIVE in the ring. #4262
    31  * [ENHANCEMENT] Reduce memory used by streaming queries, particularly in ruler. #4341
    32  * [ENHANCEMENT] Ring: allow experimental configuration of disabling of heartbeat timeouts by setting the relevant configuration value to zero. Applies to the following: #4342
    33    * `-distributor.ring.heartbeat-timeout`
    34    * `-ring.heartbeat-timeout`
    35    * `-ruler.ring.heartbeat-timeout`
    36    * `-alertmanager.sharding-ring.heartbeat-timeout`
    37    * `-compactor.ring.heartbeat-timeout`
    38    * `-store-gateway.sharding-ring.heartbeat-timeout`
    39  * [ENHANCEMENT] Ring: allow heartbeats to be explicitly disabled by setting the interval to zero. This is considered experimental. This applies to the following configuration options: #4344
    40    * `-distributor.ring.heartbeat-period`
    41    * `-ingester.heartbeat-period`
    42    * `-ruler.ring.heartbeat-period`
    43    * `-alertmanager.sharding-ring.heartbeat-period`
    44    * `-compactor.ring.heartbeat-period`
    45    * `-store-gateway.sharding-ring.heartbeat-period`
    46  * [ENHANCEMENT] Memberlist: optimized receive path for processing ring state updates, to help reduce CPU utilization in large clusters. #4345
    47  * [ENHANCEMENT] Memberlist: expose configuration of memberlist packet compression via `-memberlist.compression=enabled`. #4346
    48  * [ENHANCEMENT] Update Go version to 1.16.6. #4362
    49  * [ENHANCEMENT] Updated Prometheus to include changes from prometheus/prometheus#9083. Now whenever `/labels` API calls include matchers, blocks store is queried for `LabelNames` with matchers instead of `Series` calls which was inefficient. #4380
    50  * [ENHANCEMENT] Exemplars are now emitted for all gRPC calls and many operations tracked by histograms. #4462
    51  * [ENHANCEMENT] New options `-server.http-listen-network` and `-server.grpc-listen-network` allow binding as 'tcp4' or 'tcp6'. #4462
    52  * [ENHANCEMENT] Rulers: Using shuffle sharding subring on GetRules API. #4466
    53  * [ENHANCEMENT] Support memcached auto-discovery via `auto-discovery` flag, introduced by thanos in https://github.com/thanos-io/thanos/pull/4487. Both AWS and Google Cloud memcached service support auto-discovery, which returns a list of nodes of the memcached cluster. #4412
    54  * [ENHANCEMENT] Query federation: improve performance in MergeQueryable by memoizing labels. #4502
    55  * [BUGFIX] Fixes a panic in the query-tee when comparing result. #4465
    56  * [BUGFIX] Frontend: Fixes @ modifier functions (start/end) when splitting queries by time. #4464
    57  * [BUGFIX] Compactor: compactor will no longer try to compact blocks that are already marked for deletion. Previously compactor would consider blocks marked for deletion within `-compactor.deletion-delay / 2` period as eligible for compaction. #4328
    58  * [BUGFIX] HA Tracker: when cleaning up obsolete elected replicas from KV store, tracker didn't update number of cluster per user correctly. #4336
    59  * [BUGFIX] Ruler: fixed counting of PromQL evaluation errors as user-errors when updating `cortex_ruler_queries_failed_total`. #4335
    60  * [BUGFIX] Ingester: When using block storage, prevent any reads or writes while the ingester is stopping. This will prevent accessing TSDB blocks once they have been already closed. #4304
    61  * [BUGFIX] Ingester: fixed ingester stuck on start up (LEAVING ring state) when `-ingester.heartbeat-period=0` and `-ingester.unregister-on-shutdown=false`. #4366
    62  * [BUGFIX] Ingester: panic during shutdown while fetching batches from cache. #4397
    63  * [BUGFIX] Querier: After query-frontend restart, querier may have lower than configured concurrency. #4417
    64  * [BUGFIX] Memberlist: forward only changes, not entire original message. #4419
    65  * [BUGFIX] Memberlist: don't accept old tombstones as incoming change, and don't forward such messages to other gossip members. #4420
    66  * [BUGFIX] Querier: fixed panic when querying exemplars and using `-distributor.shard-by-all-labels=false`. #4473
    67  * [BUGFIX] Querier: honor querier minT,maxT if `nil` SelectHints are passed to Select(). #4413
    68  * [BUGFIX] Compactor: fixed panic while collecting Prometheus metrics. #4483
    69  * [BUGFIX] AlertManager: remove stale template files. #4495
    70  
    71  
    72  ## 1.10.0 / 2021-08-03
    73  
    74  * [CHANGE] Prevent path traversal attack from users able to control the HTTP header `X-Scope-OrgID`. #4375 (CVE-2021-36157)
    75    * Users only have control of the HTTP header when Cortex is not frontend by an auth proxy validating the tenant IDs
    76  * [CHANGE] Enable strict JSON unmarshal for `pkg/util/validation.Limits` struct. The custom `UnmarshalJSON()` will now fail if the input has unknown fields. #4298
    77  * [CHANGE] Cortex chunks storage has been deprecated and it's now in maintenance mode: all Cortex users are encouraged to migrate to the blocks storage. No new features will be added to the chunks storage. The default Cortex configuration still runs the chunks engine; please check out the [blocks storage doc](https://cortexmetrics.io/docs/blocks-storage/) on how to configure Cortex to run with the blocks storage.  #4268
    78  * [CHANGE] The example Kubernetes manifests (stored at `k8s/`) have been removed due to a lack of proper support and maintenance. #4268
    79  * [CHANGE] Querier / ruler: deprecated `-store.query-chunk-limit` CLI flag (and its respective YAML config option `max_chunks_per_query`) in favour of `-querier.max-fetched-chunks-per-query` (and its respective YAML config option `max_fetched_chunks_per_query`). The new limit specifies the maximum number of chunks that can be fetched in a single query from ingesters and long-term storage: the total number of actual fetched chunks could be 2x the limit, being independently applied when querying ingesters and long-term storage. #4125
    80  * [CHANGE] Alertmanager: allowed to configure the experimental receivers firewall on a per-tenant basis. The following CLI flags (and their respective YAML config options) have been changed and moved to the limits config section: #4143
    81    - `-alertmanager.receivers-firewall.block.cidr-networks` renamed to `-alertmanager.receivers-firewall-block-cidr-networks`
    82    - `-alertmanager.receivers-firewall.block.private-addresses` renamed to `-alertmanager.receivers-firewall-block-private-addresses`
    83  * [CHANGE] Change default value of `-server.grpc.keepalive.min-time-between-pings` from `5m` to `10s` and `-server.grpc.keepalive.ping-without-stream-allowed` to `true`. #4168
    84  * [CHANGE] Ingester: Change default value of `-ingester.active-series-metrics-enabled` to `true`. This incurs a small increase in memory usage, between 1.2% and 1.6% as measured on ingesters with 1.3M active series. #4257
    85  * [CHANGE] Dependency: update go-redis from v8.2.3 to v8.9.0. #4236
    86  * [FEATURE] Querier: Added new `-querier.max-fetched-series-per-query` flag. When Cortex is running with blocks storage, the max series per query limit is enforced in the querier and applies to unique series received from ingesters and store-gateway (long-term storage). #4179
    87  * [FEATURE] Querier/Ruler: Added new `-querier.max-fetched-chunk-bytes-per-query` flag. When Cortex is running with blocks storage, the max chunk bytes limit is enforced in the querier and ruler and limits the size of all aggregated chunks returned from ingesters and storage as bytes for a query. #4216
    88  * [FEATURE] Alertmanager: support negative matchers, time-based muting - [upstream release notes](https://github.com/prometheus/alertmanager/releases/tag/v0.22.0). #4237
    89  * [FEATURE] Alertmanager: Added rate-limits to notifiers. Rate limits used by all integrations can be configured using `-alertmanager.notification-rate-limit`, while per-integration rate limits can be specified via `-alertmanager.notification-rate-limit-per-integration` parameter. Both shared and per-integration limits can be overwritten using overrides mechanism. These limits are applied on individual (per-tenant) alertmanagers. Rate-limited notifications are failed notifications. It is possible to monitor rate-limited notifications via new `cortex_alertmanager_notification_rate_limited_total` metric. #4135 #4163
    90  * [FEATURE] Alertmanager: Added `-alertmanager.max-config-size-bytes` limit to control size of configuration files that Cortex users can upload to Alertmanager via API. This limit is configurable per-tenant. #4201
    91  * [FEATURE] Alertmanager: Added `-alertmanager.max-templates-count` and `-alertmanager.max-template-size-bytes` options to control number and size of templates uploaded to Alertmanager via API. These limits are configurable per-tenant. #4223
    92  * [FEATURE] Added flag `-debug.block-profile-rate` to enable goroutine blocking events profiling. #4217
    93  * [FEATURE] Alertmanager: The experimental sharding feature is now considered complete. Detailed information about the configuration options can be found [here for alertmanager](https://cortexmetrics.io/docs/configuration/configuration-file/#alertmanager_config) and [here for the alertmanager storage](https://cortexmetrics.io/docs/configuration/configuration-file/#alertmanager_storage_config). To use the feature: #3925 #4020 #4021 #4031 #4084 #4110 #4126 #4127 #4141 #4146 #4161 #4162 #4222
    94    * Ensure that a remote storage backend is configured for Alertmanager to store state using `-alertmanager-storage.backend`, and flags related to the backend. Note that the `local` and `configdb` storage backends are not supported.
    95    * Ensure that a ring store is configured using `-alertmanager.sharding-ring.store`, and set the flags relevant to the chosen store type.
    96    * Enable the feature using `-alertmanager.sharding-enabled`.
    97    * Note the prior addition of a new configuration option `-alertmanager.persist-interval`. This sets the interval between persisting the current alertmanager state (notification log and silences) to object storage. See the [configuration file reference](https://cortexmetrics.io/docs/configuration/configuration-file/#alertmanager_config) for more information.
    98  * [ENHANCEMENT] Alertmanager: Cleanup persisted state objects from remote storage when a tenant configuration is deleted. #4167
    99  * [ENHANCEMENT] Storage: Added the ability to disable Open Census within GCS client (e.g `-gcs.enable-opencensus=false`). #4219
   100  * [ENHANCEMENT] Etcd: Added username and password to etcd config. #4205
   101  * [ENHANCEMENT] Alertmanager: introduced new metrics to monitor operation when using `-alertmanager.sharding-enabled`: #4149
   102    * `cortex_alertmanager_state_fetch_replica_state_total`
   103    * `cortex_alertmanager_state_fetch_replica_state_failed_total`
   104    * `cortex_alertmanager_state_initial_sync_total`
   105    * `cortex_alertmanager_state_initial_sync_completed_total`
   106    * `cortex_alertmanager_state_initial_sync_duration_seconds`
   107    * `cortex_alertmanager_state_persist_total`
   108    * `cortex_alertmanager_state_persist_failed_total`
   109  * [ENHANCEMENT] Blocks storage: support ingesting exemplars and querying of exemplars.  Enabled by setting new CLI flag `-blocks-storage.tsdb.max-exemplars=<n>` or config option `blocks_storage.tsdb.max_exemplars` to positive value. #4124 #4181
   110  * [ENHANCEMENT] Distributor: Added distributors ring status section in the admin page. #4151
   111  * [ENHANCEMENT] Added zone-awareness support to alertmanager for use when sharding is enabled. When zone-awareness is enabled, alerts will be replicated across availability zones. #4204
   112  * [ENHANCEMENT] Added `tenant_ids` tag to tracing spans #4186
   113  * [ENHANCEMENT] Ring, query-frontend: Avoid using automatic private IPs (APIPA) when discovering IP address from the interface during the registration of the instance in the ring, or by query-frontend when used with query-scheduler. APIPA still used as last resort with logging indicating usage. #4032
   114  * [ENHANCEMENT] Memberlist: introduced new metrics to aid troubleshooting tombstone convergence: #4231
   115    * `memberlist_client_kv_store_value_tombstones`
   116    * `memberlist_client_kv_store_value_tombstones_removed_total`
   117    * `memberlist_client_messages_to_broadcast_dropped_total`
   118  * [ENHANCEMENT] Alertmanager: Added `-alertmanager.max-dispatcher-aggregation-groups` option to control max number of active dispatcher groups in Alertmanager (per tenant, also overrideable). When the limit is reached, Dispatcher produces log message and increases `cortex_alertmanager_dispatcher_aggregation_group_limit_reached_total` metric. #4254
   119  * [ENHANCEMENT] Alertmanager: Added `-alertmanager.max-alerts-count` and `-alertmanager.max-alerts-size-bytes` to control max number of alerts and total size of alerts that a single user can have in Alertmanager's memory. Adding more alerts will fail with a log message and incrementing `cortex_alertmanager_alerts_insert_limited_total` metric (per-user). These limits can be overrided by using per-tenant overrides. Current values are tracked in `cortex_alertmanager_alerts_limiter_current_alerts` and `cortex_alertmanager_alerts_limiter_current_alerts_size_bytes` metrics. #4253
   120  * [ENHANCEMENT] Store-gateway: added `-store-gateway.sharding-ring.wait-stability-min-duration` and `-store-gateway.sharding-ring.wait-stability-max-duration` support to store-gateway, to wait for ring stability at startup. #4271
   121  * [ENHANCEMENT] Ruler: added `rule_group` label to metrics `cortex_prometheus_rule_group_iterations_total` and `cortex_prometheus_rule_group_iterations_missed_total`. #4121
   122  * [ENHANCEMENT] Ruler: added new metrics for tracking total number of queries and push requests sent to ingester, as well as failed queries and push requests. Failures are only counted for internal errors, but not user-errors like limits or invalid query. This is in contrast to existing `cortex_prometheus_rule_evaluation_failures_total`, which is incremented also when query or samples appending fails due to user-errors. #4281
   123    * `cortex_ruler_write_requests_total`
   124    * `cortex_ruler_write_requests_failed_total`
   125    * `cortex_ruler_queries_total`
   126    * `cortex_ruler_queries_failed_total`
   127  * [ENHANCEMENT] Ingester: Added option `-ingester.ignore-series-limit-for-metric-names` with comma-separated list of metric names that will be ignored in max series per metric limit. #4302
   128  * [ENHANCEMENT] Added instrumentation to Redis client, with the following metrics: #3976
   129    - `cortex_rediscache_request_duration_seconds`
   130  * [BUGFIX] Purger: fix `Invalid null value in condition for column range` caused by `nil` value in range for WriteBatch query. #4128
   131  * [BUGFIX] Ingester: fixed infrequent panic caused by a race condition between TSDB mmap-ed head chunks truncation and queries. #4176
   132  * [BUGFIX] Alertmanager: fix Alertmanager status page if clustering via gossip is disabled or sharding is enabled. #4184
   133  * [BUGFIX] Ruler: fix `/ruler/rule_groups` endpoint doesn't work when used with object store. #4182
   134  * [BUGFIX] Ruler: Honor the evaluation delay for the `ALERTS` and `ALERTS_FOR_STATE` series. #4227
   135  * [BUGFIX] Make multiple Get requests instead of MGet on Redis Cluster. #4056
   136  * [BUGFIX] Ingester: fix issue where runtime limits erroneously override default limits. #4246
   137  * [BUGFIX] Ruler: fix startup in single-binary mode when the new `ruler_storage` is used. #4252
   138  * [BUGFIX] Querier: fix queries failing with "at least 1 healthy replica required, could only find 0" error right after scaling up store-gateways until they're ACTIVE in the ring. #4263
   139  * [BUGFIX] Store-gateway: when blocks sharding is enabled, do not load all blocks in each store-gateway in case of a cold startup, but load only blocks owned by the store-gateway replica. #4271
   140  * [BUGFIX] Memberlist: fix to setting the default configuration value for `-memberlist.retransmit-factor` when not provided. This should improve propagation delay of the ring state (including, but not limited to, tombstones). Note that if the configuration is already explicitly given, this fix has no effect. #4269
   141  * [BUGFIX] Querier: Fix issue where samples in a chunk might get skipped by batch iterator. #4218
   142  
   143  ## Blocksconvert
   144  
   145  * [ENHANCEMENT] Scanner: add support for DynamoDB (v9 schema only). #3828
   146  * [ENHANCEMENT] Add Cassandra support. #3795
   147  * [ENHANCEMENT] Scanner: retry failed uploads. #4188
   148  
   149  ## 1.9.0 / 2021-05-14
   150  
   151  * [CHANGE] Alertmanager now removes local files after Alertmanager is no longer running for removed or resharded user. #3910
   152  * [CHANGE] Alertmanager now stores local files in per-tenant folders. Files stored by Alertmanager previously are migrated to new hierarchy. Support for this migration will be removed in Cortex 1.11. #3910
   153  * [CHANGE] Ruler: deprecated `-ruler.storage.*` CLI flags (and their respective YAML config options) in favour of `-ruler-storage.*`. The deprecated config will be removed in Cortex 1.11. #3945
   154  * [CHANGE] Alertmanager: deprecated `-alertmanager.storage.*` CLI flags (and their respective YAML config options) in favour of `-alertmanager-storage.*`. This change doesn't apply to `alertmanager.storage.path` and `alertmanager.storage.retention`. The deprecated config will be removed in Cortex 1.11. #4002
   155  * [CHANGE] Alertmanager: removed `-cluster.` CLI flags deprecated in Cortex 1.7. The new config options to use are: #3946
   156    * `-alertmanager.cluster.listen-address` instead of `-cluster.listen-address`
   157    * `-alertmanager.cluster.advertise-address` instead of `-cluster.advertise-address`
   158    * `-alertmanager.cluster.peers` instead of `-cluster.peer`
   159    * `-alertmanager.cluster.peer-timeout` instead of `-cluster.peer-timeout`
   160  * [CHANGE] Blocks storage: removed the config option `-blocks-storage.bucket-store.index-cache.postings-compression-enabled`, which was deprecated in Cortex 1.6. Postings compression is always enabled. #4101
   161  * [CHANGE] Querier: removed the config option `-store.max-look-back-period`, which was deprecated in Cortex 1.6 and was used only by the chunks storage. You should use `-querier.max-query-lookback` instead. #4101
   162  * [CHANGE] Query Frontend: removed the config option `-querier.compress-http-responses`, which was deprecated in Cortex 1.6. You should use`-api.response-compression-enabled` instead. #4101
   163  * [CHANGE] Runtime-config / overrides: removed the config options `-limits.per-user-override-config` (use `-runtime-config.file`) and `-limits.per-user-override-period` (use `-runtime-config.reload-period`), both deprecated since Cortex 0.6.0. #4112
   164  * [CHANGE] Cortex now fails fast on startup if unable to connect to the ring backend. #4068
   165  * [FEATURE] The following features have been marked as stable: #4101
   166    - Shuffle-sharding
   167    - Querier support for querying chunks and blocks store at the same time
   168    - Tracking of active series and exporting them as metrics (`-ingester.active-series-metrics-enabled` and related flags)
   169    - Blocks storage: lazy mmap of block indexes in the store-gateway (`-blocks-storage.bucket-store.index-header-lazy-loading-enabled`)
   170    - Ingester: close idle TSDB and remove them from local disk (`-blocks-storage.tsdb.close-idle-tsdb-timeout`)
   171  * [FEATURE] Memberlist: add TLS configuration options for the memberlist transport layer used by the gossip KV store. #4046
   172    * New flags added for memberlist communication:
   173      * `-memberlist.tls-enabled`
   174      * `-memberlist.tls-cert-path`
   175      * `-memberlist.tls-key-path`
   176      * `-memberlist.tls-ca-path`
   177      * `-memberlist.tls-server-name`
   178      * `-memberlist.tls-insecure-skip-verify`
   179  * [FEATURE] Ruler: added `local` backend support to the ruler storage configuration under the `-ruler-storage.` flag prefix. #3932
   180  * [ENHANCEMENT] Upgraded Docker base images to `alpine:3.13`. #4042
   181  * [ENHANCEMENT] Blocks storage: reduce ingester memory by eliminating series reference cache. #3951
   182  * [ENHANCEMENT] Ruler: optimized `<prefix>/api/v1/rules` and `<prefix>/api/v1/alerts` when ruler sharding is enabled. #3916
   183  * [ENHANCEMENT] Ruler: added the following metrics when ruler sharding is enabled: #3916
   184    * `cortex_ruler_clients`
   185    * `cortex_ruler_client_request_duration_seconds`
   186  * [ENHANCEMENT] Alertmanager: Add API endpoint to list all tenant alertmanager configs: `GET /multitenant_alertmanager/configs`. #3529
   187  * [ENHANCEMENT] Ruler: Add API endpoint to list all tenant ruler rule groups: `GET /ruler/rule_groups`. #3529
   188  * [ENHANCEMENT] Query-frontend/scheduler: added querier forget delay (`-query-frontend.querier-forget-delay` and `-query-scheduler.querier-forget-delay`) to mitigate the blast radius in the event queriers crash because of a repeatedly sent "query of death" when shuffle-sharding is enabled. #3901
   189  * [ENHANCEMENT] Query-frontend: reduced memory allocations when serializing query response. #3964
   190  * [ENHANCEMENT] Querier / ruler: some optimizations to PromQL query engine. #3934 #3989
   191  * [ENHANCEMENT] Ingester: reduce CPU and memory when an high number of errors are returned by the ingester on the write path with the blocks storage. #3969 #3971 #3973
   192  * [ENHANCEMENT] Distributor: reduce CPU and memory when an high number of errors are returned by the distributor on the write path. #3990
   193  * [ENHANCEMENT] Put metric before label value in the "label value too long" error message. #4018
   194  * [ENHANCEMENT] Allow use of `y|w|d` suffixes for duration related limits and per-tenant limits. #4044
   195  * [ENHANCEMENT] Query-frontend: Small optimization on top of PR #3968 to avoid unnecessary Extents merging. #4026
   196  * [ENHANCEMENT] Add a metric `cortex_compactor_compaction_interval_seconds` for the compaction interval config value. #4040
   197  * [ENHANCEMENT] Ingester: added following per-ingester (instance) experimental limits: max number of series in memory (`-ingester.instance-limits.max-series`), max number of users in memory (`-ingester.instance-limits.max-tenants`), max ingestion rate (`-ingester.instance-limits.max-ingestion-rate`), and max inflight requests (`-ingester.instance-limits.max-inflight-push-requests`). These limits are only used when using blocks storage. Limits can also be configured using runtime-config feature, and current values are exported as `cortex_ingester_instance_limits` metric. #3992.
   198  * [ENHANCEMENT] Cortex is now built with Go 1.16. #4062
   199  * [ENHANCEMENT] Distributor: added per-distributor experimental limits: max number of inflight requests (`-distributor.instance-limits.max-inflight-push-requests`) and max ingestion rate in samples/sec (`-distributor.instance-limits.max-ingestion-rate`). If not set, these two are unlimited. Also added metrics to expose current values (`cortex_distributor_inflight_push_requests`, `cortex_distributor_ingestion_rate_samples_per_second`) as well as limits (`cortex_distributor_instance_limits` with various `limit` label values). #4071
   200  * [ENHANCEMENT] Ruler: Added `-ruler.enabled-tenants` and `-ruler.disabled-tenants` to explicitly enable or disable rules processing for specific tenants. #4074
   201  * [ENHANCEMENT] Block Storage Ingester: `/flush` now accepts two new parameters: `tenant` to specify tenant to flush and `wait=true` to make call synchronous. Multiple tenants can be specified by repeating `tenant` parameter. If no `tenant` is specified, all tenants are flushed, as before. #4073
   202  * [ENHANCEMENT] Alertmanager: validate configured `-alertmanager.web.external-url` and fail if ends with `/`. #4081
   203  * [ENHANCEMENT] Alertmanager: added `-alertmanager.receivers-firewall.block.cidr-networks` and `-alertmanager.receivers-firewall.block.private-addresses` to block specific network addresses in HTTP-based Alertmanager receiver integrations. #4085
   204  * [ENHANCEMENT] Allow configuration of Cassandra's host selection policy. #4069
   205  * [ENHANCEMENT] Store-gateway: retry synching blocks if a per-tenant sync fails. #3975 #4088
   206  * [ENHANCEMENT] Add metric `cortex_tcp_connections` exposing the current number of accepted TCP connections. #4099
   207  * [ENHANCEMENT] Querier: Allow federated queries to run concurrently. #4065
   208  * [ENHANCEMENT] Label Values API call now supports `match[]` parameter when querying blocks on storage (assuming `-querier.query-store-for-labels-enabled` is enabled). #4133
   209  * [BUGFIX] Ruler-API: fix bug where `/api/v1/rules/<namespace>/<group_name>` endpoint return `400` instead of `404`. #4013
   210  * [BUGFIX] Distributor: reverted changes done to rate limiting in #3825. #3948
   211  * [BUGFIX] Ingester: Fix race condition when opening and closing tsdb concurrently. #3959
   212  * [BUGFIX] Querier: streamline tracing spans. #3924
   213  * [BUGFIX] Ruler Storage: ignore objects with empty namespace or group in the name. #3999
   214  * [BUGFIX] Distributor: fix issue causing distributors to not extend the replication set because of failing instances when zone-aware replication is enabled. #3977
   215  * [BUGFIX] Query-frontend: Fix issue where cached entry size keeps increasing when making tiny query repeatedly. #3968
   216  * [BUGFIX] Compactor: `-compactor.blocks-retention-period` now supports weeks (`w`) and years (`y`). #4027
   217  * [BUGFIX] Querier: returning 422 (instead of 500) when query hits `max_chunks_per_query` limit with block storage, when the limit is hit in the store-gateway. #3937
   218  * [BUGFIX] Ruler: Rule group limit enforcement should now allow the same number of rules in a group as the limit. #3616
   219  * [BUGFIX] Frontend, Query-scheduler: allow querier to notify about shutdown without providing any authentication. #4066
   220  * [BUGFIX] Querier: fixed race condition causing queries to fail right after querier startup with the "empty ring" error. #4068
   221  * [BUGFIX] Compactor: Increment `cortex_compactor_runs_failed_total` if compactor failed compact a single tenant. #4094
   222  * [BUGFIX] Tracing: hot fix to avoid the Jaeger tracing client to indefinitely block the Cortex process shutdown in case the HTTP connection to the tracing backend is blocked. #4134
   223  * [BUGFIX] Forward proper EndsAt from ruler to Alertmanager inline with Prometheus behaviour. #4017
   224  * [BUGFIX] Querier: support filtering LabelValues with matchers when using tenant federation. #4277
   225  
   226  ## Blocksconvert
   227  
   228  * [ENHANCEMENT] Builder: add `-builder.timestamp-tolerance` option which may reduce block size by rounding timestamps to make difference whole seconds. #3891
   229  
   230  ## 1.8.1 / 2021-04-27
   231  
   232  * [CHANGE] Fix for CVE-2021-31232: Local file disclosure vulnerability when `-experimental.alertmanager.enable-api` is used. The HTTP basic auth `password_file` can be used as an attack vector to send any file content via a webhook. The alertmanager templates can be used as an attack vector to send any file content because the alertmanager can load any text file specified in the templates list.
   233  
   234  ## 1.8.0 / 2021-03-24
   235  
   236  * [CHANGE] Alertmanager: Don't expose cluster information to tenants via the `/alertmanager/api/v1/status` API endpoint when operating with clustering enabled. #3903
   237  * [CHANGE] Ingester: don't update internal "last updated" timestamp of TSDB if tenant only sends invalid samples. This affects how "idle" time is computed. #3727
   238  * [CHANGE] Require explicit flag `-<prefix>.tls-enabled` to enable TLS in GRPC clients. Previously it was enough to specify a TLS flag to enable TLS validation. #3156
   239  * [CHANGE] Query-frontend: removed `-querier.split-queries-by-day` (deprecated in Cortex 0.4.0). Please use `-querier.split-queries-by-interval` instead. #3813
   240  * [CHANGE] Store-gateway: the chunks pool controlled by `-blocks-storage.bucket-store.max-chunk-pool-bytes` is now shared across all tenants. #3830
   241  * [CHANGE] Ingester: return error code 400 instead of 429 when per-user/per-tenant series/metadata limits are reached. #3833
   242  * [CHANGE] Compactor: add `reason` label to `cortex_compactor_blocks_marked_for_deletion_total` metric. Source blocks marked for deletion by compactor are labelled as `compaction`, while blocks passing the retention period are labelled as `retention`. #3879
   243  * [CHANGE] Alertmanager: the `DELETE /api/v1/alerts` is now idempotent. No error is returned if the alertmanager config doesn't exist. #3888
   244  * [FEATURE] Experimental Ruler Storage: Add a separate set of configuration options to configure the ruler storage backend under the `-ruler-storage.` flag prefix. All blocks storage bucket clients and the config service are currently supported. Clients using this implementation will only be enabled if the existing `-ruler.storage` flags are left unset. #3805 #3864
   245  * [FEATURE] Experimental Alertmanager Storage: Add a separate set of configuration options to configure the alertmanager storage backend under the `-alertmanager-storage.` flag prefix. All blocks storage bucket clients and the config service are currently supported. Clients using this implementation will only be enabled if the existing `-alertmanager.storage` flags are left unset. #3888
   246  * [FEATURE] Adds support to S3 server-side encryption using KMS. The S3 server-side encryption config can be overridden on a per-tenant basis for the blocks storage, ruler and alertmanager. Deprecated `-<prefix>.s3.sse-encryption`, please use the following CLI flags that have been added. #3651 #3810 #3811 #3870 #3886 #3906
   247    - `-<prefix>.s3.sse.type`
   248    - `-<prefix>.s3.sse.kms-key-id`
   249    - `-<prefix>.s3.sse.kms-encryption-context`
   250  * [FEATURE] Querier: Enable `@ <timestamp>` modifier in PromQL using the new `-querier.at-modifier-enabled` flag. #3744
   251  * [FEATURE] Overrides Exporter: Add `overrides-exporter` module for exposing per-tenant resource limit overrides as metrics. It is not included in `all` target (single-binary mode), and must be explicitly enabled. #3785
   252  * [FEATURE] Experimental thanosconvert: introduce an experimental tool `thanosconvert` to migrate Thanos block metadata to Cortex metadata. #3770
   253  * [FEATURE] Alertmanager: It now shards the `/api/v1/alerts` API using the ring when sharding is enabled. #3671
   254    * Added `-alertmanager.max-recv-msg-size` (defaults to 16M) to limit the size of HTTP request body handled by the alertmanager.
   255    * New flags added for communication between alertmanagers:
   256      * `-alertmanager.max-recv-msg-size`
   257      * `-alertmanager.alertmanager-client.remote-timeout`
   258      * `-alertmanager.alertmanager-client.tls-enabled`
   259      * `-alertmanager.alertmanager-client.tls-cert-path`
   260      * `-alertmanager.alertmanager-client.tls-key-path`
   261      * `-alertmanager.alertmanager-client.tls-ca-path`
   262      * `-alertmanager.alertmanager-client.tls-server-name`
   263      * `-alertmanager.alertmanager-client.tls-insecure-skip-verify`
   264  * [FEATURE] Compactor: added blocks storage per-tenant retention support. This is configured via `-compactor.retention-period`, and can be overridden on a per-tenant basis. #3879
   265  * [ENHANCEMENT] Queries: Instrument queries that were discarded due to the configured `max_outstanding_requests_per_tenant`. #3894
   266    * `cortex_query_frontend_discarded_requests_total`
   267    * `cortex_query_scheduler_discarded_requests_total`
   268  * [ENHANCEMENT] Ruler: Add TLS and explicit basis authentication configuration options for the HTTP client the ruler uses to communicate with the alertmanager. #3752
   269    * `-ruler.alertmanager-client.basic-auth-username`: Configure the basic authentication username used by the client. Takes precedent over a URL configured username.
   270    * `-ruler.alertmanager-client.basic-auth-password`: Configure the basic authentication password used by the client. Takes precedent over a URL configured password.
   271    * `-ruler.alertmanager-client.tls-ca-path`: File path to the CA file.
   272    * `-ruler.alertmanager-client.tls-cert-path`: File path to the TLS certificate.
   273    * `-ruler.alertmanager-client.tls-insecure-skip-verify`: Boolean to disable verifying the certificate.
   274    * `-ruler.alertmanager-client.tls-key-path`: File path to the TLS key certificate.
   275    * `-ruler.alertmanager-client.tls-server-name`: Expected name on the TLS certificate.
   276  * [ENHANCEMENT] Ingester: exposed metric `cortex_ingester_oldest_unshipped_block_timestamp_seconds`, tracking the unix timestamp of the oldest TSDB block not shipped to the storage yet. #3705
   277  * [ENHANCEMENT] Prometheus upgraded. #3739 #3806
   278    * Avoid unnecessary `runtime.GC()` during compactions.
   279    * Prevent compaction loop in TSDB on data gap.
   280  * [ENHANCEMENT] Query-Frontend now returns server side performance metrics using `Server-Timing` header when query stats is enabled. #3685
   281  * [ENHANCEMENT] Runtime Config: Add a `mode` query parameter for the runtime config endpoint. `/runtime_config?mode=diff` now shows the YAML runtime configuration with all values that differ from the defaults. #3700
   282  * [ENHANCEMENT] Distributor: Enable downstream projects to wrap distributor push function and access the deserialized write requests berfore/after they are pushed. #3755
   283  * [ENHANCEMENT] Add flag `-<prefix>.tls-server-name` to require a specific server name instead of the hostname on the certificate. #3156
   284  * [ENHANCEMENT] Alertmanager: Remove a tenant's alertmanager instead of pausing it as we determine it is no longer needed. #3722
   285  * [ENHANCEMENT] Blocks storage: added more configuration options to S3 client. #3775
   286    * `-blocks-storage.s3.tls-handshake-timeout`: Maximum time to wait for a TLS handshake. 0 means no limit.
   287    * `-blocks-storage.s3.expect-continue-timeout`: The time to wait for a server's first response headers after fully writing the request headers if the request has an Expect header. 0 to send the request body immediately.
   288    * `-blocks-storage.s3.max-idle-connections`: Maximum number of idle (keep-alive) connections across all hosts. 0 means no limit.
   289    * `-blocks-storage.s3.max-idle-connections-per-host`: Maximum number of idle (keep-alive) connections to keep per-host. If 0, a built-in default value is used.
   290    * `-blocks-storage.s3.max-connections-per-host`: Maximum number of connections per host. 0 means no limit.
   291  * [ENHANCEMENT] Ingester: when tenant's TSDB is closed, Ingester now removes pushed metrics-metadata from memory, and removes metadata (`cortex_ingester_memory_metadata`, `cortex_ingester_memory_metadata_created_total`, `cortex_ingester_memory_metadata_removed_total`) and validation metrics (`cortex_discarded_samples_total`, `cortex_discarded_metadata_total`). #3782
   292  * [ENHANCEMENT] Distributor: cleanup metrics for inactive tenants. #3784
   293  * [ENHANCEMENT] Ingester: Have ingester to re-emit following TSDB metrics. #3800
   294    * `cortex_ingester_tsdb_blocks_loaded`
   295    * `cortex_ingester_tsdb_reloads_total`
   296    * `cortex_ingester_tsdb_reloads_failures_total`
   297    * `cortex_ingester_tsdb_symbol_table_size_bytes`
   298    * `cortex_ingester_tsdb_storage_blocks_bytes`
   299    * `cortex_ingester_tsdb_time_retentions_total`
   300  * [ENHANCEMENT] Querier: distribute workload across `-store-gateway.sharding-ring.replication-factor` store-gateway replicas when querying blocks and `-store-gateway.sharding-enabled=true`. #3824
   301  * [ENHANCEMENT] Distributor / HA Tracker: added cleanup of unused elected HA replicas from KV store. Added following metrics to monitor this process: #3809
   302    * `cortex_ha_tracker_replicas_cleanup_started_total`
   303    * `cortex_ha_tracker_replicas_cleanup_marked_for_deletion_total`
   304    * `cortex_ha_tracker_replicas_cleanup_deleted_total`
   305    * `cortex_ha_tracker_replicas_cleanup_delete_failed_total`
   306  * [ENHANCEMENT] Ruler now has new API endpoint `/ruler/delete_tenant_config` that can be used to delete all ruler groups for tenant. It is intended to be used by administrators who wish to clean up state after removed user. Note that this endpoint is enabled regardless of `-experimental.ruler.enable-api`. #3750 #3899
   307  * [ENHANCEMENT] Query-frontend, query-scheduler: cleanup metrics for inactive tenants. #3826
   308  * [ENHANCEMENT] Blocks storage: added `-blocks-storage.s3.region` support to S3 client configuration. #3811
   309  * [ENHANCEMENT] Distributor: Remove cached subrings for inactive users when using shuffle sharding. #3849
   310  * [ENHANCEMENT] Store-gateway: Reduced memory used to fetch chunks at query time. #3855
   311  * [ENHANCEMENT] Ingester: attempt to prevent idle compaction from happening in concurrent ingesters by introducing a 25% jitter to the configured idle timeout (`-blocks-storage.tsdb.head-compaction-idle-timeout`). #3850
   312  * [ENHANCEMENT] Compactor: cleanup local files for users that are no longer owned by compactor. #3851
   313  * [ENHANCEMENT] Store-gateway: close empty bucket stores, and delete leftover local files for tenants that no longer belong to store-gateway. #3853
   314  * [ENHANCEMENT] Store-gateway: added metrics to track partitioner behaviour. #3877
   315    * `cortex_bucket_store_partitioner_requested_bytes_total`
   316    * `cortex_bucket_store_partitioner_requested_ranges_total`
   317    * `cortex_bucket_store_partitioner_expanded_bytes_total`
   318    * `cortex_bucket_store_partitioner_expanded_ranges_total`
   319  * [ENHANCEMENT] Store-gateway: added metrics to monitor chunk buffer pool behaviour. #3880
   320    * `cortex_bucket_store_chunk_pool_requested_bytes_total`
   321    * `cortex_bucket_store_chunk_pool_returned_bytes_total`
   322  * [ENHANCEMENT] Alertmanager: load alertmanager configurations from object storage concurrently, and only load necessary configurations, speeding configuration synchronization process and executing fewer "GET object" operations to the storage when sharding is enabled. #3898
   323  * [ENHANCEMENT] Ingester (blocks storage): Ingester can now stream entire chunks instead of individual samples to the querier. At the moment this feature must be explicitly enabled either by using `-ingester.stream-chunks-when-using-blocks` flag or `ingester_stream_chunks_when_using_blocks` (boolean) field in runtime config file, but these configuration options are temporary and will be removed when feature is stable. #3889
   324  * [ENHANCEMENT] Alertmanager: New endpoint `/multitenant_alertmanager/delete_tenant_config` to delete configuration for tenant identified by `X-Scope-OrgID` header. This is an internal endpoint, available even if Alertmanager API is not enabled by using `-experimental.alertmanager.enable-api`. #3900
   325  * [ENHANCEMENT] MemCached: Add `max_item_size` support. #3929
   326  * [BUGFIX] Cortex: Fixed issue where fatal errors and various log messages where not logged. #3778
   327  * [BUGFIX] HA Tracker: don't track as error in the `cortex_kv_request_duration_seconds` metric a CAS operation intentionally aborted. #3745
   328  * [BUGFIX] Querier / ruler: do not log "error removing stale clients" if the ring is empty. #3761
   329  * [BUGFIX] Store-gateway: fixed a panic caused by a race condition when the index-header lazy loading is enabled. #3775 #3789
   330  * [BUGFIX] Compactor: fixed "could not guess file size" log when uploading blocks deletion marks to the global location. #3807
   331  * [BUGFIX] Prevent panic at start if the http_prefix setting doesn't have a valid value. #3796
   332  * [BUGFIX] Memberlist: fixed panic caused by race condition in `armon/go-metrics` used by memberlist client. #3725
   333  * [BUGFIX] Querier: returning 422 (instead of 500) when query hits `max_chunks_per_query` limit with block storage. #3895
   334  * [BUGFIX] Alertmanager: Ensure that experimental `/api/v1/alerts` endpoints work when `-http.prefix` is empty. #3905
   335  * [BUGFIX] Chunk store: fix panic in inverted index when deleted fingerprint is no longer in the index. #3543
   336  
   337  ## 1.7.1 / 2021-04-27
   338  
   339  * [CHANGE] Fix for CVE-2021-31232: Local file disclosure vulnerability when `-experimental.alertmanager.enable-api` is used. The HTTP basic auth `password_file` can be used as an attack vector to send any file content via a webhook. The alertmanager templates can be used as an attack vector to send any file content because the alertmanager can load any text file specified in the templates list.
   340  
   341  ## 1.7.0 / 2021-02-23
   342  
   343  Note the blocks storage compactor runs a migration task at startup in this version, which can take many minutes and use a lot of RAM.
   344  [Turn this off after first run](https://cortexmetrics.io/docs/blocks-storage/production-tips/#ensure-deletion-marks-migration-is-disabled-after-first-run).
   345  
   346  * [CHANGE] FramedSnappy encoding support has been removed from Push and Remote Read APIs. This means Prometheus 1.6 support has been removed and the oldest Prometheus version supported in the remote write is 1.7. #3682
   347  * [CHANGE] Ruler: removed the flag `-ruler.evaluation-delay-duration-deprecated` which was deprecated in 1.4.0. Please use the `ruler_evaluation_delay_duration` per-tenant limit instead. #3694
   348  * [CHANGE] Removed the flags `-<prefix>.grpc-use-gzip-compression` which were deprecated in 1.3.0: #3694
   349    * `-query-scheduler.grpc-client-config.grpc-use-gzip-compression`: use `-query-scheduler.grpc-client-config.grpc-compression` instead
   350    * `-frontend.grpc-client-config.grpc-use-gzip-compression`: use `-frontend.grpc-client-config.grpc-compression` instead
   351    * `-ruler.client.grpc-use-gzip-compression`: use `-ruler.client.grpc-compression` instead
   352    * `-bigtable.grpc-use-gzip-compression`: use `-bigtable.grpc-compression` instead
   353    * `-ingester.client.grpc-use-gzip-compression`: use `-ingester.client.grpc-compression` instead
   354    * `-querier.frontend-client.grpc-use-gzip-compression`: use `-querier.frontend-client.grpc-compression` instead
   355  * [CHANGE] Querier: it's not required to set `-frontend.query-stats-enabled=true` in the querier anymore to enable query statistics logging in the query-frontend. The flag is now required to be configured only in the query-frontend and it will be propagated to the queriers. #3595 #3695
   356  * [CHANGE] Blocks storage: compactor is now required when running a Cortex cluster with the blocks storage, because it also keeps the bucket index updated. #3583
   357  * [CHANGE] Blocks storage: block deletion marks are now stored in a per-tenant global markers/ location too, other than within the block location. The compactor, at startup, will copy deletion marks from the block location to the global location. This migration is required only once, so it can be safely disabled via `-compactor.block-deletion-marks-migration-enabled=false` after new compactor has successfully started at least once in the cluster. #3583
   358  * [CHANGE] OpenStack Swift: the default value for the `-ruler.storage.swift.container-name` and `-swift.container-name` config options has changed from `cortex` to empty string. If you were relying on the default value, please set it back to `cortex`. #3660
   359  * [CHANGE] HA Tracker: configured replica label is now verified against label value length limit (`-validation.max-length-label-value`). #3668
   360  * [CHANGE] Distributor: `extend_writes` field in YAML configuration has moved from `lifecycler` (inside `ingester_config`) to `distributor_config`. This doesn't affect command line option `-distributor.extend-writes`, which stays the same. #3719
   361  * [CHANGE] Alertmanager: Deprecated `-cluster.` CLI flags in favor of their `-alertmanager.cluster.` equivalent. The deprecated flags (and their respective YAML config options) are: #3677
   362    * `-cluster.listen-address` in favor of `-alertmanager.cluster.listen-address`
   363    * `-cluster.advertise-address` in favor of `-alertmanager.cluster.advertise-address`
   364    * `-cluster.peer` in favor of `-alertmanager.cluster.peers`
   365    * `-cluster.peer-timeout` in favor of `-alertmanager.cluster.peer-timeout`
   366  * [CHANGE] Blocks storage: the default value of `-blocks-storage.bucket-store.sync-interval` has been changed from `5m` to `15m`. #3724
   367  * [FEATURE] Querier: Queries can be federated across multiple tenants. The tenants IDs involved need to be specified separated by a `|` character in the `X-Scope-OrgID` request header. This is an experimental feature, which can be enabled by setting `-tenant-federation.enabled=true` on all Cortex services. #3250
   368  * [FEATURE] Alertmanager: introduced the experimental option `-alertmanager.sharding-enabled` to shard tenants across multiple Alertmanager instances. This feature is still under heavy development and its usage is discouraged. The following new metrics are exported by the Alertmanager: #3664
   369    * `cortex_alertmanager_ring_check_errors_total`
   370    * `cortex_alertmanager_sync_configs_total`
   371    * `cortex_alertmanager_sync_configs_failed_total`
   372    * `cortex_alertmanager_tenants_discovered`
   373    * `cortex_alertmanager_tenants_owned`
   374  * [ENHANCEMENT] Allow specifying JAEGER_ENDPOINT instead of sampling server or local agent port. #3682
   375  * [ENHANCEMENT] Blocks storage: introduced a per-tenant bucket index, periodically updated by the compactor, used to avoid full bucket scanning done by queriers, store-gateways and rulers. The bucket index is updated by the compactor during blocks cleanup, on every `-compactor.cleanup-interval`. #3553 #3555 #3561 #3583 #3625 #3711 #3715
   376  * [ENHANCEMENT] Blocks storage: introduced an option `-blocks-storage.bucket-store.bucket-index.enabled` to enable the usage of the bucket index in the querier, store-gateway and ruler. When enabled, the querier, store-gateway and ruler will use the bucket index to find a tenant's blocks instead of running the periodic bucket scan. The following new metrics are exported by the querier and ruler: #3614 #3625
   377    * `cortex_bucket_index_loads_total`
   378    * `cortex_bucket_index_load_failures_total`
   379    * `cortex_bucket_index_load_duration_seconds`
   380    * `cortex_bucket_index_loaded`
   381  * [ENHANCEMENT] Compactor: exported the following metrics. #3583 #3625
   382    * `cortex_bucket_blocks_count`: Total number of blocks per tenant in the bucket. Includes blocks marked for deletion, but not partial blocks.
   383    * `cortex_bucket_blocks_marked_for_deletion_count`: Total number of blocks per tenant marked for deletion in the bucket.
   384    * `cortex_bucket_blocks_partials_count`: Total number of partial blocks.
   385    * `cortex_bucket_index_last_successful_update_timestamp_seconds`: Timestamp of the last successful update of a tenant's bucket index.
   386  * [ENHANCEMENT] Ruler: Add `cortex_prometheus_last_evaluation_samples` to expose the number of samples generated by a rule group per tenant. #3582
   387  * [ENHANCEMENT] Memberlist: add status page (/memberlist) with available details about memberlist-based KV store and memberlist cluster. It's also possible to view KV values in Go struct or JSON format, or download for inspection. #3575
   388  * [ENHANCEMENT] Memberlist: client can now keep a size-bounded buffer with sent and received messages and display them in the admin UI (/memberlist) for troubleshooting. #3581 #3602
   389  * [ENHANCEMENT] Blocks storage: added block index attributes caching support to metadata cache. The TTL can be configured via `-blocks-storage.bucket-store.metadata-cache.block-index-attributes-ttl`. #3629
   390  * [ENHANCEMENT] Alertmanager: Add support for Azure blob storage. #3634
   391  * [ENHANCEMENT] Compactor: tenants marked for deletion will now be fully cleaned up after some delay since deletion of last block. Cleanup includes removal of remaining marker files (including tenant deletion mark file) and files under `debug/metas`. #3613
   392  * [ENHANCEMENT] Compactor: retry compaction of a single tenant on failure instead of re-running compaction for all tenants. #3627
   393  * [ENHANCEMENT] Querier: Implement result caching for tenant query federation. #3640
   394  * [ENHANCEMENT] API: Add a `mode` query parameter for the config endpoint: #3645
   395    * `/config?mode=diff`: Shows the YAML configuration with all values that differ from the defaults.
   396    * `/config?mode=defaults`: Shows the YAML configuration with all the default values.
   397  * [ENHANCEMENT] OpenStack Swift: added the following config options to OpenStack Swift backend client: #3660
   398    - Chunks storage: `-swift.auth-version`, `-swift.max-retries`, `-swift.connect-timeout`, `-swift.request-timeout`.
   399    - Blocks storage: ` -blocks-storage.swift.auth-version`, ` -blocks-storage.swift.max-retries`, ` -blocks-storage.swift.connect-timeout`, ` -blocks-storage.swift.request-timeout`.
   400    - Ruler: `-ruler.storage.swift.auth-version`, `-ruler.storage.swift.max-retries`, `-ruler.storage.swift.connect-timeout`, `-ruler.storage.swift.request-timeout`.
   401  * [ENHANCEMENT] Disabled in-memory shuffle-sharding subring cache in the store-gateway, ruler and compactor. This should reduce the memory utilisation in these services when shuffle-sharding is enabled, without introducing a significantly increase CPU utilisation. #3601
   402  * [ENHANCEMENT] Shuffle sharding: optimised subring generation used by shuffle sharding. #3601
   403  * [ENHANCEMENT] New /runtime_config endpoint that returns the defined runtime configuration in YAML format. The returned configuration includes overrides. #3639
   404  * [ENHANCEMENT] Query-frontend: included the parameter name failed to validate in HTTP 400 message. #3703
   405  * [ENHANCEMENT] Fail to startup Cortex if provided runtime config is invalid. #3707
   406  * [ENHANCEMENT] Alertmanager: Add flags to customize the cluster configuration: #3667
   407    * `-alertmanager.cluster.gossip-interval`: The interval between sending gossip messages. By lowering this value (more frequent) gossip messages are propagated across cluster more quickly at the expense of increased bandwidth usage.
   408    * `-alertmanager.cluster.push-pull-interval`: The interval between gossip state syncs. Setting this interval lower (more frequent) will increase convergence speeds across larger clusters at the expense of increased bandwidth usage.
   409  * [ENHANCEMENT] Distributor: change the error message returned when a received series has too many label values. The new message format has the series at the end and this plays better with Prometheus logs truncation. #3718
   410    - From: `sample for '<series>' has <value> label names; limit <value>`
   411    - To: `series has too many labels (actual: <value>, limit: <value>) series: '<series>'`
   412  * [ENHANCEMENT] Improve bucket index loader to handle edge case where new tenant has not had blocks uploaded to storage yet. #3717
   413  * [BUGFIX] Allow `-querier.max-query-lookback` use `y|w|d` suffix like deprecated `-store.max-look-back-period`. #3598
   414  * [BUGFIX] Memberlist: Entry in the ring should now not appear again after using "Forget" feature (unless it's still heartbeating). #3603
   415  * [BUGFIX] Ingester: do not close idle TSDBs while blocks shipping is in progress. #3630 #3632
   416  * [BUGFIX] Ingester: correctly update `cortex_ingester_memory_users` and `cortex_ingester_active_series` when a tenant's idle TSDB is closed, when running Cortex with the blocks storage. #3646
   417  * [BUGFIX] Querier: fix default value incorrectly overriding `-querier.frontend-address` in single-binary mode. #3650
   418  * [BUGFIX] Compactor: delete `deletion-mark.json` at last when deleting a block in order to not leave partial blocks without deletion mark in the bucket if the compactor is interrupted while deleting a block. #3660
   419  * [BUGFIX] Blocks storage: do not cleanup a partially uploaded block when `meta.json` upload fails. Despite failure to upload `meta.json`, this file may in some cases still appear in the bucket later. By skipping early cleanup, we avoid having corrupted blocks in the storage. #3660
   420  * [BUGFIX] Alertmanager: disable access to `/alertmanager/metrics` (which exposes all Cortex metrics), `/alertmanager/-/reload` and `/alertmanager/debug/*`, which were available to any authenticated user with enabled AlertManager. #3678
   421  * [BUGFIX] Query-Frontend: avoid creating many small sub-queries by discarding cache extents under 5 minutes #3653
   422  * [BUGFIX] Ruler: Ensure the stale markers generated for evaluated rules respect the configured `-ruler.evaluation-delay-duration`. This will avoid issues with samples with NaN be persisted with timestamps set ahead of the next rule evaluation. #3687
   423  * [BUGFIX] Alertmanager: don't serve HTTP requests until Alertmanager has fully started. Serving HTTP requests earlier may result in loss of configuration for the user. #3679
   424  * [BUGFIX] Do not log "failed to load config" if runtime config file is empty. #3706
   425  * [BUGFIX] Do not allow to use a runtime config file containing multiple YAML documents. #3706
   426  * [BUGFIX] HA Tracker: don't track as error in the `cortex_kv_request_duration_seconds` metric a CAS operation intentionally aborted. #3745
   427  
   428  ## 1.6.0 / 2020-12-29
   429  
   430  * [CHANGE] Query Frontend: deprecate `-querier.compress-http-responses` in favour of `-api.response-compression-enabled`. #3544
   431  * [CHANGE] Querier: deprecated `-store.max-look-back-period`. You should use `-querier.max-query-lookback` instead. #3452
   432  * [CHANGE] Blocks storage: increased `-blocks-storage.bucket-store.chunks-cache.attributes-ttl` default from `24h` to `168h` (1 week). #3528
   433  * [CHANGE] Blocks storage: the config option `-blocks-storage.bucket-store.index-cache.postings-compression-enabled` has been deprecated and postings compression is always enabled. #3538
   434  * [CHANGE] Ruler: gRPC message size default limits on the Ruler-client side have changed: #3523
   435    - limit for outgoing gRPC messages has changed from 2147483647 to 16777216 bytes
   436    - limit for incoming gRPC messages has changed from 4194304 to 104857600 bytes
   437  * [FEATURE] Distributor/Ingester: Provide ability to not overflow writes in the presence of a leaving or unhealthy ingester. This allows for more efficient ingester rolling restarts. #3305
   438  * [FEATURE] Query-frontend: introduced query statistics logged in the query-frontend when enabled via `-frontend.query-stats-enabled=true`. When enabled, the metric `cortex_query_seconds_total` is tracked, counting the sum of the wall time spent across all queriers while running queries (on a per-tenant basis). The metrics `cortex_request_duration_seconds` and `cortex_query_seconds_total` are different: the first one tracks the request duration (eg. HTTP request from the client), while the latter tracks the sum of the wall time on all queriers involved executing the query. #3539
   439  * [ENHANCEMENT] API: Add GZIP HTTP compression to the API responses. Compression can be enabled via `-api.response-compression-enabled`. #3536
   440  * [ENHANCEMENT] Added zone-awareness support on queries. When zone-awareness is enabled, queries will still succeed if all ingesters in a single zone will fail. #3414
   441  * [ENHANCEMENT] Blocks storage ingester: exported more TSDB-related metrics. #3412
   442    - `cortex_ingester_tsdb_wal_corruptions_total`
   443    - `cortex_ingester_tsdb_head_truncations_failed_total`
   444    - `cortex_ingester_tsdb_head_truncations_total`
   445    - `cortex_ingester_tsdb_head_gc_duration_seconds`
   446  * [ENHANCEMENT] Enforced keepalive on all gRPC clients used for inter-service communication. #3431
   447  * [ENHANCEMENT] Added `cortex_alertmanager_config_hash` metric to expose hash of Alertmanager Config loaded per user. #3388
   448  * [ENHANCEMENT] Query-Frontend / Query-Scheduler: New component called "Query-Scheduler" has been introduced. Query-Scheduler is simply a queue of requests, moved outside of Query-Frontend. This allows Query-Frontend to be scaled separately from number of queues. To make Query-Frontend and Querier use Query-Scheduler, they need to be started with `-frontend.scheduler-address` and `-querier.scheduler-address` options respectively. #3374 #3471
   449  * [ENHANCEMENT] Query-frontend / Querier / Ruler: added `-querier.max-query-lookback` to limit how long back data (series and metadata) can be queried. This setting can be overridden on a per-tenant basis and is enforced in the query-frontend, querier and ruler. #3452 #3458
   450  * [ENHANCEMENT] Querier: added `-querier.query-store-for-labels-enabled` to query store for label names, label values and series APIs. Only works with blocks storage engine. #3461 #3520
   451  * [ENHANCEMENT] Ingester: exposed `-blocks-storage.tsdb.wal-segment-size-bytes` config option to customise the TSDB WAL segment max size. #3476
   452  * [ENHANCEMENT] Compactor: concurrently run blocks cleaner for multiple tenants. Concurrency can be configured via `-compactor.cleanup-concurrency`. #3483
   453  * [ENHANCEMENT] Compactor: shuffle tenants before running compaction. #3483
   454  * [ENHANCEMENT] Compactor: wait for a stable ring at startup, when sharding is enabled. #3484
   455  * [ENHANCEMENT] Store-gateway: added `-blocks-storage.bucket-store.index-header-lazy-loading-enabled` to enable index-header lazy loading (experimental). When enabled, index-headers will be mmap-ed only once required by a query and will be automatically released after `-blocks-storage.bucket-store.index-header-lazy-loading-idle-timeout` time of inactivity. #3498
   456  * [ENHANCEMENT] Alertmanager: added metrics `cortex_alertmanager_notification_requests_total` and `cortex_alertmanager_notification_requests_failed_total`. #3518
   457  * [ENHANCEMENT] Ingester: added `-blocks-storage.tsdb.head-chunks-write-buffer-size-bytes` to fine-tune the TSDB head chunks write buffer size when running Cortex blocks storage. #3518
   458  * [ENHANCEMENT] /metrics now supports OpenMetrics output. HTTP and gRPC servers metrics can now include exemplars. #3524
   459  * [ENHANCEMENT] Expose gRPC keepalive policy options by gRPC server. #3524
   460  * [ENHANCEMENT] Blocks storage: enabled caching of `meta.json` attributes, configurable via `-blocks-storage.bucket-store.metadata-cache.metafile-attributes-ttl`. #3528
   461  * [ENHANCEMENT] Compactor: added a config validation check to fail fast if the compactor has been configured invalid block range periods (each period is expected to be a multiple of the previous one). #3534
   462  * [ENHANCEMENT] Blocks storage: concurrently fetch deletion marks from object storage. #3538
   463  * [ENHANCEMENT] Blocks storage ingester: ingester can now close idle TSDB and delete local data. #3491 #3552
   464  * [ENHANCEMENT] Blocks storage: add option to use V2 signatures for S3 authentication. #3540
   465  * [ENHANCEMENT] Exported process metrics to monitor the number of memory map areas allocated. #3537
   466    * - `process_memory_map_areas`
   467    * - `process_memory_map_areas_limit`
   468  * [ENHANCEMENT] Ruler: Expose gRPC client options. #3523
   469  * [ENHANCEMENT] Compactor: added metrics to track on-going compaction. #3535
   470    * `cortex_compactor_tenants_discovered`
   471    * `cortex_compactor_tenants_skipped`
   472    * `cortex_compactor_tenants_processing_succeeded`
   473    * `cortex_compactor_tenants_processing_failed`
   474  * [ENHANCEMENT] Added new experimental API endpoints: `POST /purger/delete_tenant` and `GET /purger/delete_tenant_status` for deleting all tenant data. Only works with blocks storage. Compactor removes blocks that belong to user marked for deletion. #3549 #3558
   475  * [ENHANCEMENT] Chunks storage: add option to use V2 signatures for S3 authentication. #3560
   476  * [ENHANCEMENT] HA Tracker: Added new limit `ha_max_clusters` to set the max number of clusters tracked for single user. This limit is disabled by default. #3668
   477  * [BUGFIX] Query-Frontend: `cortex_query_seconds_total` now return seconds not nanoseconds. #3589
   478  * [BUGFIX] Blocks storage ingester: fixed some cases leading to a TSDB WAL corruption after a partial write to disk. #3423
   479  * [BUGFIX] Blocks storage: Fix the race between ingestion and `/flush` call resulting in overlapping blocks. #3422
   480  * [BUGFIX] Querier: fixed `-querier.max-query-into-future` which wasn't correctly enforced on range queries. #3452
   481  * [BUGFIX] Fixed float64 precision stability when aggregating metrics before exposing them. This could have lead to false counters resets when querying some metrics exposed by Cortex. #3506
   482  * [BUGFIX] Querier: the meta.json sync concurrency done when running Cortex with the blocks storage is now controlled by `-blocks-storage.bucket-store.meta-sync-concurrency` instead of the incorrect `-blocks-storage.bucket-store.block-sync-concurrency` (default values are the same). #3531
   483  * [BUGFIX] Querier: fixed initialization order of querier module when using blocks storage. It now (again) waits until blocks have been synchronized. #3551
   484  
   485  ## Blocksconvert
   486  
   487  * [ENHANCEMENT] Scheduler: ability to ignore users based on regexp, using `-scheduler.ignore-users-regex` flag. #3477
   488  * [ENHANCEMENT] Builder: Parallelize reading chunks in the final stage of building block. #3470
   489  * [ENHANCEMENT] Builder: remove duplicate label names from chunk. #3547
   490  
   491  ## 1.5.0 / 2020-11-09
   492  
   493  ### Cortex
   494  
   495  * [CHANGE] Blocks storage: update the default HTTP configuration values for the S3 client to the upstream Thanos default values. #3244
   496    - `-blocks-storage.s3.http.idle-conn-timeout` is set 90 seconds.
   497    - `-blocks-storage.s3.http.response-header-timeout` is set to 2 minutes.
   498  * [CHANGE] Improved shuffle sharding support in the write path. This work introduced some config changes: #3090
   499    * Introduced `-distributor.sharding-strategy` CLI flag (and its respective `sharding_strategy` YAML config option) to explicitly specify which sharding strategy should be used in the write path
   500    * `-experimental.distributor.user-subring-size` flag renamed to `-distributor.ingestion-tenant-shard-size`
   501    * `user_subring_size` limit YAML config option renamed to `ingestion_tenant_shard_size`
   502  * [CHANGE] Dropped "blank Alertmanager configuration; using fallback" message from Info to Debug level. #3205
   503  * [CHANGE] Zone-awareness replication for time-series now should be explicitly enabled in the distributor via the `-distributor.zone-awareness-enabled` CLI flag (or its respective YAML config option). Before, zone-aware replication was implicitly enabled if a zone was set on ingesters. #3200
   504  * [CHANGE] Removed the deprecated CLI flag `-config-yaml`. You should use `-schema-config-file` instead. #3225
   505  * [CHANGE] Enforced the HTTP method required by some API endpoints which did (incorrectly) allow any method before that. #3228
   506    - `GET /`
   507    - `GET /config`
   508    - `GET /debug/fgprof`
   509    - `GET /distributor/all_user_stats`
   510    - `GET /distributor/ha_tracker`
   511    - `GET /all_user_stats`
   512    - `GET /ha-tracker`
   513    - `GET /api/v1/user_stats`
   514    - `GET /api/v1/chunks`
   515    - `GET <legacy-http-prefix>/user_stats`
   516    - `GET <legacy-http-prefix>/chunks`
   517    - `GET /services`
   518    - `GET /multitenant_alertmanager/status`
   519    - `GET /status` (alertmanager microservice)
   520    - `GET|POST /ingester/ring`
   521    - `GET|POST /ring`
   522    - `GET|POST /store-gateway/ring`
   523    - `GET|POST /compactor/ring`
   524    - `GET|POST /ingester/flush`
   525    - `GET|POST /ingester/shutdown`
   526    - `GET|POST /flush`
   527    - `GET|POST /shutdown`
   528    - `GET|POST /ruler/ring`
   529    - `POST /api/v1/push`
   530    - `POST <legacy-http-prefix>/push`
   531    - `POST /push`
   532    - `POST /ingester/push`
   533  * [CHANGE] Renamed CLI flags to configure the network interface names from which automatically detect the instance IP. #3295
   534    - `-compactor.ring.instance-interface` renamed to `-compactor.ring.instance-interface-names`
   535    - `-store-gateway.sharding-ring.instance-interface` renamed to `-store-gateway.sharding-ring.instance-interface-names`
   536    - `-distributor.ring.instance-interface` renamed to `-distributor.ring.instance-interface-names`
   537    - `-ruler.ring.instance-interface` renamed to `-ruler.ring.instance-interface-names`
   538  * [CHANGE] Renamed `-<prefix>.redis.enable-tls` CLI flag to `-<prefix>.redis.tls-enabled`, and its respective YAML config option from `enable_tls` to `tls_enabled`. #3298
   539  * [CHANGE] Increased default `-<prefix>.redis.timeout` from `100ms` to `500ms`. #3301
   540  * [CHANGE] `cortex_alertmanager_config_invalid` has been removed in favor of `cortex_alertmanager_config_last_reload_successful`. #3289
   541  * [CHANGE] Query-frontend: POST requests whose body size exceeds 10MiB will be rejected. The max body size can be customised via `-frontend.max-body-size`. #3276
   542  * [FEATURE] Shuffle sharding: added support for shuffle-sharding queriers in the query-frontend. When configured (`-frontend.max-queriers-per-tenant` globally, or using per-tenant limit `max_queriers_per_tenant`), each tenants's requests will be handled by different set of queriers. #3113 #3257
   543  * [FEATURE] Shuffle sharding: added support for shuffle-sharding ingesters on the read path. When ingesters shuffle-sharding is enabled and `-querier.shuffle-sharding-ingesters-lookback-period` is set, queriers will fetch in-memory series from the minimum set of required ingesters, selecting only ingesters which may have received series since 'now - lookback period'. #3252
   544  * [FEATURE] Query-frontend: added `compression` config to support results cache with compression. #3217
   545  * [FEATURE] Add OpenStack Swift support to blocks storage. #3303
   546  * [FEATURE] Added support for applying Prometheus relabel configs on series received by the distributor. A `metric_relabel_configs` field has been added to the per-tenant limits configuration. #3329
   547  * [FEATURE] Support for Cassandra client SSL certificates. #3384
   548  * [ENHANCEMENT] Ruler: Introduces two new limits `-ruler.max-rules-per-rule-group` and `-ruler.max-rule-groups-per-tenant` to control the number of rules per rule group and the total number of rule groups for a given user. They are disabled by default. #3366
   549  * [ENHANCEMENT] Allow to specify multiple comma-separated Cortex services to `-target` CLI option (or its respective YAML config option). For example, `-target=all,compactor` can be used to start Cortex single-binary with compactor as well. #3275
   550  * [ENHANCEMENT] Expose additional HTTP configs for the S3 backend client. New flag are listed below: #3244
   551    - `-blocks-storage.s3.http.idle-conn-timeout`
   552    - `-blocks-storage.s3.http.response-header-timeout`
   553    - `-blocks-storage.s3.http.insecure-skip-verify`
   554  * [ENHANCEMENT] Added `cortex_query_frontend_connected_clients` metric to show the number of workers currently connected to the frontend. #3207
   555  * [ENHANCEMENT] Shuffle sharding: improved shuffle sharding in the write path. Shuffle sharding now should be explicitly enabled via `-distributor.sharding-strategy` CLI flag (or its respective YAML config option) and guarantees stability, consistency, shuffling and balanced zone-awareness properties. #3090 #3214
   556  * [ENHANCEMENT] Ingester: added new metric `cortex_ingester_active_series` to track active series more accurately. Also added options to control whether active series tracking is enabled (`-ingester.active-series-metrics-enabled`, defaults to false), and how often this metric is updated (`-ingester.active-series-metrics-update-period`) and max idle time for series to be considered inactive (`-ingester.active-series-metrics-idle-timeout`). #3153
   557  * [ENHANCEMENT] Store-gateway: added zone-aware replication support to blocks replication in the store-gateway. #3200
   558  * [ENHANCEMENT] Store-gateway: exported new metrics. #3231
   559    - `cortex_bucket_store_cached_series_fetch_duration_seconds`
   560    - `cortex_bucket_store_cached_postings_fetch_duration_seconds`
   561    - `cortex_bucket_stores_gate_queries_max`
   562  * [ENHANCEMENT] Added `-version` flag to Cortex. #3233
   563  * [ENHANCEMENT] Hash ring: added instance registered timestamp to the ring. #3248
   564  * [ENHANCEMENT] Reduce tail latency by smoothing out spikes in rate of chunk flush operations. #3191
   565  * [ENHANCEMENT] User Cortex as User Agent in http requests issued by Configs DB client. #3264
   566  * [ENHANCEMENT] Experimental Ruler API: Fetch rule groups from object storage in parallel. #3218
   567  * [ENHANCEMENT] Chunks GCS object storage client uses the `fields` selector to limit the payload size when listing objects in the bucket. #3218 #3292
   568  * [ENHANCEMENT] Added shuffle sharding support to ruler. Added new metric `cortex_ruler_sync_rules_total`. #3235
   569  * [ENHANCEMENT] Return an explicit error when the store-gateway is explicitly requested without a blocks storage engine. #3287
   570  * [ENHANCEMENT] Ruler: only load rules that belong to the ruler. Improves rules synching performances when ruler sharding is enabled. #3269
   571  * [ENHANCEMENT] Added `-<prefix>.redis.tls-insecure-skip-verify` flag. #3298
   572  * [ENHANCEMENT] Added `cortex_alertmanager_config_last_reload_successful_seconds` metric to show timestamp of last successful AM config reload. #3289
   573  * [ENHANCEMENT] Blocks storage: reduced number of bucket listing operations to list block content (applies to newly created blocks only). #3363
   574  * [ENHANCEMENT] Ruler: Include the tenant ID on the notifier logs. #3372
   575  * [ENHANCEMENT] Blocks storage Compactor: Added `-compactor.enabled-tenants` and `-compactor.disabled-tenants` to explicitly enable or disable compaction of specific tenants. #3385
   576  * [ENHANCEMENT] Blocks storage ingester: Creating checkpoint only once even when there are multiple Head compactions in a single `Compact()` call. #3373
   577  * [BUGFIX] Blocks storage ingester: Read repair memory-mapped chunks file which can end up being empty on abrupt shutdowns combined with faulty disks. #3373
   578  * [BUGFIX] Blocks storage ingester: Close TSDB resources on failed startup preventing ingester OOMing. #3373
   579  * [BUGFIX] No-longer-needed ingester operations for queries triggered by queriers and rulers are now canceled. #3178
   580  * [BUGFIX] Ruler: directories in the configured `rules-path` will be removed on startup and shutdown in order to ensure they don't persist between runs. #3195
   581  * [BUGFIX] Handle hash-collisions in the query path. #3192
   582  * [BUGFIX] Check for postgres rows errors. #3197
   583  * [BUGFIX] Ruler Experimental API: Don't allow rule groups without names or empty rule groups. #3210
   584  * [BUGFIX] Experimental Alertmanager API: Do not allow empty Alertmanager configurations or bad template filenames to be submitted through the configuration API. #3185
   585  * [BUGFIX] Reduce failures to update heartbeat when using Consul. #3259
   586  * [BUGFIX] When using ruler sharding, moving all user rule groups from ruler to a different one and then back could end up with some user groups not being evaluated at all. #3235
   587  * [BUGFIX] Fixed shuffle sharding consistency when zone-awareness is enabled and the shard size is increased or instances in a new zone are added. #3299
   588  * [BUGFIX] Use a valid grpc header when logging IP addresses. #3307
   589  * [BUGFIX] Fixed the metric `cortex_prometheus_rule_group_duration_seconds` in the Ruler, it wouldn't report any values. #3310
   590  * [BUGFIX] Fixed gRPC connections leaking in rulers when rulers sharding is enabled and APIs called. #3314
   591  * [BUGFIX] Fixed shuffle sharding consistency when zone-awareness is enabled and the shard size is increased or instances in a new zone are added. #3299
   592  * [BUGFIX] Fixed Gossip memberlist members joining when addresses are configured using DNS-based service discovery. #3360
   593  * [BUGFIX] Ingester: fail to start an ingester running the blocks storage, if unable to load any existing TSDB at startup. #3354
   594  * [BUGFIX] Blocks storage: Avoid deletion of blocks in the ingester which are not shipped to the storage yet. #3346
   595  * [BUGFIX] Fix common prefixes returned by List method of S3 client. #3358
   596  * [BUGFIX] Honor configured timeout in Azure and GCS object clients. #3285
   597  * [BUGFIX] Blocks storage: Avoid creating blocks larger than configured block range period on forced compaction and when TSDB is idle. #3344
   598  * [BUGFIX] Shuffle sharding: fixed max global series per user/metric limit when shuffle sharding and `-distributor.shard-by-all-labels=true` are both enabled in distributor. When using these global limits you should now set `-distributor.sharding-strategy` and `-distributor.zone-awareness-enabled` to ingesters too. #3369
   599  * [BUGFIX] Slow query logging: when using downstream server request parameters were not logged. #3276
   600  * [BUGFIX] Fixed tenant detection in the ruler and alertmanager API when running without auth. #3343
   601  
   602  ### Blocksconvert
   603  
   604  * [ENHANCEMENT] Blocksconvert – Builder: download plan file locally before processing it. #3209
   605  * [ENHANCEMENT] Blocksconvert – Cleaner: added new tool for deleting chunks data. #3283
   606  * [ENHANCEMENT] Blocksconvert – Scanner: support for scanning specific date-range only. #3222
   607  * [ENHANCEMENT] Blocksconvert – Scanner: metrics for tracking progress. #3222
   608  * [ENHANCEMENT] Blocksconvert – Builder: retry block upload before giving up. #3245
   609  * [ENHANCEMENT] Blocksconvert – Scanner: upload plans concurrently. #3340
   610  * [BUGFIX] Blocksconvert: fix chunks ordering in the block. Chunks in different order than series work just fine in TSDB blocks at the moment, but it's not consistent with what Prometheus does and future Prometheus and Cortex optimizations may rely on this ordering. #3371
   611  
   612  ## 1.4.0 / 2020-10-02
   613  
   614  * [CHANGE] TLS configuration for gRPC, HTTP and etcd clients is now marked as experimental. These features are not yet fully baked, and we expect possible small breaking changes in Cortex 1.5. #3198
   615  * [CHANGE] Cassandra backend support is now GA (stable). #3180
   616  * [CHANGE] Blocks storage is now GA (stable). The `-experimental` prefix has been removed from all CLI flags related to the blocks storage (no YAML config changes). #3180 #3201
   617    - `-experimental.blocks-storage.*` flags renamed to `-blocks-storage.*`
   618    - `-experimental.store-gateway.*` flags renamed to `-store-gateway.*`
   619    - `-experimental.querier.store-gateway-client.*` flags renamed to `-querier.store-gateway-client.*`
   620    - `-experimental.querier.store-gateway-addresses` flag renamed to `-querier.store-gateway-addresses`
   621    - `-store-gateway.replication-factor` flag renamed to `-store-gateway.sharding-ring.replication-factor`
   622    - `-store-gateway.tokens-file-path` flag renamed to `store-gateway.sharding-ring.tokens-file-path`
   623  * [CHANGE] Ingester: Removed deprecated untyped record from chunks WAL. Only if you are running `v1.0` or below, it is recommended to first upgrade to `v1.1`/`v1.2`/`v1.3` and run it for a day before upgrading to `v1.4` to avoid data loss. #3115
   624  * [CHANGE] Distributor API endpoints are no longer served unless target is set to `distributor` or `all`. #3112
   625  * [CHANGE] Increase the default Cassandra client replication factor to 3. #3007
   626  * [CHANGE] Blocks storage: removed the support to transfer blocks between ingesters on shutdown. When running the Cortex blocks storage, ingesters are expected to run with a persistent disk. The following metrics have been removed: #2996
   627    * `cortex_ingester_sent_files`
   628    * `cortex_ingester_received_files`
   629    * `cortex_ingester_received_bytes_total`
   630    * `cortex_ingester_sent_bytes_total`
   631  * [CHANGE] The buckets for the `cortex_chunk_store_index_lookups_per_query` metric have been changed to 1, 2, 4, 8, 16. #3021
   632  * [CHANGE] Blocks storage: the `operation` label value `getrange` has changed into `get_range` for the metrics `thanos_store_bucket_cache_operation_requests_total` and `thanos_store_bucket_cache_operation_hits_total`. #3000
   633  * [CHANGE] Experimental Delete Series: `/api/v1/admin/tsdb/delete_series` and `/api/v1/admin/tsdb/cancel_delete_request` purger APIs to return status code `204` instead of `200` for success. #2946
   634  * [CHANGE] Histogram `cortex_memcache_request_duration_seconds` `method` label value changes from `Memcached.Get` to `Memcached.GetBatched` for batched lookups, and is not reported for non-batched lookups (label value `Memcached.GetMulti` remains, and had exactly the same value as `Get` in nonbatched lookups).  The same change applies to tracing spans. #3046
   635  * [CHANGE] TLS server validation is now enabled by default, a new parameter `tls_insecure_skip_verify` can be set to true to skip validation optionally. #3030
   636  * [CHANGE] `cortex_ruler_config_update_failures_total` has been removed in favor of `cortex_ruler_config_last_reload_successful`. #3056
   637  * [CHANGE] `ruler.evaluation_delay_duration` field in YAML config has been moved and renamed to `limits.ruler_evaluation_delay_duration`. #3098
   638  * [CHANGE] Removed obsolete `results_cache.max_freshness` from YAML config (deprecated since Cortex 1.2). #3145
   639  * [CHANGE] Removed obsolete `-promql.lookback-delta` option (deprecated since Cortex 1.2, replaced with `-querier.lookback-delta`). #3144
   640  * [CHANGE] Cache: added support for Redis Cluster and Redis Sentinel. #2961
   641    - The following changes have been made in Redis configuration:
   642     - `-redis.master_name` added
   643     - `-redis.db` added
   644     - `-redis.max-active-conns` changed to `-redis.pool-size`
   645     - `-redis.max-conn-lifetime` changed to `-redis.max-connection-age`
   646     - `-redis.max-idle-conns` removed
   647     - `-redis.wait-on-pool-exhaustion` removed
   648  * [CHANGE] TLS configuration for gRPC, HTTP and etcd clients is now marked as experimental. These features are not yet fully baked, and we expect possible small breaking changes in Cortex 1.5. #3198
   649  * [CHANGE] Fixed store-gateway CLI flags inconsistencies. #3201
   650    - `-store-gateway.replication-factor` flag renamed to `-store-gateway.sharding-ring.replication-factor`
   651    - `-store-gateway.tokens-file-path` flag renamed to `store-gateway.sharding-ring.tokens-file-path`
   652  * [FEATURE] Logging of the source IP passed along by a reverse proxy is now supported by setting the `-server.log-source-ips-enabled`. For non standard headers the settings `-server.log-source-ips-header` and `-server.log-source-ips-regex` can be used. #2985
   653  * [FEATURE] Blocks storage: added shuffle sharding support to store-gateway blocks sharding. Added the following additional metrics to store-gateway: #3069
   654    * `cortex_bucket_stores_tenants_discovered`
   655    * `cortex_bucket_stores_tenants_synced`
   656  * [FEATURE] Experimental blocksconvert: introduce an experimental tool `blocksconvert` to migrate long-term storage chunks to blocks. #3092 #3122 #3127 #3162
   657  * [ENHANCEMENT] Improve the Alertmanager logging when serving requests from its API / UI. #3397
   658  * [ENHANCEMENT] Add support for azure storage in China, German and US Government environments. #2988
   659  * [ENHANCEMENT] Query-tee: added a small tolerance to floating point sample values comparison. #2994
   660  * [ENHANCEMENT] Query-tee: add support for doing a passthrough of requests to preferred backend for unregistered routes #3018
   661  * [ENHANCEMENT] Expose `storage.aws.dynamodb.backoff_config` configuration file field. #3026
   662  * [ENHANCEMENT] Added `cortex_request_message_bytes` and `cortex_response_message_bytes` histograms to track received and sent gRPC message and HTTP request/response sizes. Added `cortex_inflight_requests` gauge to track number of inflight gRPC and HTTP requests. #3064
   663  * [ENHANCEMENT] Publish ruler's ring metrics. #3074
   664  * [ENHANCEMENT] Add config validation to the experimental Alertmanager API. Invalid configs are no longer accepted. #3053
   665  * [ENHANCEMENT] Add "integration" as a label for `cortex_alertmanager_notifications_total` and `cortex_alertmanager_notifications_failed_total` metrics. #3056
   666  * [ENHANCEMENT] Add `cortex_ruler_config_last_reload_successful` and `cortex_ruler_config_last_reload_successful_seconds` to check status of users rule manager. #3056
   667  * [ENHANCEMENT] The configuration validation now fails if an empty YAML node has been set for a root YAML config property. #3080
   668  * [ENHANCEMENT] Memcached dial() calls now have a circuit-breaker to avoid hammering a broken cache. #3051, #3189
   669  * [ENHANCEMENT] `-ruler.evaluation-delay-duration` is now overridable as a per-tenant limit, `ruler_evaluation_delay_duration`. #3098
   670  * [ENHANCEMENT] Add TLS support to etcd client. #3102
   671  * [ENHANCEMENT] When a tenant accesses the Alertmanager UI or its API, if we have valid `-alertmanager.configs.fallback` we'll use that to start the manager and avoid failing the request. #3073
   672  * [ENHANCEMENT] Add `DELETE api/v1/rules/{namespace}` to the Ruler. It allows all the rule groups of a namespace to be deleted. #3120
   673  * [ENHANCEMENT] Experimental Delete Series: Retry processing of Delete requests during failures. #2926
   674  * [ENHANCEMENT] Improve performance of QueryStream() in ingesters. #3177
   675  * [ENHANCEMENT] Modules included in "All" target are now visible in output of `-modules` CLI flag. #3155
   676  * [ENHANCEMENT] Added `/debug/fgprof` endpoint to debug running Cortex process using `fgprof`. This adds up to the existing `/debug/...` endpoints. #3131
   677  * [ENHANCEMENT] Blocks storage: optimised `/api/v1/series` for blocks storage. (#2976)
   678  * [BUGFIX] Ruler: when loading rules from "local" storage, check for directory after resolving symlink. #3137
   679  * [BUGFIX] Query-frontend: Fixed rounding for incoming query timestamps, to be 100% Prometheus compatible. #2990
   680  * [BUGFIX] Querier: Merge results from chunks and blocks ingesters when using streaming of results. #3013
   681  * [BUGFIX] Querier: query /series from ingesters regardless the `-querier.query-ingesters-within` setting. #3035
   682  * [BUGFIX] Blocks storage: Ingester is less likely to hit gRPC message size limit when streaming data to queriers. #3015
   683  * [BUGFIX] Blocks storage: fixed memberlist support for the store-gateways and compactors ring used when blocks sharding is enabled. #3058 #3095
   684  * [BUGFIX] Fix configuration for TLS server validation, TLS skip verify was hardcoded to true for all TLS configurations and prevented validation of server certificates. #3030
   685  * [BUGFIX] Fixes the Alertmanager panicking when no `-alertmanager.web.external-url` is provided. #3017
   686  * [BUGFIX] Fixes the registration of the Alertmanager API metrics `cortex_alertmanager_alerts_received_total` and `cortex_alertmanager_alerts_invalid_total`. #3065
   687  * [BUGFIX] Fixes `flag needs an argument: -config.expand-env` error. #3087
   688  * [BUGFIX] An index optimisation actually slows things down when using caching. Moved it to the right location. #2973
   689  * [BUGFIX] Ingester: If push request contained both valid and invalid samples, valid samples were ingested but not stored to WAL of the chunks storage. This has been fixed. #3067
   690  * [BUGFIX] Cassandra: fixed consistency setting in the CQL session when creating the keyspace. #3105
   691  * [BUGFIX] Ruler: Config API would return both the `record` and `alert` in `YAML` response keys even when one of them must be empty. #3120
   692  * [BUGFIX] Index page now uses configured HTTP path prefix when creating links. #3126
   693  * [BUGFIX] Purger: fixed deadlock when reloading of tombstones failed. #3182
   694  * [BUGFIX] Fixed panic in flusher job, when error writing chunks to the store would cause "idle" chunks to be flushed, which triggered panic. #3140
   695  * [BUGFIX] Index page no longer shows links that are not valid for running Cortex instance. #3133
   696  * [BUGFIX] Configs: prevent validation of templates to fail when using template functions. #3157
   697  * [BUGFIX] Configuring the S3 URL with an `@` but without username and password doesn't enable the AWS static credentials anymore. #3170
   698  * [BUGFIX] Limit errors on ranged queries (`api/v1/query_range`) no longer return a status code `500` but `422` instead. #3167
   699  * [BUGFIX] Handle hash-collisions in the query path. Before this fix, Cortex could occasionally mix up two different series in a query, leading to invalid results, when `-querier.ingester-streaming` was used. #3192
   700  
   701  ## 1.3.0 / 2020-08-21
   702  
   703  * [CHANGE] Replace the metric `cortex_alertmanager_configs` with `cortex_alertmanager_config_invalid` exposed by Alertmanager. #2960
   704  * [CHANGE] Experimental Delete Series: Change target flag for purger from `data-purger` to `purger`. #2777
   705  * [CHANGE] Experimental blocks storage: The max concurrent queries against the long-term storage, configured via `-experimental.blocks-storage.bucket-store.max-concurrent`, is now a limit shared across all tenants and not a per-tenant limit anymore. The default value has changed from `20` to `100` and the following new metrics have been added: #2797
   706    * `cortex_bucket_stores_gate_queries_concurrent_max`
   707    * `cortex_bucket_stores_gate_queries_in_flight`
   708    * `cortex_bucket_stores_gate_duration_seconds`
   709  * [CHANGE] Metric `cortex_ingester_flush_reasons` has been renamed to `cortex_ingester_flushing_enqueued_series_total`, and new metric `cortex_ingester_flushing_dequeued_series_total` with `outcome` label (superset of reason) has been added. #2802 #2818 #2998
   710  * [CHANGE] Experimental Delete Series: Metric `cortex_purger_oldest_pending_delete_request_age_seconds` would track age of delete requests since they are over their cancellation period instead of their creation time. #2806
   711  * [CHANGE] Experimental blocks storage: the store-gateway service is required in a Cortex cluster running with the experimental blocks storage. Removed the `-experimental.tsdb.store-gateway-enabled` CLI flag and `store_gateway_enabled` YAML config option. The store-gateway is now always enabled when the storage engine is `blocks`. #2822
   712  * [CHANGE] Experimental blocks storage: removed support for `-experimental.blocks-storage.bucket-store.max-sample-count` flag because the implementation was flawed. To limit the number of samples/chunks processed by a single query you can set `-store.query-chunk-limit`, which is now supported by the blocks storage too. #2852
   713  * [CHANGE] Ingester: Chunks flushed via /flush stay in memory until retention period is reached. This affects `cortex_ingester_memory_chunks` metric. #2778
   714  * [CHANGE] Querier: the error message returned when the query time range exceeds `-store.max-query-length` has changed from `invalid query, length > limit (X > Y)` to `the query time range exceeds the limit (query length: X, limit: Y)`. #2826
   715  * [CHANGE] Add `component` label to metrics exposed by chunk, delete and index store clients. #2774
   716  * [CHANGE] Querier: when `-querier.query-ingesters-within` is configured, the time range of the query sent to ingesters is now manipulated to ensure the query start time is not older than 'now - query-ingesters-within'. #2904
   717  * [CHANGE] KV: The `role` label which was a label of `multi` KV store client only has been added to metrics of every KV store client. If KV store client is not `multi`, then the value of `role` label is `primary`. #2837
   718  * [CHANGE] Added the `engine` label to the metrics exposed by the Prometheus query engine, to distinguish between `ruler` and `querier` metrics. #2854
   719  * [CHANGE] Added ruler to the single binary when started with `-target=all` (default). #2854
   720  * [CHANGE] Experimental blocks storage: compact head when opening TSDB. This should only affect ingester startup after it was unable to compact head in previous run. #2870
   721  * [CHANGE] Metric `cortex_overrides_last_reload_successful` has been renamed to `cortex_runtime_config_last_reload_successful`. #2874
   722  * [CHANGE] HipChat support has been removed from the alertmanager (because removed from the Prometheus upstream too). #2902
   723  * [CHANGE] Add constant label `name` to metric `cortex_cache_request_duration_seconds`. #2903
   724  * [CHANGE] Add `user` label to metric `cortex_query_frontend_queue_length`. #2939
   725  * [CHANGE] Experimental blocks storage: cleaned up the config and renamed "TSDB" to "blocks storage". #2937
   726    - The storage engine setting value has been changed from `tsdb` to `blocks`; this affects `-store.engine` CLI flag and its respective YAML option.
   727    - The root level YAML config has changed from `tsdb` to `blocks_storage`
   728    - The prefix of all CLI flags has changed from `-experimental.tsdb.` to `-experimental.blocks-storage.`
   729    - The following settings have been grouped under `tsdb` property in the YAML config and their CLI flags changed:
   730      - `-experimental.tsdb.dir` changed to `-experimental.blocks-storage.tsdb.dir`
   731      - `-experimental.tsdb.block-ranges-period` changed to `-experimental.blocks-storage.tsdb.block-ranges-period`
   732      - `-experimental.tsdb.retention-period` changed to `-experimental.blocks-storage.tsdb.retention-period`
   733      - `-experimental.tsdb.ship-interval` changed to `-experimental.blocks-storage.tsdb.ship-interval`
   734      - `-experimental.tsdb.ship-concurrency` changed to `-experimental.blocks-storage.tsdb.ship-concurrency`
   735      - `-experimental.tsdb.max-tsdb-opening-concurrency-on-startup` changed to `-experimental.blocks-storage.tsdb.max-tsdb-opening-concurrency-on-startup`
   736      - `-experimental.tsdb.head-compaction-interval` changed to `-experimental.blocks-storage.tsdb.head-compaction-interval`
   737      - `-experimental.tsdb.head-compaction-concurrency` changed to `-experimental.blocks-storage.tsdb.head-compaction-concurrency`
   738      - `-experimental.tsdb.head-compaction-idle-timeout` changed to `-experimental.blocks-storage.tsdb.head-compaction-idle-timeout`
   739      - `-experimental.tsdb.stripe-size` changed to `-experimental.blocks-storage.tsdb.stripe-size`
   740      - `-experimental.tsdb.wal-compression-enabled` changed to `-experimental.blocks-storage.tsdb.wal-compression-enabled`
   741      - `-experimental.tsdb.flush-blocks-on-shutdown` changed to `-experimental.blocks-storage.tsdb.flush-blocks-on-shutdown`
   742  * [CHANGE] Flags `-bigtable.grpc-use-gzip-compression`, `-ingester.client.grpc-use-gzip-compression`, `-querier.frontend-client.grpc-use-gzip-compression` are now deprecated. #2940
   743  * [CHANGE] Limit errors reported by ingester during query-time now return HTTP status code 422. #2941
   744  * [FEATURE] Introduced `ruler.for-outage-tolerance`, Max time to tolerate outage for restoring "for" state of alert. #2783
   745  * [FEATURE] Introduced `ruler.for-grace-period`, Minimum duration between alert and restored "for" state. This is maintained only for alerts with configured "for" time greater than grace period. #2783
   746  * [FEATURE] Introduced `ruler.resend-delay`, Minimum amount of time to wait before resending an alert to Alertmanager. #2783
   747  * [FEATURE] Ruler: added `local` filesystem support to store rules (read-only). #2854
   748  * [ENHANCEMENT] Upgraded Docker base images to `alpine:3.12`. #2862
   749  * [ENHANCEMENT] Experimental: Querier can now optionally query secondary store. This is specified by using `-querier.second-store-engine` option, with values `chunks` or `blocks`. Standard configuration options for this store are used. Additionally, this querying can be configured to happen only for queries that need data older than `-querier.use-second-store-before-time`. Default value of zero will always query secondary store. #2747
   750  * [ENHANCEMENT] Query-tee: increased the `cortex_querytee_request_duration_seconds` metric buckets granularity. #2799
   751  * [ENHANCEMENT] Query-tee: fail to start if the configured `-backend.preferred` is unknown. #2799
   752  * [ENHANCEMENT] Ruler: Added the following metrics: #2786
   753    * `cortex_prometheus_notifications_latency_seconds`
   754    * `cortex_prometheus_notifications_errors_total`
   755    * `cortex_prometheus_notifications_sent_total`
   756    * `cortex_prometheus_notifications_dropped_total`
   757    * `cortex_prometheus_notifications_queue_length`
   758    * `cortex_prometheus_notifications_queue_capacity`
   759    * `cortex_prometheus_notifications_alertmanagers_discovered`
   760  * [ENHANCEMENT] The behavior of the `/ready` was changed for the query frontend to indicate when it was ready to accept queries. This is intended for use by a read path load balancer that would want to wait for the frontend to have attached queriers before including it in the backend. #2733
   761  * [ENHANCEMENT] Experimental Delete Series: Add support for deletion of chunks for remaining stores. #2801
   762  * [ENHANCEMENT] Add `-modules` command line flag to list possible values for `-target`. Also, log warning if given target is internal component. #2752
   763  * [ENHANCEMENT] Added `-ingester.flush-on-shutdown-with-wal-enabled` option to enable chunks flushing even when WAL is enabled. #2780
   764  * [ENHANCEMENT] Query-tee: Support for custom API prefix by using `-server.path-prefix` option. #2814
   765  * [ENHANCEMENT] Query-tee: Forward `X-Scope-OrgId` header to backend, if present in the request. #2815
   766  * [ENHANCEMENT] Experimental blocks storage: Added `-experimental.blocks-storage.tsdb.head-compaction-idle-timeout` option to force compaction of data in memory into a block. #2803
   767  * [ENHANCEMENT] Experimental blocks storage: Added support for flushing blocks via `/flush`, `/shutdown` (previously these only worked for chunks storage) and by using `-experimental.blocks-storage.tsdb.flush-blocks-on-shutdown` option. #2794
   768  * [ENHANCEMENT] Experimental blocks storage: Added support to enforce max query time range length via `-store.max-query-length`. #2826
   769  * [ENHANCEMENT] Experimental blocks storage: Added support to limit the max number of chunks that can be fetched from the long-term storage while executing a query. The limit is enforced both in the querier and store-gateway, and is configurable via `-store.query-chunk-limit`. #2852 #2922
   770  * [ENHANCEMENT] Ingester: Added new metric `cortex_ingester_flush_series_in_progress` that reports number of ongoing flush-series operations. Useful when calling `/flush` handler: if `cortex_ingester_flush_queue_length + cortex_ingester_flush_series_in_progress` is 0, all flushes are finished. #2778
   771  * [ENHANCEMENT] Memberlist members can join cluster via SRV records. #2788
   772  * [ENHANCEMENT] Added configuration options for chunks s3 client. #2831
   773    * `s3.endpoint`
   774    * `s3.region`
   775    * `s3.access-key-id`
   776    * `s3.secret-access-key`
   777    * `s3.insecure`
   778    * `s3.sse-encryption`
   779    * `s3.http.idle-conn-timeout`
   780    * `s3.http.response-header-timeout`
   781    * `s3.http.insecure-skip-verify`
   782  * [ENHANCEMENT] Prometheus upgraded. #2798 #2849 #2867 #2902 #2918
   783    * Optimized labels regex matchers for patterns containing literals (eg. `foo.*`, `.*foo`, `.*foo.*`)
   784  * [ENHANCEMENT] Add metric `cortex_ruler_config_update_failures_total` to Ruler to track failures of loading rules files. #2857
   785  * [ENHANCEMENT] Experimental Alertmanager: Alertmanager configuration persisted to object storage using an experimental API that accepts and returns YAML-based Alertmanager configuration. #2768
   786  * [ENHANCEMENT] Ruler: `-ruler.alertmanager-url` now supports multiple URLs. Each URL is treated as a separate Alertmanager group. Support for multiple Alertmanagers in a group can be achieved by using DNS service discovery. #2851
   787  * [ENHANCEMENT] Experimental blocks storage: Cortex Flusher now works with blocks engine. Flusher needs to be provided with blocks-engine configuration, existing Flusher flags are not used (they are only relevant for chunks engine). Note that flush errors are only reported via log. #2877
   788  * [ENHANCEMENT] Flusher: Added `-flusher.exit-after-flush` option (defaults to true) to control whether Cortex should stop completely after Flusher has finished its work. #2877
   789  * [ENHANCEMENT] Added metrics `cortex_config_hash` and `cortex_runtime_config_hash` to expose hash of the currently active config file. #2874
   790  * [ENHANCEMENT] Logger: added JSON logging support, configured via the `-log.format=json` CLI flag or its respective YAML config option. #2386
   791  * [ENHANCEMENT] Added new flags `-bigtable.grpc-compression`, `-ingester.client.grpc-compression`, `-querier.frontend-client.grpc-compression` to configure compression used by gRPC. Valid values are `gzip`, `snappy`, or empty string (no compression, default). #2940
   792  * [ENHANCEMENT] Clarify limitations of the `/api/v1/series`, `/api/v1/labels` and `/api/v1/label/{name}/values` endpoints. #2953
   793  * [ENHANCEMENT] Ingester: added `Dropped` outcome to metric `cortex_ingester_flushing_dequeued_series_total`. #2998
   794  * [BUGFIX] Fixed a bug with `api/v1/query_range` where no responses would return null values for `result` and empty values for `resultType`. #2962
   795  * [BUGFIX] Fixed a bug in the index intersect code causing storage to return more chunks/series than required. #2796
   796  * [BUGFIX] Fixed the number of reported keys in the background cache queue. #2764
   797  * [BUGFIX] Fix race in processing of headers in sharded queries. #2762
   798  * [BUGFIX] Query Frontend: Do not re-split sharded requests around ingester boundaries. #2766
   799  * [BUGFIX] Experimental Delete Series: Fixed a problem with cache generation numbers prefixed to cache keys. #2800
   800  * [BUGFIX] Ingester: Flushing chunks via `/flush` endpoint could previously lead to panic, if chunks were already flushed before and then removed from memory during the flush caused by `/flush` handler. Immediate flush now doesn't cause chunks to be flushed again. Samples received during flush triggered via `/flush` handler are no longer discarded. #2778
   801  * [BUGFIX] Prometheus upgraded. #2849
   802    * Fixed unknown symbol error during head compaction
   803  * [BUGFIX] Fix panic when using cassandra as store for both index and delete requests. #2774
   804  * [BUGFIX] Experimental Delete Series: Fixed a data race in Purger. #2817
   805  * [BUGFIX] KV: Fixed a bug that triggered a panic due to metrics being registered with the same name but different labels when using a `multi` configured KV client. #2837
   806  * [BUGFIX] Query-frontend: Fix passing HTTP `Host` header if `-frontend.downstream-url` is configured. #2880
   807  * [BUGFIX] Ingester: Improve time-series distribution when `-experimental.distributor.user-subring-size` is enabled. #2887
   808  * [BUGFIX] Set content type to `application/x-protobuf` for remote_read responses. #2915
   809  * [BUGFIX] Fixed ruler and store-gateway instance registration in the ring (when sharding is enabled) when a new instance replaces abruptly terminated one, and the only difference between the two instances is the address. #2954
   810  * [BUGFIX] Fixed `Missing chunks and index config causing silent failure` Absence of chunks and index from schema config is not validated. #2732
   811  * [BUGFIX] Fix panic caused by KVs from boltdb being used beyond their life. #2971
   812  * [BUGFIX] Experimental blocks storage: `/api/v1/series`, `/api/v1/labels` and `/api/v1/label/{name}/values` only query the TSDB head regardless of the configured `-experimental.blocks-storage.tsdb.retention-period`. #2974
   813  * [BUGFIX] Ingester: Avoid indefinite checkpointing in case of surge in number of series. #2955
   814  * [BUGFIX] Querier: query /series from ingesters regardless the `-querier.query-ingesters-within` setting. #3035
   815  * [BUGFIX] Ruler: fixed an unintentional breaking change introduced in the ruler's `alertmanager_url` YAML config option, which changed the value from a string to a list of strings. #2989
   816  
   817  ## 1.2.0 / 2020-07-01
   818  
   819  * [CHANGE] Metric `cortex_kv_request_duration_seconds` now includes `name` label to denote which client is being used as well as the `backend` label to denote the KV backend implementation in use. #2648
   820  * [CHANGE] Experimental Ruler: Rule groups persisted to object storage using the experimental API have an updated object key encoding to better handle special characters. Rule groups previously-stored using object storage must be renamed to the new format. #2646
   821  * [CHANGE] Query Frontend now uses Round Robin to choose a tenant queue to service next. #2553
   822  * [CHANGE] `-promql.lookback-delta` is now deprecated and has been replaced by `-querier.lookback-delta` along with `lookback_delta` entry under `querier` in the config file. `-promql.lookback-delta` will be removed in v1.4.0. #2604
   823  * [CHANGE] Experimental TSDB: removed `-experimental.tsdb.bucket-store.binary-index-header-enabled` flag. Now the binary index-header is always enabled.
   824  * [CHANGE] Experimental TSDB: Renamed index-cache metrics to use original metric names from Thanos, as Cortex is not aggregating them in any way: #2627
   825    * `cortex_<service>_blocks_index_cache_items_evicted_total` => `thanos_store_index_cache_items_evicted_total{name="index-cache"}`
   826    * `cortex_<service>_blocks_index_cache_items_added_total` => `thanos_store_index_cache_items_added_total{name="index-cache"}`
   827    * `cortex_<service>_blocks_index_cache_requests_total` => `thanos_store_index_cache_requests_total{name="index-cache"}`
   828    * `cortex_<service>_blocks_index_cache_items_overflowed_total` => `thanos_store_index_cache_items_overflowed_total{name="index-cache"}`
   829    * `cortex_<service>_blocks_index_cache_hits_total` => `thanos_store_index_cache_hits_total{name="index-cache"}`
   830    * `cortex_<service>_blocks_index_cache_items` => `thanos_store_index_cache_items{name="index-cache"}`
   831    * `cortex_<service>_blocks_index_cache_items_size_bytes` => `thanos_store_index_cache_items_size_bytes{name="index-cache"}`
   832    * `cortex_<service>_blocks_index_cache_total_size_bytes` => `thanos_store_index_cache_total_size_bytes{name="index-cache"}`
   833    * `cortex_<service>_blocks_index_cache_memcached_operations_total` =>  `thanos_memcached_operations_total{name="index-cache"}`
   834    * `cortex_<service>_blocks_index_cache_memcached_operation_failures_total` =>  `thanos_memcached_operation_failures_total{name="index-cache"}`
   835    * `cortex_<service>_blocks_index_cache_memcached_operation_duration_seconds` =>  `thanos_memcached_operation_duration_seconds{name="index-cache"}`
   836    * `cortex_<service>_blocks_index_cache_memcached_operation_skipped_total` =>  `thanos_memcached_operation_skipped_total{name="index-cache"}`
   837  * [CHANGE] Experimental TSDB: Renamed metrics in bucket stores: #2627
   838    * `cortex_<service>_blocks_meta_syncs_total` => `cortex_blocks_meta_syncs_total{component="<service>"}`
   839    * `cortex_<service>_blocks_meta_sync_failures_total` => `cortex_blocks_meta_sync_failures_total{component="<service>"}`
   840    * `cortex_<service>_blocks_meta_sync_duration_seconds` => `cortex_blocks_meta_sync_duration_seconds{component="<service>"}`
   841    * `cortex_<service>_blocks_meta_sync_consistency_delay_seconds` => `cortex_blocks_meta_sync_consistency_delay_seconds{component="<service>"}`
   842    * `cortex_<service>_blocks_meta_synced` => `cortex_blocks_meta_synced{component="<service>"}`
   843    * `cortex_<service>_bucket_store_block_loads_total` => `cortex_bucket_store_block_loads_total{component="<service>"}`
   844    * `cortex_<service>_bucket_store_block_load_failures_total` => `cortex_bucket_store_block_load_failures_total{component="<service>"}`
   845    * `cortex_<service>_bucket_store_block_drops_total` => `cortex_bucket_store_block_drops_total{component="<service>"}`
   846    * `cortex_<service>_bucket_store_block_drop_failures_total` => `cortex_bucket_store_block_drop_failures_total{component="<service>"}`
   847    * `cortex_<service>_bucket_store_blocks_loaded` => `cortex_bucket_store_blocks_loaded{component="<service>"}`
   848    * `cortex_<service>_bucket_store_series_data_touched` => `cortex_bucket_store_series_data_touched{component="<service>"}`
   849    * `cortex_<service>_bucket_store_series_data_fetched` => `cortex_bucket_store_series_data_fetched{component="<service>"}`
   850    * `cortex_<service>_bucket_store_series_data_size_touched_bytes` => `cortex_bucket_store_series_data_size_touched_bytes{component="<service>"}`
   851    * `cortex_<service>_bucket_store_series_data_size_fetched_bytes` => `cortex_bucket_store_series_data_size_fetched_bytes{component="<service>"}`
   852    * `cortex_<service>_bucket_store_series_blocks_queried` => `cortex_bucket_store_series_blocks_queried{component="<service>"}`
   853    * `cortex_<service>_bucket_store_series_get_all_duration_seconds` => `cortex_bucket_store_series_get_all_duration_seconds{component="<service>"}`
   854    * `cortex_<service>_bucket_store_series_merge_duration_seconds` => `cortex_bucket_store_series_merge_duration_seconds{component="<service>"}`
   855    * `cortex_<service>_bucket_store_series_refetches_total` => `cortex_bucket_store_series_refetches_total{component="<service>"}`
   856    * `cortex_<service>_bucket_store_series_result_series` => `cortex_bucket_store_series_result_series{component="<service>"}`
   857    * `cortex_<service>_bucket_store_cached_postings_compressions_total` => `cortex_bucket_store_cached_postings_compressions_total{component="<service>"}`
   858    * `cortex_<service>_bucket_store_cached_postings_compression_errors_total` => `cortex_bucket_store_cached_postings_compression_errors_total{component="<service>"}`
   859    * `cortex_<service>_bucket_store_cached_postings_compression_time_seconds` => `cortex_bucket_store_cached_postings_compression_time_seconds{component="<service>"}`
   860    * `cortex_<service>_bucket_store_cached_postings_original_size_bytes_total` => `cortex_bucket_store_cached_postings_original_size_bytes_total{component="<service>"}`
   861    * `cortex_<service>_bucket_store_cached_postings_compressed_size_bytes_total` => `cortex_bucket_store_cached_postings_compressed_size_bytes_total{component="<service>"}`
   862    * `cortex_<service>_blocks_sync_seconds` => `cortex_bucket_stores_blocks_sync_seconds{component="<service>"}`
   863    * `cortex_<service>_blocks_last_successful_sync_timestamp_seconds` => `cortex_bucket_stores_blocks_last_successful_sync_timestamp_seconds{component="<service>"}`
   864  * [CHANGE] Available command-line flags are printed to stdout, and only when requested via `-help`. Using invalid flag no longer causes printing of all available flags. #2691
   865  * [CHANGE] Experimental Memberlist ring: randomize gossip node names to avoid conflicts when running multiple clients on the same host, or reusing host names (eg. pods in statefulset). Node name randomization can be disabled by using `-memberlist.randomize-node-name=false`. #2715
   866  * [CHANGE] Memberlist KV client is no longer considered experimental. #2725
   867  * [CHANGE] Experimental Delete Series: Make delete request cancellation duration configurable. #2760
   868  * [CHANGE] Removed `-store.fullsize-chunks` option which was undocumented and unused (it broke ingester hand-overs). #2656
   869  * [CHANGE] Query with no metric name that has previously resulted in HTTP status code 500 now returns status code 422 instead. #2571
   870  * [FEATURE] TLS config options added for GRPC clients in Querier (Query-frontend client & Ingester client), Ruler, Store Gateway, as well as HTTP client in Config store client. #2502
   871  * [FEATURE] The flag `-frontend.max-cache-freshness` is now supported within the limits overrides, to specify per-tenant max cache freshness values. The corresponding YAML config parameter has been changed from `results_cache.max_freshness` to `limits_config.max_cache_freshness`. The legacy YAML config parameter (`results_cache.max_freshness`) will continue to be supported till Cortex release `v1.4.0`. #2609
   872  * [FEATURE] Experimental gRPC Store: Added support to 3rd parties index and chunk stores using gRPC client/server plugin mechanism. #2220
   873  * [FEATURE] Add `-cassandra.table-options` flag to customize table options of Cassandra when creating the index or chunk table. #2575
   874  * [ENHANCEMENT] Propagate GOPROXY value when building `build-image`. This is to help the builders building the code in a Network where default Go proxy is not accessible (e.g. when behind some corporate VPN). #2741
   875  * [ENHANCEMENT] Querier: Added metric `cortex_querier_request_duration_seconds` for all requests to the querier. #2708
   876  * [ENHANCEMENT] Cortex is now built with Go 1.14. #2480 #2749 #2753
   877  * [ENHANCEMENT] Experimental TSDB: added the following metrics to the ingester: #2580 #2583 #2589 #2654
   878    * `cortex_ingester_tsdb_appender_add_duration_seconds`
   879    * `cortex_ingester_tsdb_appender_commit_duration_seconds`
   880    * `cortex_ingester_tsdb_refcache_purge_duration_seconds`
   881    * `cortex_ingester_tsdb_compactions_total`
   882    * `cortex_ingester_tsdb_compaction_duration_seconds`
   883    * `cortex_ingester_tsdb_wal_fsync_duration_seconds`
   884    * `cortex_ingester_tsdb_wal_page_flushes_total`
   885    * `cortex_ingester_tsdb_wal_completed_pages_total`
   886    * `cortex_ingester_tsdb_wal_truncations_failed_total`
   887    * `cortex_ingester_tsdb_wal_truncations_total`
   888    * `cortex_ingester_tsdb_wal_writes_failed_total`
   889    * `cortex_ingester_tsdb_checkpoint_deletions_failed_total`
   890    * `cortex_ingester_tsdb_checkpoint_deletions_total`
   891    * `cortex_ingester_tsdb_checkpoint_creations_failed_total`
   892    * `cortex_ingester_tsdb_checkpoint_creations_total`
   893    * `cortex_ingester_tsdb_wal_truncate_duration_seconds`
   894    * `cortex_ingester_tsdb_head_active_appenders`
   895    * `cortex_ingester_tsdb_head_series_not_found_total`
   896    * `cortex_ingester_tsdb_head_chunks`
   897    * `cortex_ingester_tsdb_mmap_chunk_corruptions_total`
   898    * `cortex_ingester_tsdb_head_chunks_created_total`
   899    * `cortex_ingester_tsdb_head_chunks_removed_total`
   900  * [ENHANCEMENT] Experimental TSDB: added metrics useful to alert on critical conditions of the blocks storage: #2573
   901    * `cortex_compactor_last_successful_run_timestamp_seconds`
   902    * `cortex_querier_blocks_last_successful_sync_timestamp_seconds` (when store-gateway is disabled)
   903    * `cortex_querier_blocks_last_successful_scan_timestamp_seconds` (when store-gateway is enabled)
   904    * `cortex_storegateway_blocks_last_successful_sync_timestamp_seconds`
   905  * [ENHANCEMENT] Experimental TSDB: added the flag `-experimental.tsdb.wal-compression-enabled` to allow to enable TSDB WAL compression. #2585
   906  * [ENHANCEMENT] Experimental TSDB: Querier and store-gateway components can now use so-called "caching bucket", which can currently cache fetched chunks into shared memcached server. #2572
   907  * [ENHANCEMENT] Ruler: Automatically remove unhealthy rulers from the ring. #2587
   908  * [ENHANCEMENT] Query-tee: added support to `/metadata`, `/alerts`, and `/rules` endpoints #2600
   909  * [ENHANCEMENT] Query-tee: added support to query results comparison between two different backends. The comparison is disabled by default and can be enabled via `-proxy.compare-responses=true`. #2611
   910  * [ENHANCEMENT] Query-tee: improved the query-tee to not wait all backend responses before sending back the response to the client. The query-tee now sends back to the client first successful response, while honoring the `-backend.preferred` option. #2702
   911  * [ENHANCEMENT] Thanos and Prometheus upgraded. #2602 #2604 #2634 #2659 #2686 #2756
   912    * TSDB now holds less WAL files after Head Truncation.
   913    * TSDB now does memory-mapping of Head chunks and reduces memory usage.
   914  * [ENHANCEMENT] Experimental TSDB: decoupled blocks deletion from blocks compaction in the compactor, so that blocks deletion is not blocked by a busy compactor. The following metrics have been added: #2623
   915    * `cortex_compactor_block_cleanup_started_total`
   916    * `cortex_compactor_block_cleanup_completed_total`
   917    * `cortex_compactor_block_cleanup_failed_total`
   918    * `cortex_compactor_block_cleanup_last_successful_run_timestamp_seconds`
   919  * [ENHANCEMENT] Experimental TSDB: Use shared cache for metadata. This is especially useful when running multiple querier and store-gateway components to reduce number of object store API calls. #2626 #2640
   920  * [ENHANCEMENT] Experimental TSDB: when `-querier.query-store-after` is configured and running the experimental blocks storage, the time range of the query sent to the store is now manipulated to ensure the query end time is not more recent than 'now - query-store-after'. #2642
   921  * [ENHANCEMENT] Experimental TSDB: small performance improvement in concurrent usage of RefCache, used during samples ingestion. #2651
   922  * [ENHANCEMENT] The following endpoints now respond appropriately to an `Accept` header with the value `application/json` #2673
   923    * `/distributor/all_user_stats`
   924    * `/distributor/ha_tracker`
   925    * `/ingester/ring`
   926    * `/store-gateway/ring`
   927    * `/compactor/ring`
   928    * `/ruler/ring`
   929    * `/services`
   930  * [ENHANCEMENT] Experimental Cassandra backend: Add `-cassandra.num-connections` to allow increasing the number of TCP connections to each Cassandra server. #2666
   931  * [ENHANCEMENT] Experimental Cassandra backend: Use separate Cassandra clients and connections for reads and writes. #2666
   932  * [ENHANCEMENT] Experimental Cassandra backend: Add `-cassandra.reconnect-interval` to allow specifying the reconnect interval to a Cassandra server that has been marked `DOWN` by the gocql driver. Also change the default value of the reconnect interval from `60s` to `1s`. #2687
   933  * [ENHANCEMENT] Experimental Cassandra backend: Add option `-cassandra.convict-hosts-on-failure=false` to not convict host of being down when a request fails. #2684
   934  * [ENHANCEMENT] Experimental TSDB: Applied a jitter to the period bucket scans in order to better distribute bucket operations over the time and increase the probability of hitting the shared cache (if configured). #2693
   935  * [ENHANCEMENT] Experimental TSDB: Series limit per user and per metric now work in TSDB blocks. #2676
   936  * [ENHANCEMENT] Experimental Memberlist: Added ability to periodically rejoin the memberlist cluster. #2724
   937  * [ENHANCEMENT] Experimental Delete Series: Added the following metrics for monitoring processing of delete requests: #2730
   938    - `cortex_purger_load_pending_requests_attempts_total`: Number of attempts that were made to load pending requests with status.
   939    - `cortex_purger_oldest_pending_delete_request_age_seconds`: Age of oldest pending delete request in seconds.
   940    - `cortex_purger_pending_delete_requests_count`: Count of requests which are in process or are ready to be processed.
   941  * [ENHANCEMENT] Experimental TSDB: Improved compactor to hard-delete also partial blocks with an deletion mark (even if the deletion mark threshold has not been reached). #2751
   942  * [ENHANCEMENT] Experimental TSDB: Introduced a consistency check done by the querier to ensure all expected blocks have been queried via the store-gateway. If a block is missing on a store-gateway, the querier retries fetching series from missing blocks up to 3 times. If the consistency check fails once all retries have been exhausted, the query execution fails. The following metrics have been added: #2593 #2630 #2689 #2695
   943    * `cortex_querier_blocks_consistency_checks_total`
   944    * `cortex_querier_blocks_consistency_checks_failed_total`
   945    * `cortex_querier_storegateway_refetches_per_query`
   946  * [ENHANCEMENT] Delete requests can now be canceled #2555
   947  * [ENHANCEMENT] Table manager can now provision tables for delete store #2546
   948  * [BUGFIX] Ruler: Ensure temporary rule files with special characters are properly mapped and cleaned up. #2506
   949  * [BUGFIX] Fixes #2411, Ensure requests are properly routed to the prometheus api embedded in the query if `-server.path-prefix` is set. #2372
   950  * [BUGFIX] Experimental TSDB: fixed chunk data corruption when querying back series using the experimental blocks storage. #2400
   951  * [BUGFIX] Fixed collection of tracing spans from Thanos components used internally. #2655
   952  * [BUGFIX] Experimental TSDB: fixed memory leak in ingesters. #2586
   953  * [BUGFIX] QueryFrontend: fixed a situation where HTTP error is ignored and an incorrect status code is set. #2590
   954  * [BUGFIX] Ingester: Fix an ingester starting up in the JOINING state and staying there forever. #2565
   955  * [BUGFIX] QueryFrontend: fixed a panic (`integer divide by zero`) in the query-frontend. The query-frontend now requires the `-querier.default-evaluation-interval` config to be set to the same value of the querier. #2614
   956  * [BUGFIX] Experimental TSDB: when the querier receives a `/series` request with a time range older than the data stored in the ingester, it now ignores the requested time range and returns known series anyway instead of returning an empty response. This aligns the behaviour with the chunks storage. #2617
   957  * [BUGFIX] Cassandra: fixed an edge case leading to an invalid CQL query when querying the index on a Cassandra store. #2639
   958  * [BUGFIX] Ingester: increment series per metric when recovering from WAL or transfer. #2674
   959  * [BUGFIX] Fixed `wrong number of arguments for 'mget' command` Redis error when a query has no chunks to lookup from storage. #2700 #2796
   960  * [BUGFIX] Ingester: Automatically remove old tmp checkpoints, fixing a potential disk space leak after an ingester crashes. #2726
   961  
   962  ## 1.1.0 / 2020-05-21
   963  
   964  This release brings the usual mix of bugfixes and improvements. The biggest change is that WAL support for chunks is now considered to be production-ready!
   965  
   966  Please make sure to review renamed metrics, and update your dashboards and alerts accordingly.
   967  
   968  * [CHANGE] Added v1 API routes documented in #2327. #2372
   969    * Added `-http.alertmanager-http-prefix` flag which allows the configuration of the path where the Alertmanager API and UI can be reached. The default is set to `/alertmanager`.
   970    * Added `-http.prometheus-http-prefix` flag which allows the configuration of the path where the Prometheus API and UI can be reached. The default is set to `/prometheus`.
   971    * Updated the index hosted at the root prefix to point to the updated routes.
   972    * Legacy routes hardcoded with the `/api/prom` prefix now respect the `-http.prefix` flag.
   973  * [CHANGE] The metrics `cortex_distributor_ingester_appends_total` and `distributor_ingester_append_failures_total` now include a `type` label to differentiate between `samples` and `metadata`. #2336
   974  * [CHANGE] The metrics for number of chunks and bytes flushed to the chunk store are renamed. Note that previous metrics were counted pre-deduplication, while new metrics are counted after deduplication. #2463
   975    * `cortex_ingester_chunks_stored_total` > `cortex_chunk_store_stored_chunks_total`
   976    * `cortex_ingester_chunk_stored_bytes_total` > `cortex_chunk_store_stored_chunk_bytes_total`
   977  * [CHANGE] Experimental TSDB: renamed blocks meta fetcher metrics: #2375
   978    * `cortex_querier_bucket_store_blocks_meta_syncs_total` > `cortex_querier_blocks_meta_syncs_total`
   979    * `cortex_querier_bucket_store_blocks_meta_sync_failures_total` > `cortex_querier_blocks_meta_sync_failures_total`
   980    * `cortex_querier_bucket_store_blocks_meta_sync_duration_seconds` > `cortex_querier_blocks_meta_sync_duration_seconds`
   981    * `cortex_querier_bucket_store_blocks_meta_sync_consistency_delay_seconds` > `cortex_querier_blocks_meta_sync_consistency_delay_seconds`
   982  * [CHANGE] Experimental TSDB: Modified default values for `compactor.deletion-delay` option from 48h to 12h and `-experimental.tsdb.bucket-store.ignore-deletion-marks-delay` from 24h to 6h. #2414
   983  * [CHANGE] WAL: Default value of `-ingester.checkpoint-enabled` changed to `true`. #2416
   984  * [CHANGE] `trace_id` field in log files has been renamed to `traceID`. #2518
   985  * [CHANGE] Slow query log has a different output now. Previously used `url` field has been replaced with `host` and `path`, and query parameters are logged as individual log fields with `qs_` prefix. #2520
   986  * [CHANGE] WAL: WAL and checkpoint compression is now disabled. #2436
   987  * [CHANGE] Update in dependency `go-kit/kit` from `v0.9.0` to `v0.10.0`. HTML escaping disabled in JSON Logger. #2535
   988  * [CHANGE] Experimental TSDB: Removed `cortex_<service>_` prefix from Thanos objstore metrics and added `component` label to distinguish which Cortex component is doing API calls to the object storage when running in single-binary mode: #2568
   989    - `cortex_<service>_thanos_objstore_bucket_operations_total` renamed to `thanos_objstore_bucket_operations_total{component="<name>"}`
   990    - `cortex_<service>_thanos_objstore_bucket_operation_failures_total` renamed to `thanos_objstore_bucket_operation_failures_total{component="<name>"}`
   991    - `cortex_<service>_thanos_objstore_bucket_operation_duration_seconds` renamed to `thanos_objstore_bucket_operation_duration_seconds{component="<name>"}`
   992    - `cortex_<service>_thanos_objstore_bucket_last_successful_upload_time` renamed to `thanos_objstore_bucket_last_successful_upload_time{component="<name>"}`
   993  * [CHANGE] FIFO cache: The `-<prefix>.fifocache.size` CLI flag has been renamed to `-<prefix>.fifocache.max-size-items` as well as its YAML config option `size` renamed to `max_size_items`. #2319
   994  * [FEATURE] Ruler: The `-ruler.evaluation-delay` flag was added to allow users to configure a default evaluation delay for all rules in cortex. The default value is 0 which is the current behavior. #2423
   995  * [FEATURE] Experimental: Added a new object storage client for OpenStack Swift. #2440
   996  * [FEATURE] TLS config options added to the Server. #2535
   997  * [FEATURE] Experimental: Added support for `/api/v1/metadata` Prometheus-based endpoint. #2549
   998  * [FEATURE] Add ability to limit concurrent queries to Cassandra with `-cassandra.query-concurrency` flag. #2562
   999  * [FEATURE] Experimental TSDB: Introduced store-gateway service used by the experimental blocks storage to load and query blocks. The store-gateway optionally supports blocks sharding and replication via a dedicated hash ring, configurable via `-experimental.store-gateway.sharding-enabled` and `-experimental.store-gateway.sharding-ring.*` flags. The following metrics have been added: #2433 #2458 #2469 #2523
  1000    * `cortex_querier_storegateway_instances_hit_per_query`
  1001  * [ENHANCEMENT] Experimental TSDB: sample ingestion errors are now reported via existing `cortex_discarded_samples_total` metric. #2370
  1002  * [ENHANCEMENT] Failures on samples at distributors and ingesters return the first validation error as opposed to the last. #2383
  1003  * [ENHANCEMENT] Experimental TSDB: Added `cortex_querier_blocks_meta_synced`, which reflects current state of synced blocks over all tenants. #2392
  1004  * [ENHANCEMENT] Added `cortex_distributor_latest_seen_sample_timestamp_seconds` metric to see how far behind Prometheus servers are in sending data. #2371
  1005  * [ENHANCEMENT] FIFO cache to support eviction based on memory usage. Added `-<prefix>.fifocache.max-size-bytes` CLI flag and YAML config option `max_size_bytes` to specify memory limit of the cache. #2319, #2527
  1006  * [ENHANCEMENT] Added `-querier.worker-match-max-concurrent`. Force worker concurrency to match the `-querier.max-concurrent` option.  Overrides `-querier.worker-parallelism`.  #2456
  1007  * [ENHANCEMENT] Added the following metrics for monitoring delete requests: #2445
  1008    - `cortex_purger_delete_requests_received_total`: Number of delete requests received per user.
  1009    - `cortex_purger_delete_requests_processed_total`: Number of delete requests processed per user.
  1010    - `cortex_purger_delete_requests_chunks_selected_total`: Number of chunks selected while building delete plans per user.
  1011    - `cortex_purger_delete_requests_processing_failures_total`: Number of delete requests processing failures per user.
  1012  * [ENHANCEMENT] Single Binary: Added query-frontend to the single binary.  Single binary users will now benefit from various query-frontend features.  Primarily: sharding, parallelization, load shedding, additional caching (if configured), and query retries. #2437
  1013  * [ENHANCEMENT] Allow 1w (where w denotes week) and 1y (where y denotes year) when setting `-store.cache-lookups-older-than` and `-store.max-look-back-period`. #2454
  1014  * [ENHANCEMENT] Optimize index queries for matchers using "a|b|c"-type regex. #2446 #2475
  1015  * [ENHANCEMENT] Added per tenant metrics for queries and chunks and bytes read from chunk store: #2463
  1016    * `cortex_chunk_store_fetched_chunks_total` and `cortex_chunk_store_fetched_chunk_bytes_total`
  1017    * `cortex_query_frontend_queries_total` (per tenant queries counted by the frontend)
  1018  * [ENHANCEMENT] WAL: New metrics `cortex_ingester_wal_logged_bytes_total` and `cortex_ingester_checkpoint_logged_bytes_total` added to track total bytes logged to disk for WAL and checkpoints. #2497
  1019  * [ENHANCEMENT] Add de-duplicated chunks counter `cortex_chunk_store_deduped_chunks_total` which counts every chunk not sent to the store because it was already sent by another replica. #2485
  1020  * [ENHANCEMENT] Query-frontend now also logs the POST data of long queries. #2481
  1021  * [ENHANCEMENT] WAL: Ingester WAL records now have type header and the custom WAL records have been replaced by Prometheus TSDB's WAL records. Old records will not be supported from 1.3 onwards. Note: once this is deployed, you cannot downgrade without data loss. #2436
  1022  * [ENHANCEMENT] Redis Cache: Added `idle_timeout`, `wait_on_pool_exhaustion` and `max_conn_lifetime` options to redis cache configuration. #2550
  1023  * [ENHANCEMENT] WAL: the experimental tag has been removed on the WAL in ingesters. #2560
  1024  * [ENHANCEMENT] Use newer AWS API for paginated queries - removes 'Deprecated' message from logfiles. #2452
  1025  * [ENHANCEMENT] Experimental memberlist: Add retry with backoff on memberlist join other members. #2705
  1026  * [ENHANCEMENT] Experimental TSDB: when the store-gateway sharding is enabled, unhealthy store-gateway instances are automatically removed from the ring after 10 consecutive `-experimental.store-gateway.sharding-ring.heartbeat-timeout` periods. #2526
  1027  * [BUGFIX] Ruler: Ensure temporary rule files with special characters are properly mapped and cleaned up. #2506
  1028  * [BUGFIX] Ensure requests are properly routed to the prometheus api embedded in the query if `-server.path-prefix` is set. Fixes #2411. #2372
  1029  * [BUGFIX] Experimental TSDB: Fixed chunk data corruption when querying back series using the experimental blocks storage. #2400
  1030  * [BUGFIX] Cassandra Storage: Fix endpoint TLS host verification. #2109
  1031  * [BUGFIX] Experimental TSDB: Fixed response status code from `422` to `500` when an error occurs while iterating chunks with the experimental blocks storage. #2402
  1032  * [BUGFIX] Ring: Fixed a situation where upgrading from pre-1.0 cortex with a rolling strategy caused new 1.0 ingesters to lose their zone value in the ring until manually forced to re-register. #2404
  1033  * [BUGFIX] Distributor: `/all_user_stats` now show API and Rule Ingest Rate correctly. #2457
  1034  * [BUGFIX] Fixed `version`, `revision` and `branch` labels exported by the `cortex_build_info` metric. #2468
  1035  * [BUGFIX] QueryFrontend: fixed a situation where span context missed when downstream_url is used. #2539
  1036  * [BUGFIX] Querier: Fixed a situation where querier would crash because of an unresponsive frontend instance. #2569
  1037  
  1038  ## 1.0.1 / 2020-04-23
  1039  
  1040  * [BUGFIX] Fix gaps when querying ingesters with replication factor = 3 and 2 ingesters in the cluster. #2503
  1041  
  1042  ## 1.0.0 / 2020-04-02
  1043  
  1044  This is the first major release of Cortex. We made a lot of **breaking changes** in this release which have been detailed below. Please also see the stability guarantees we provide as part of a major release: https://cortexmetrics.io/docs/configuration/v1guarantees/
  1045  
  1046  * [CHANGE] Remove the following deprecated flags: #2339
  1047    - `-metrics.error-rate-query` (use `-metrics.write-throttle-query` instead).
  1048    - `-store.cardinality-cache-size` (use `-store.index-cache-read.enable-fifocache` and `-store.index-cache-read.fifocache.size` instead).
  1049    - `-store.cardinality-cache-validity` (use `-store.index-cache-read.enable-fifocache` and `-store.index-cache-read.fifocache.duration` instead).
  1050    - `-distributor.limiter-reload-period` (flag unused)
  1051    - `-ingester.claim-on-rollout` (flag unused)
  1052    - `-ingester.normalise-tokens` (flag unused)
  1053  * [CHANGE] Renamed YAML file options to be more consistent. See [full config file changes below](#config-file-breaking-changes). #2273
  1054  * [CHANGE] AWS based autoscaling has been removed. You can only use metrics based autoscaling now. `-applicationautoscaling.url` has been removed. See https://cortexmetrics.io/docs/production/aws/#dynamodb-capacity-provisioning on how to migrate. #2328
  1055  * [CHANGE] Renamed the `memcache.write-back-goroutines` and `memcache.write-back-buffer` flags to `background.write-back-concurrency` and `background.write-back-buffer`. This affects the following flags: #2241
  1056    - `-frontend.memcache.write-back-buffer` --> `-frontend.background.write-back-buffer`
  1057    - `-frontend.memcache.write-back-goroutines` --> `-frontend.background.write-back-concurrency`
  1058    - `-store.index-cache-read.memcache.write-back-buffer` --> `-store.index-cache-read.background.write-back-buffer`
  1059    - `-store.index-cache-read.memcache.write-back-goroutines` --> `-store.index-cache-read.background.write-back-concurrency`
  1060    - `-store.index-cache-write.memcache.write-back-buffer` --> `-store.index-cache-write.background.write-back-buffer`
  1061    - `-store.index-cache-write.memcache.write-back-goroutines` --> `-store.index-cache-write.background.write-back-concurrency`
  1062    - `-memcache.write-back-buffer` --> `-store.chunks-cache.background.write-back-buffer`. Note the next change log for the difference.
  1063    - `-memcache.write-back-goroutines` --> `-store.chunks-cache.background.write-back-concurrency`. Note the next change log for the difference.
  1064  
  1065  * [CHANGE] Renamed the chunk cache flags to have `store.chunks-cache.` as prefix. This means the following flags have been changed: #2241
  1066    - `-cache.enable-fifocache` --> `-store.chunks-cache.cache.enable-fifocache`
  1067    - `-default-validity` --> `-store.chunks-cache.default-validity`
  1068    - `-fifocache.duration` --> `-store.chunks-cache.fifocache.duration`
  1069    - `-fifocache.size` --> `-store.chunks-cache.fifocache.size`
  1070    - `-memcache.write-back-buffer` --> `-store.chunks-cache.background.write-back-buffer`. Note the previous change log for the difference.
  1071    - `-memcache.write-back-goroutines` --> `-store.chunks-cache.background.write-back-concurrency`. Note the previous change log for the difference.
  1072    - `-memcached.batchsize` --> `-store.chunks-cache.memcached.batchsize`
  1073    - `-memcached.consistent-hash` --> `-store.chunks-cache.memcached.consistent-hash`
  1074    - `-memcached.expiration` --> `-store.chunks-cache.memcached.expiration`
  1075    - `-memcached.hostname` --> `-store.chunks-cache.memcached.hostname`
  1076    - `-memcached.max-idle-conns` --> `-store.chunks-cache.memcached.max-idle-conns`
  1077    - `-memcached.parallelism` --> `-store.chunks-cache.memcached.parallelism`
  1078    - `-memcached.service` --> `-store.chunks-cache.memcached.service`
  1079    - `-memcached.timeout` --> `-store.chunks-cache.memcached.timeout`
  1080    - `-memcached.update-interval` --> `-store.chunks-cache.memcached.update-interval`
  1081    - `-redis.enable-tls` --> `-store.chunks-cache.redis.enable-tls`
  1082    - `-redis.endpoint` --> `-store.chunks-cache.redis.endpoint`
  1083    - `-redis.expiration` --> `-store.chunks-cache.redis.expiration`
  1084    - `-redis.max-active-conns` --> `-store.chunks-cache.redis.max-active-conns`
  1085    - `-redis.max-idle-conns` --> `-store.chunks-cache.redis.max-idle-conns`
  1086    - `-redis.password` --> `-store.chunks-cache.redis.password`
  1087    - `-redis.timeout` --> `-store.chunks-cache.redis.timeout`
  1088  * [CHANGE] Rename the `-store.chunk-cache-stubs` to `-store.chunks-cache.cache-stubs` to be more inline with above. #2241
  1089  * [CHANGE] Change prefix of flags `-dynamodb.periodic-table.*` to `-table-manager.index-table.*`. #2359
  1090  * [CHANGE] Change prefix of flags `-dynamodb.chunk-table.*` to `-table-manager.chunk-table.*`. #2359
  1091  * [CHANGE] Change the following flags: #2359
  1092    - `-dynamodb.poll-interval` --> `-table-manager.poll-interval`
  1093    - `-dynamodb.periodic-table.grace-period` --> `-table-manager.periodic-table.grace-period`
  1094  * [CHANGE] Renamed the following flags: #2273
  1095    - `-dynamodb.chunk.gang.size` --> `-dynamodb.chunk-gang-size`
  1096    - `-dynamodb.chunk.get.max.parallelism` --> `-dynamodb.chunk-get-max-parallelism`
  1097  * [CHANGE] Don't support mixed time units anymore for duration. For example, 168h5m0s doesn't work anymore, please use just one unit (s|m|h|d|w|y). #2252
  1098  * [CHANGE] Utilize separate protos for rule state and storage. Experimental ruler API will not be functional until the rollout is complete. #2226
  1099  * [CHANGE] Frontend worker in querier now starts after all Querier module dependencies are started. This fixes issue where frontend worker started to send queries to querier before it was ready to serve them (mostly visible when using experimental blocks storage). #2246
  1100  * [CHANGE] Lifecycler component now enters Failed state on errors, and doesn't exit the process. (Important if you're vendoring Cortex and use Lifecycler) #2251
  1101  * [CHANGE] `/ready` handler now returns 200 instead of 204. #2330
  1102  * [CHANGE] Better defaults for the following options: #2344
  1103    - `-<prefix>.consul.consistent-reads`: Old default: `true`, new default: `false`. This reduces the load on Consul.
  1104    - `-<prefix>.consul.watch-rate-limit`: Old default: 0, new default: 1. This rate limits the reads to 1 per second. Which is good enough for ring watches.
  1105    - `-distributor.health-check-ingesters`: Old default: `false`, new default: `true`.
  1106    - `-ingester.max-stale-chunk-idle`: Old default: 0, new default: 2m. This lets us expire series that we know are stale early.
  1107    - `-ingester.spread-flushes`: Old default: false, new default: true. This allows to better de-duplicate data and use less space.
  1108    - `-ingester.chunk-age-jitter`: Old default: 20mins, new default: 0. This is to enable the `-ingester.spread-flushes` to true.
  1109    - `-<prefix>.memcached.batchsize`: Old default: 0, new default: 1024. This allows batching of requests and keeps the concurrent requests low.
  1110    - `-<prefix>.memcached.consistent-hash`: Old default: false, new default: true. This allows for better cache hits when the memcaches are scaled up and down.
  1111    - `-querier.batch-iterators`: Old default: false, new default: true.
  1112    - `-querier.ingester-streaming`: Old default: false, new default: true.
  1113  * [CHANGE] Experimental TSDB: Added `-experimental.tsdb.bucket-store.postings-cache-compression-enabled` to enable postings compression when storing to cache. #2335
  1114  * [CHANGE] Experimental TSDB: Added `-compactor.deletion-delay`, which is time before a block marked for deletion is deleted from bucket. If not 0, blocks will be marked for deletion and compactor component will delete blocks marked for deletion from the bucket. If delete-delay is 0, blocks will be deleted straight away. Note that deleting blocks immediately can cause query failures, if store gateway / querier still has the block loaded, or compactor is ignoring the deletion because it's compacting the block at the same time. Default value is 48h. #2335
  1115  * [CHANGE] Experimental TSDB: Added `-experimental.tsdb.bucket-store.index-cache.postings-compression-enabled`, to set duration after which the blocks marked for deletion will be filtered out while fetching blocks used for querying. This option allows querier to ignore blocks that are marked for deletion with some delay. This ensures store can still serve blocks that are meant to be deleted but do not have a replacement yet. Default is 24h, half of the default value for `-compactor.deletion-delay`. #2335
  1116  * [CHANGE] Experimental TSDB: Added `-experimental.tsdb.bucket-store.index-cache.memcached.max-item-size` to control maximum size of item that is stored to memcached. Defaults to 1 MiB. #2335
  1117  * [FEATURE] Added experimental storage API to the ruler service that is enabled when the `-experimental.ruler.enable-api` is set to true #2269
  1118    * `-ruler.storage.type` flag now allows `s3`,`gcs`, and `azure` values
  1119    * `-ruler.storage.(s3|gcs|azure)` flags exist to allow the configuration of object clients set for rule storage
  1120  * [CHANGE] Renamed table manager metrics. #2307 #2359
  1121    * `cortex_dynamo_sync_tables_seconds` -> `cortex_table_manager_sync_duration_seconds`
  1122    * `cortex_dynamo_table_capacity_units` -> `cortex_table_capacity_units`
  1123  * [FEATURE] Flusher target to flush the WAL. #2075
  1124    * `-flusher.wal-dir` for the WAL directory to recover from.
  1125    * `-flusher.concurrent-flushes` for number of concurrent flushes.
  1126    * `-flusher.flush-op-timeout` is duration after which a flush should timeout.
  1127  * [FEATURE] Ingesters can now have an optional availability zone set, to ensure metric replication is distributed across zones. This is set via the `-ingester.availability-zone` flag or the `availability_zone` field in the config file. #2317
  1128  * [ENHANCEMENT] Better re-use of connections to DynamoDB and S3. #2268
  1129  * [ENHANCEMENT] Reduce number of goroutines used while executing a single index query. #2280
  1130  * [ENHANCEMENT] Experimental TSDB: Add support for local `filesystem` backend. #2245
  1131  * [ENHANCEMENT] Experimental TSDB: Added memcached support for the TSDB index cache. #2290
  1132  * [ENHANCEMENT] Experimental TSDB: Removed gRPC server to communicate between querier and BucketStore. #2324
  1133  * [ENHANCEMENT] Allow 1w (where w denotes week) and 1y (where y denotes year) when setting table period and retention. #2252
  1134  * [ENHANCEMENT] Added FIFO cache metrics for current number of entries and memory usage. #2270
  1135  * [ENHANCEMENT] Output all config fields to /config API, including those with empty value. #2209
  1136  * [ENHANCEMENT] Add "missing_metric_name" and "metric_name_invalid" reasons to cortex_discarded_samples_total metric. #2346
  1137  * [ENHANCEMENT] Experimental TSDB: sample ingestion errors are now reported via existing `cortex_discarded_samples_total` metric. #2370
  1138  * [BUGFIX] Ensure user state metrics are updated if a transfer fails. #2338
  1139  * [BUGFIX] Fixed etcd client keepalive settings. #2278
  1140  * [BUGFIX] Register the metrics of the WAL. #2295
  1141  * [BUXFIX] Experimental TSDB: fixed error handling when ingesting out of bound samples. #2342
  1142  
  1143  ### Known issues
  1144  
  1145  - This experimental blocks storage in Cortex `1.0.0` has a bug which may lead to the error `cannot iterate chunk for series` when running queries. This bug has been fixed in #2400. If you're running the experimental blocks storage, please build Cortex from `master`.
  1146  
  1147  ### Config file breaking changes
  1148  
  1149  In this section you can find a config file diff showing the breaking changes introduced in Cortex. You can also find the [full configuration file reference doc](https://cortexmetrics.io/docs/configuration/configuration-file/) in the website.
  1150  
  1151  ```diff
  1152  ### ingester_config
  1153  
  1154   # Period with which to attempt to flush chunks.
  1155   # CLI flag: -ingester.flush-period
  1156  -[flushcheckperiod: <duration> | default = 1m0s]
  1157  +[flush_period: <duration> | default = 1m0s]
  1158  
  1159   # Period chunks will remain in memory after flushing.
  1160   # CLI flag: -ingester.retain-period
  1161  -[retainperiod: <duration> | default = 5m0s]
  1162  +[retain_period: <duration> | default = 5m0s]
  1163  
  1164   # Maximum chunk idle time before flushing.
  1165   # CLI flag: -ingester.max-chunk-idle
  1166  -[maxchunkidle: <duration> | default = 5m0s]
  1167  +[max_chunk_idle_time: <duration> | default = 5m0s]
  1168  
  1169   # Maximum chunk idle time for chunks terminating in stale markers before
  1170   # flushing. 0 disables it and a stale series is not flushed until the
  1171   # max-chunk-idle timeout is reached.
  1172   # CLI flag: -ingester.max-stale-chunk-idle
  1173  -[maxstalechunkidle: <duration> | default = 0s]
  1174  +[max_stale_chunk_idle_time: <duration> | default = 2m0s]
  1175  
  1176   # Timeout for individual flush operations.
  1177   # CLI flag: -ingester.flush-op-timeout
  1178  -[flushoptimeout: <duration> | default = 1m0s]
  1179  +[flush_op_timeout: <duration> | default = 1m0s]
  1180  
  1181   # Maximum chunk age before flushing.
  1182   # CLI flag: -ingester.max-chunk-age
  1183  -[maxchunkage: <duration> | default = 12h0m0s]
  1184  +[max_chunk_age: <duration> | default = 12h0m0s]
  1185  
  1186  -# Range of time to subtract from MaxChunkAge to spread out flushes
  1187  +# Range of time to subtract from -ingester.max-chunk-age to spread out flushes
  1188   # CLI flag: -ingester.chunk-age-jitter
  1189  -[chunkagejitter: <duration> | default = 20m0s]
  1190  +[chunk_age_jitter: <duration> | default = 0]
  1191  
  1192   # Number of concurrent goroutines flushing to dynamodb.
  1193   # CLI flag: -ingester.concurrent-flushes
  1194  -[concurrentflushes: <int> | default = 50]
  1195  +[concurrent_flushes: <int> | default = 50]
  1196  
  1197  -# If true, spread series flushes across the whole period of MaxChunkAge
  1198  +# If true, spread series flushes across the whole period of
  1199  +# -ingester.max-chunk-age.
  1200   # CLI flag: -ingester.spread-flushes
  1201  -[spreadflushes: <boolean> | default = false]
  1202  +[spread_flushes: <boolean> | default = true]
  1203  
  1204   # Period with which to update the per-user ingestion rates.
  1205   # CLI flag: -ingester.rate-update-period
  1206  -[rateupdateperiod: <duration> | default = 15s]
  1207  +[rate_update_period: <duration> | default = 15s]
  1208  
  1209  
  1210  ### querier_config
  1211  
  1212   # The maximum number of concurrent queries.
  1213   # CLI flag: -querier.max-concurrent
  1214  -[maxconcurrent: <int> | default = 20]
  1215  +[max_concurrent: <int> | default = 20]
  1216  
  1217   # Use batch iterators to execute query, as opposed to fully materialising the
  1218   # series in memory.  Takes precedent over the -querier.iterators flag.
  1219   # CLI flag: -querier.batch-iterators
  1220  -[batchiterators: <boolean> | default = false]
  1221  +[batch_iterators: <boolean> | default = true]
  1222  
  1223   # Use streaming RPCs to query ingester.
  1224   # CLI flag: -querier.ingester-streaming
  1225  -[ingesterstreaming: <boolean> | default = false]
  1226  +[ingester_streaming: <boolean> | default = true]
  1227  
  1228   # Maximum number of samples a single query can load into memory.
  1229   # CLI flag: -querier.max-samples
  1230  -[maxsamples: <int> | default = 50000000]
  1231  +[max_samples: <int> | default = 50000000]
  1232  
  1233   # The default evaluation interval or step size for subqueries.
  1234   # CLI flag: -querier.default-evaluation-interval
  1235  -[defaultevaluationinterval: <duration> | default = 1m0s]
  1236  +[default_evaluation_interval: <duration> | default = 1m0s]
  1237  
  1238  ### query_frontend_config
  1239  
  1240   # URL of downstream Prometheus.
  1241   # CLI flag: -frontend.downstream-url
  1242  -[downstream: <string> | default = ""]
  1243  +[downstream_url: <string> | default = ""]
  1244  
  1245  
  1246  ### ruler_config
  1247  
  1248   # URL of alerts return path.
  1249   # CLI flag: -ruler.external.url
  1250  -[externalurl: <url> | default = ]
  1251  +[external_url: <url> | default = ]
  1252  
  1253   # How frequently to evaluate rules
  1254   # CLI flag: -ruler.evaluation-interval
  1255  -[evaluationinterval: <duration> | default = 1m0s]
  1256  +[evaluation_interval: <duration> | default = 1m0s]
  1257  
  1258   # How frequently to poll for rule changes
  1259   # CLI flag: -ruler.poll-interval
  1260  -[pollinterval: <duration> | default = 1m0s]
  1261  +[poll_interval: <duration> | default = 1m0s]
  1262  
  1263  -storeconfig:
  1264  +storage:
  1265  
  1266   # file path to store temporary rule files for the prometheus rule managers
  1267   # CLI flag: -ruler.rule-path
  1268  -[rulepath: <string> | default = "/rules"]
  1269  +[rule_path: <string> | default = "/rules"]
  1270  
  1271   # URL of the Alertmanager to send notifications to.
  1272   # CLI flag: -ruler.alertmanager-url
  1273  -[alertmanagerurl: <url> | default = ]
  1274  +[alertmanager_url: <url> | default = ]
  1275  
  1276   # Use DNS SRV records to discover alertmanager hosts.
  1277   # CLI flag: -ruler.alertmanager-discovery
  1278  -[alertmanagerdiscovery: <boolean> | default = false]
  1279  +[enable_alertmanager_discovery: <boolean> | default = false]
  1280  
  1281   # How long to wait between refreshing alertmanager hosts.
  1282   # CLI flag: -ruler.alertmanager-refresh-interval
  1283  -[alertmanagerrefreshinterval: <duration> | default = 1m0s]
  1284  +[alertmanager_refresh_interval: <duration> | default = 1m0s]
  1285  
  1286   # If enabled requests to alertmanager will utilize the V2 API.
  1287   # CLI flag: -ruler.alertmanager-use-v2
  1288  -[alertmanangerenablev2api: <boolean> | default = false]
  1289  +[enable_alertmanager_v2: <boolean> | default = false]
  1290  
  1291   # Capacity of the queue for notifications to be sent to the Alertmanager.
  1292   # CLI flag: -ruler.notification-queue-capacity
  1293  -[notificationqueuecapacity: <int> | default = 10000]
  1294  +[notification_queue_capacity: <int> | default = 10000]
  1295  
  1296   # HTTP timeout duration when sending notifications to the Alertmanager.
  1297   # CLI flag: -ruler.notification-timeout
  1298  -[notificationtimeout: <duration> | default = 10s]
  1299  +[notification_timeout: <duration> | default = 10s]
  1300  
  1301   # Distribute rule evaluation using ring backend
  1302   # CLI flag: -ruler.enable-sharding
  1303  -[enablesharding: <boolean> | default = false]
  1304  +[enable_sharding: <boolean> | default = false]
  1305  
  1306   # Time to spend searching for a pending ruler when shutting down.
  1307   # CLI flag: -ruler.search-pending-for
  1308  -[searchpendingfor: <duration> | default = 5m0s]
  1309  +[search_pending_for: <duration> | default = 5m0s]
  1310  
  1311   # Period with which to attempt to flush rule groups.
  1312   # CLI flag: -ruler.flush-period
  1313  -[flushcheckperiod: <duration> | default = 1m0s]
  1314  +[flush_period: <duration> | default = 1m0s]
  1315  
  1316  ### alertmanager_config
  1317  
  1318   # Base path for data storage.
  1319   # CLI flag: -alertmanager.storage.path
  1320  -[datadir: <string> | default = "data/"]
  1321  +[data_dir: <string> | default = "data/"]
  1322  
  1323   # will be used to prefix all HTTP endpoints served by Alertmanager. If omitted,
  1324   # relevant URL components will be derived automatically.
  1325   # CLI flag: -alertmanager.web.external-url
  1326  -[externalurl: <url> | default = ]
  1327  +[external_url: <url> | default = ]
  1328  
  1329   # How frequently to poll Cortex configs
  1330   # CLI flag: -alertmanager.configs.poll-interval
  1331  -[pollinterval: <duration> | default = 15s]
  1332  +[poll_interval: <duration> | default = 15s]
  1333  
  1334   # Listen address for cluster.
  1335   # CLI flag: -cluster.listen-address
  1336  -[clusterbindaddr: <string> | default = "0.0.0.0:9094"]
  1337  +[cluster_bind_address: <string> | default = "0.0.0.0:9094"]
  1338  
  1339   # Explicit address to advertise in cluster.
  1340   # CLI flag: -cluster.advertise-address
  1341  -[clusteradvertiseaddr: <string> | default = ""]
  1342  +[cluster_advertise_address: <string> | default = ""]
  1343  
  1344   # Time to wait between peers to send notifications.
  1345   # CLI flag: -cluster.peer-timeout
  1346  -[peertimeout: <duration> | default = 15s]
  1347  +[peer_timeout: <duration> | default = 15s]
  1348  
  1349   # Filename of fallback config to use if none specified for instance.
  1350   # CLI flag: -alertmanager.configs.fallback
  1351  -[fallbackconfigfile: <string> | default = ""]
  1352  +[fallback_config_file: <string> | default = ""]
  1353  
  1354   # Root of URL to generate if config is http://internal.monitor
  1355   # CLI flag: -alertmanager.configs.auto-webhook-root
  1356  -[autowebhookroot: <string> | default = ""]
  1357  +[auto_webhook_root: <string> | default = ""]
  1358  
  1359  ### table_manager_config
  1360  
  1361  -store:
  1362  +storage:
  1363  
  1364  -# How frequently to poll DynamoDB to learn our capacity.
  1365  -# CLI flag: -dynamodb.poll-interval
  1366  -[dynamodb_poll_interval: <duration> | default = 2m0s]
  1367  +# How frequently to poll backend to learn our capacity.
  1368  +# CLI flag: -table-manager.poll-interval
  1369  +[poll_interval: <duration> | default = 2m0s]
  1370  
  1371  -# DynamoDB periodic tables grace period (duration which table will be
  1372  -# created/deleted before/after it's needed).
  1373  -# CLI flag: -dynamodb.periodic-table.grace-period
  1374  +# Periodic tables grace period (duration which table will be created/deleted
  1375  +# before/after it's needed).
  1376  +# CLI flag: -table-manager.periodic-table.grace-period
  1377   [creation_grace_period: <duration> | default = 10m0s]
  1378  
  1379   index_tables_provisioning:
  1380     # Enables on demand throughput provisioning for the storage provider (if
  1381  -  # supported). Applies only to tables which are not autoscaled
  1382  -  # CLI flag: -dynamodb.periodic-table.enable-ondemand-throughput-mode
  1383  -  [provisioned_throughput_on_demand_mode: <boolean> | default = false]
  1384  +  # supported). Applies only to tables which are not autoscaled. Supported by
  1385  +  # DynamoDB
  1386  +  # CLI flag: -table-manager.index-table.enable-ondemand-throughput-mode
  1387  +  [enable_ondemand_throughput_mode: <boolean> | default = false]
  1388  
  1389  
  1390     # Enables on demand throughput provisioning for the storage provider (if
  1391  -  # supported). Applies only to tables which are not autoscaled
  1392  -  # CLI flag: -dynamodb.periodic-table.inactive-enable-ondemand-throughput-mode
  1393  -  [inactive_throughput_on_demand_mode: <boolean> | default = false]
  1394  +  # supported). Applies only to tables which are not autoscaled. Supported by
  1395  +  # DynamoDB
  1396  +  # CLI flag: -table-manager.index-table.inactive-enable-ondemand-throughput-mode
  1397  +  [enable_inactive_throughput_on_demand_mode: <boolean> | default = false]
  1398  
  1399  
  1400   chunk_tables_provisioning:
  1401     # Enables on demand throughput provisioning for the storage provider (if
  1402  -  # supported). Applies only to tables which are not autoscaled
  1403  -  # CLI flag: -dynamodb.chunk-table.enable-ondemand-throughput-mode
  1404  -  [provisioned_throughput_on_demand_mode: <boolean> | default = false]
  1405  +  # supported). Applies only to tables which are not autoscaled. Supported by
  1406  +  # DynamoDB
  1407  +  # CLI flag: -table-manager.chunk-table.enable-ondemand-throughput-mode
  1408  +  [enable_ondemand_throughput_mode: <boolean> | default = false]
  1409  
  1410  ### storage_config
  1411  
  1412   aws:
  1413  -  dynamodbconfig:
  1414  +  dynamodb:
  1415       # DynamoDB endpoint URL with escaped Key and Secret encoded. If only region
  1416       # is specified as a host, proper endpoint will be deduced. Use
  1417       # inmemory:///<table-name> to use a mock in-memory implementation.
  1418       # CLI flag: -dynamodb.url
  1419  -    [dynamodb: <url> | default = ]
  1420  +    [dynamodb_url: <url> | default = ]
  1421  
  1422       # DynamoDB table management requests per second limit.
  1423       # CLI flag: -dynamodb.api-limit
  1424  -    [apilimit: <float> | default = 2]
  1425  +    [api_limit: <float> | default = 2]
  1426  
  1427       # DynamoDB rate cap to back off when throttled.
  1428       # CLI flag: -dynamodb.throttle-limit
  1429  -    [throttlelimit: <float> | default = 10]
  1430  +    [throttle_limit: <float> | default = 10]
  1431  -
  1432  -    # ApplicationAutoscaling endpoint URL with escaped Key and Secret encoded.
  1433  -    # CLI flag: -applicationautoscaling.url
  1434  -    [applicationautoscaling: <url> | default = ]
  1435  
  1436  
  1437         # Queue length above which we will scale up capacity
  1438         # CLI flag: -metrics.target-queue-length
  1439  -      [targetqueuelen: <int> | default = 100000]
  1440  +      [target_queue_length: <int> | default = 100000]
  1441  
  1442         # Scale up capacity by this multiple
  1443         # CLI flag: -metrics.scale-up-factor
  1444  -      [scaleupfactor: <float> | default = 1.3]
  1445  +      [scale_up_factor: <float> | default = 1.3]
  1446  
  1447         # Ignore throttling below this level (rate per second)
  1448         # CLI flag: -metrics.ignore-throttle-below
  1449  -      [minthrottling: <float> | default = 1]
  1450  +      [ignore_throttle_below: <float> | default = 1]
  1451  
  1452         # query to fetch ingester queue length
  1453         # CLI flag: -metrics.queue-length-query
  1454  -      [queuelengthquery: <string> | default = "sum(avg_over_time(cortex_ingester_flush_queue_length{job=\"cortex/ingester\"}[2m]))"]
  1455  +      [queue_length_query: <string> | default = "sum(avg_over_time(cortex_ingester_flush_queue_length{job=\"cortex/ingester\"}[2m]))"]
  1456  
  1457         # query to fetch throttle rates per table
  1458         # CLI flag: -metrics.write-throttle-query
  1459  -      [throttlequery: <string> | default = "sum(rate(cortex_dynamo_throttled_total{operation=\"DynamoDB.BatchWriteItem\"}[1m])) by (table) > 0"]
  1460  +      [write_throttle_query: <string> | default = "sum(rate(cortex_dynamo_throttled_total{operation=\"DynamoDB.BatchWriteItem\"}[1m])) by (table) > 0"]
  1461  
  1462         # query to fetch write capacity usage per table
  1463         # CLI flag: -metrics.usage-query
  1464  -      [usagequery: <string> | default = "sum(rate(cortex_dynamo_consumed_capacity_total{operation=\"DynamoDB.BatchWriteItem\"}[15m])) by (table) > 0"]
  1465  +      [write_usage_query: <string> | default = "sum(rate(cortex_dynamo_consumed_capacity_total{operation=\"DynamoDB.BatchWriteItem\"}[15m])) by (table) > 0"]
  1466  
  1467         # query to fetch read capacity usage per table
  1468         # CLI flag: -metrics.read-usage-query
  1469  -      [readusagequery: <string> | default = "sum(rate(cortex_dynamo_consumed_capacity_total{operation=\"DynamoDB.QueryPages\"}[1h])) by (table) > 0"]
  1470  +      [read_usage_query: <string> | default = "sum(rate(cortex_dynamo_consumed_capacity_total{operation=\"DynamoDB.QueryPages\"}[1h])) by (table) > 0"]
  1471  
  1472         # query to fetch read errors per table
  1473         # CLI flag: -metrics.read-error-query
  1474  -      [readerrorquery: <string> | default = "sum(increase(cortex_dynamo_failures_total{operation=\"DynamoDB.QueryPages\",error=\"ProvisionedThroughputExceededException\"}[1m])) by (table) > 0"]
  1475  +      [read_error_query: <string> | default = "sum(increase(cortex_dynamo_failures_total{operation=\"DynamoDB.QueryPages\",error=\"ProvisionedThroughputExceededException\"}[1m])) by (table) > 0"]
  1476  
  1477       # Number of chunks to group together to parallelise fetches (zero to
  1478       # disable)
  1479  -    # CLI flag: -dynamodb.chunk.gang.size
  1480  -    [chunkgangsize: <int> | default = 10]
  1481  +    # CLI flag: -dynamodb.chunk-gang-size
  1482  +    [chunk_gang_size: <int> | default = 10]
  1483  
  1484       # Max number of chunk-get operations to start in parallel
  1485  -    # CLI flag: -dynamodb.chunk.get.max.parallelism
  1486  -    [chunkgetmaxparallelism: <int> | default = 32]
  1487  +    # CLI flag: -dynamodb.chunk.get-max-parallelism
  1488  +    [chunk_get_max_parallelism: <int> | default = 32]
  1489  
  1490       backoff_config:
  1491         # Minimum delay when backing off.
  1492         # CLI flag: -bigtable.backoff-min-period
  1493  -      [minbackoff: <duration> | default = 100ms]
  1494  +      [min_period: <duration> | default = 100ms]
  1495  
  1496         # Maximum delay when backing off.
  1497         # CLI flag: -bigtable.backoff-max-period
  1498  -      [maxbackoff: <duration> | default = 10s]
  1499  +      [max_period: <duration> | default = 10s]
  1500  
  1501         # Number of times to backoff and retry before failing.
  1502         # CLI flag: -bigtable.backoff-retries
  1503  -      [maxretries: <int> | default = 10]
  1504  +      [max_retries: <int> | default = 10]
  1505  
  1506     # If enabled, once a tables info is fetched, it is cached.
  1507     # CLI flag: -bigtable.table-cache.enabled
  1508  -  [tablecacheenabled: <boolean> | default = true]
  1509  +  [table_cache_enabled: <boolean> | default = true]
  1510  
  1511     # Duration to cache tables before checking again.
  1512     # CLI flag: -bigtable.table-cache.expiration
  1513  -  [tablecacheexpiration: <duration> | default = 30m0s]
  1514  +  [table_cache_expiration: <duration> | default = 30m0s]
  1515  
  1516   # Cache validity for active index entries. Should be no higher than
  1517   # -ingester.max-chunk-idle.
  1518   # CLI flag: -store.index-cache-validity
  1519  -[indexcachevalidity: <duration> | default = 5m0s]
  1520  +[index_cache_validity: <duration> | default = 5m0s]
  1521  
  1522  ### ingester_client_config
  1523  
  1524   grpc_client_config:
  1525     backoff_config:
  1526       # Minimum delay when backing off.
  1527       # CLI flag: -ingester.client.backoff-min-period
  1528  -    [minbackoff: <duration> | default = 100ms]
  1529  +    [min_period: <duration> | default = 100ms]
  1530  
  1531       # Maximum delay when backing off.
  1532       # CLI flag: -ingester.client.backoff-max-period
  1533  -    [maxbackoff: <duration> | default = 10s]
  1534  +    [max_period: <duration> | default = 10s]
  1535  
  1536       # Number of times to backoff and retry before failing.
  1537       # CLI flag: -ingester.client.backoff-retries
  1538  -    [maxretries: <int> | default = 10]
  1539  +    [max_retries: <int> | default = 10]
  1540  
  1541  ### frontend_worker_config
  1542  
  1543  -# Address of query frontend service.
  1544  +# Address of query frontend service, in host:port format.
  1545   # CLI flag: -querier.frontend-address
  1546  -[address: <string> | default = ""]
  1547  +[frontend_address: <string> | default = ""]
  1548  
  1549   # How often to query DNS.
  1550   # CLI flag: -querier.dns-lookup-period
  1551  -[dnslookupduration: <duration> | default = 10s]
  1552  +[dns_lookup_duration: <duration> | default = 10s]
  1553  
  1554   grpc_client_config:
  1555     backoff_config:
  1556       # Minimum delay when backing off.
  1557       # CLI flag: -querier.frontend-client.backoff-min-period
  1558  -    [minbackoff: <duration> | default = 100ms]
  1559  +    [min_period: <duration> | default = 100ms]
  1560  
  1561       # Maximum delay when backing off.
  1562       # CLI flag: -querier.frontend-client.backoff-max-period
  1563  -    [maxbackoff: <duration> | default = 10s]
  1564  +    [max_period: <duration> | default = 10s]
  1565  
  1566       # Number of times to backoff and retry before failing.
  1567       # CLI flag: -querier.frontend-client.backoff-retries
  1568  -    [maxretries: <int> | default = 10]
  1569  +    [max_retries: <int> | default = 10]
  1570  
  1571  ### consul_config
  1572  
  1573   # ACL Token used to interact with Consul.
  1574  -# CLI flag: -<prefix>.consul.acltoken
  1575  -[acltoken: <string> | default = ""]
  1576  +# CLI flag: -<prefix>.consul.acl-token
  1577  +[acl_token: <string> | default = ""]
  1578  
  1579   # HTTP timeout when talking to Consul
  1580   # CLI flag: -<prefix>.consul.client-timeout
  1581  -[httpclienttimeout: <duration> | default = 20s]
  1582  +[http_client_timeout: <duration> | default = 20s]
  1583  
  1584   # Enable consistent reads to Consul.
  1585   # CLI flag: -<prefix>.consul.consistent-reads
  1586  -[consistentreads: <boolean> | default = true]
  1587  +[consistent_reads: <boolean> | default = false]
  1588  
  1589   # Rate limit when watching key or prefix in Consul, in requests per second. 0
  1590   # disables the rate limit.
  1591   # CLI flag: -<prefix>.consul.watch-rate-limit
  1592  -[watchkeyratelimit: <float> | default = 0]
  1593  +[watch_rate_limit: <float> | default = 1]
  1594  
  1595   # Burst size used in rate limit. Values less than 1 are treated as 1.
  1596   # CLI flag: -<prefix>.consul.watch-burst-size
  1597  -[watchkeyburstsize: <int> | default = 1]
  1598  +[watch_burst_size: <int> | default = 1]
  1599  
  1600  
  1601  ### configstore_config
  1602   # URL of configs API server.
  1603   # CLI flag: -<prefix>.configs.url
  1604  -[configsapiurl: <url> | default = ]
  1605  +[configs_api_url: <url> | default = ]
  1606  
  1607   # Timeout for requests to Weave Cloud configs service.
  1608   # CLI flag: -<prefix>.configs.client-timeout
  1609  -[clienttimeout: <duration> | default = 5s]
  1610  +[client_timeout: <duration> | default = 5s]
  1611  ```
  1612  
  1613  ## 0.7.0 / 2020-03-16
  1614  
  1615  Cortex `0.7.0` is a major step forward the upcoming `1.0` release. In this release, we've got 164 contributions from 26 authors. Thanks to all contributors! ❤️
  1616  
  1617  Please be aware that Cortex `0.7.0` introduces some **breaking changes**. You're encouraged to read all the `[CHANGE]` entries below before upgrading your Cortex cluster. In particular:
  1618  
  1619  - Cleaned up some configuration options in preparation for the Cortex `1.0.0` release (see also the [annotated config file breaking changes](#annotated-config-file-breaking-changes) below):
  1620    - Removed CLI flags support to configure the schema (see [how to migrate from flags to schema file](https://cortexmetrics.io/docs/configuration/schema-configuration/#migrating-from-flags-to-schema-file))
  1621    - Renamed CLI flag `-config-yaml` to `-schema-config-file`
  1622    - Removed CLI flag `-store.min-chunk-age` in favor of `-querier.query-store-after`. The corresponding YAML config option `ingestermaxquerylookback` has been renamed to [`query_ingesters_within`](https://cortexmetrics.io/docs/configuration/configuration-file/#querier-config)
  1623    - Deprecated CLI flag `-frontend.cache-split-interval` in favor of `-querier.split-queries-by-interval`
  1624    - Renamed the YAML config option `defaul_validity` to `default_validity`
  1625    - Removed the YAML config option `config_store` (in the [`alertmanager YAML config`](https://cortexmetrics.io/docs/configuration/configuration-file/#alertmanager-config)) in favor of `store`
  1626    - Removed the YAML config root block `configdb` in favor of [`configs`](https://cortexmetrics.io/docs/configuration/configuration-file/#configs-config). This change is also reflected in the following CLI flags renaming:
  1627        * `-database.*` -> `-configs.database.*`
  1628        * `-database.migrations` -> `-configs.database.migrations-dir`
  1629    - Removed the fluentd-based billing infrastructure including the CLI flags:
  1630        * `-distributor.enable-billing`
  1631        * `-billing.max-buffered-events`
  1632        * `-billing.retry-delay`
  1633        * `-billing.ingester`
  1634  - Removed support for using denormalised tokens in the ring. Before upgrading, make sure your Cortex cluster is already running `v0.6.0` or an earlier version with `-ingester.normalise-tokens=true`
  1635  
  1636  ### Full changelog
  1637  
  1638  * [CHANGE] Removed support for flags to configure schema. Further, the flag for specifying the config file (`-config-yaml`) has been deprecated. Please use `-schema-config-file`. See the [Schema Configuration documentation](https://cortexmetrics.io/docs/configuration/schema-configuration/) for more details on how to configure the schema using the YAML file. #2221
  1639  * [CHANGE] In the config file, the root level `config_store` config option has been moved to `alertmanager` > `store` > `configdb`. #2125
  1640  * [CHANGE] Removed unnecessary `frontend.cache-split-interval` in favor of `querier.split-queries-by-interval` both to reduce configuration complexity and guarantee alignment of these two configs. Starting from now, `-querier.cache-results` may only be enabled in conjunction with `-querier.split-queries-by-interval` (previously the cache interval default was `24h` so if you want to preserve the same behaviour you should set `-querier.split-queries-by-interval=24h`). #2040
  1641  * [CHANGE] Renamed Configs configuration options. #2187
  1642    * configuration options
  1643      * `-database.*` -> `-configs.database.*`
  1644      * `-database.migrations` -> `-configs.database.migrations-dir`
  1645    * config file
  1646      * `configdb.uri:` -> `configs.database.uri:`
  1647      * `configdb.migrationsdir:` -> `configs.database.migrations_dir:`
  1648      * `configdb.passwordfile:` -> `configs.database.password_file:`
  1649  * [CHANGE] Moved `-store.min-chunk-age` to the Querier config as `-querier.query-store-after`, allowing the store to be skipped during query time if the metrics wouldn't be found. The YAML config option `ingestermaxquerylookback` has been renamed to `query_ingesters_within` to match its CLI flag. #1893
  1650  * [CHANGE] Renamed the cache configuration setting `defaul_validity` to `default_validity`. #2140
  1651  * [CHANGE] Remove fluentd-based billing infrastructure and flags such as `-distributor.enable-billing`. #1491
  1652  * [CHANGE] Removed remaining support for using denormalised tokens in the ring. If you're still running ingesters with denormalised tokens (Cortex 0.4 or earlier, with `-ingester.normalise-tokens=false`), such ingesters will now be completely invisible to distributors and need to be either switched to Cortex 0.6.0 or later, or be configured to use normalised tokens. #2034
  1653  * [CHANGE] The frontend http server will now send 502 in case of deadline exceeded and 499 if the user requested cancellation. #2156
  1654  * [CHANGE] We now enforce queries to be up to `-querier.max-query-into-future` into the future (defaults to 10m). #1929
  1655    * `-store.min-chunk-age` has been removed
  1656    * `-querier.query-store-after` has been added in it's place.
  1657  * [CHANGE] Removed unused `/validate_expr endpoint`. #2152
  1658  * [CHANGE] Updated Prometheus dependency to v2.16.0. This Prometheus version uses Active Query Tracker to limit concurrent queries. In order to keep `-querier.max-concurrent` working, Active Query Tracker is enabled by default, and is configured to store its data to `active-query-tracker` directory (relative to current directory when Cortex started). This can be changed by using `-querier.active-query-tracker-dir` option. Purpose of Active Query Tracker is to log queries that were running when Cortex crashes. This logging happens on next Cortex start. #2088
  1659  * [CHANGE] Default to BigChunk encoding; may result in slightly higher disk usage if many timeseries have a constant value, but should generally result in fewer, bigger chunks. #2207
  1660  * [CHANGE] WAL replays are now done while the rest of Cortex is starting, and more specifically, when HTTP server is running. This makes it possible to scrape metrics during WAL replays. Applies to both chunks and experimental blocks storage. #2222
  1661  * [CHANGE] Cortex now has `/ready` probe for all services, not just ingester and querier as before. In single-binary mode, /ready reports 204 only if all components are running properly. #2166
  1662  * [CHANGE] If you are vendoring Cortex and use its components in your project, be aware that many Cortex components no longer start automatically when they are created. You may want to review PR and attached document. #2166
  1663  * [CHANGE] Experimental TSDB: the querier in-memory index cache used by the experimental blocks storage shifted from per-tenant to per-querier. The `-experimental.tsdb.bucket-store.index-cache-size-bytes` now configures the per-querier index cache max size instead of a per-tenant cache and its default has been increased to 1GB. #2189
  1664  * [CHANGE] Experimental TSDB: TSDB head compaction interval and concurrency is now configurable (defaults to 1 min interval and 5 concurrent head compactions). New options: `-experimental.tsdb.head-compaction-interval` and `-experimental.tsdb.head-compaction-concurrency`. #2172
  1665  * [CHANGE] Experimental TSDB: switched the blocks storage index header to the binary format. This change is expected to have no visible impact, except lower startup times and memory usage in the queriers. It's possible to switch back to the old JSON format via the flag `-experimental.tsdb.bucket-store.binary-index-header-enabled=false`. #2223
  1666  * [CHANGE] Experimental Memberlist KV store can now be used in single-binary Cortex. Attempts to use it previously would fail with panic. This change also breaks existing binary protocol used to exchange gossip messages, so this version will not be able to understand gossiped Ring when used in combination with the previous version of Cortex. Easiest way to upgrade is to shutdown old Cortex installation, and restart it with new version. Incremental rollout works too, but with reduced functionality until all components run the same version. #2016
  1667  * [FEATURE] Added a read-only local alertmanager config store using files named corresponding to their tenant id. #2125
  1668  * [FEATURE] Added flag `-experimental.ruler.enable-api` to enable the ruler api which implements the Prometheus API `/api/v1/rules` and `/api/v1/alerts` endpoints under the configured `-http.prefix`. #1999
  1669  * [FEATURE] Added sharding support to compactor when using the experimental TSDB blocks storage. #2113
  1670  * [FEATURE] Added ability to override YAML config file settings using environment variables. #2147
  1671    * `-config.expand-env`
  1672  * [FEATURE] Added flags to disable Alertmanager notifications methods. #2187
  1673    * `-configs.notifications.disable-email`
  1674    * `-configs.notifications.disable-webhook`
  1675  * [FEATURE] Add /config HTTP endpoint which exposes the current Cortex configuration as YAML. #2165
  1676  * [FEATURE] Allow Prometheus remote write directly to ingesters. #1491
  1677  * [FEATURE] Introduced new standalone service `query-tee` that can be used for testing purposes to send the same Prometheus query to multiple backends (ie. two Cortex clusters ingesting the same metrics) and compare the performances. #2203
  1678  * [FEATURE] Fan out parallelizable queries to backend queriers concurrently. #1878
  1679    * `querier.parallelise-shardable-queries` (bool)
  1680    * Requires a shard-compatible schema (v10+)
  1681    * This causes the number of traces to increase accordingly.
  1682    * The query-frontend now requires a schema config to determine how/when to shard queries, either from a file or from flags (i.e. by the `config-yaml` CLI flag). This is the same schema config the queriers consume. The schema is only required to use this option.
  1683    * It's also advised to increase downstream concurrency controls as well:
  1684      * `querier.max-outstanding-requests-per-tenant`
  1685      * `querier.max-query-parallelism`
  1686      * `querier.max-concurrent`
  1687      * `server.grpc-max-concurrent-streams` (for both query-frontends and queriers)
  1688  * [FEATURE] Added user sub rings to distribute users to a subset of ingesters. #1947
  1689    * `-experimental.distributor.user-subring-size`
  1690  * [FEATURE] Add flag `-experimental.tsdb.stripe-size` to expose TSDB stripe size option. #2185
  1691  * [FEATURE] Experimental Delete Series: Added support for Deleting Series with Prometheus style API. Needs to be enabled first by setting `-purger.enable` to `true`. Deletion only supported when using `boltdb` and `filesystem` as index and object store respectively. Support for other stores to follow in separate PRs #2103
  1692  * [ENHANCEMENT] Alertmanager: Expose Per-tenant alertmanager metrics #2124
  1693  * [ENHANCEMENT] Add `status` label to `cortex_alertmanager_configs` metric to gauge the number of valid and invalid configs. #2125
  1694  * [ENHANCEMENT] Cassandra Authentication: added the `custom_authenticators` config option that allows users to authenticate with cassandra clusters using password authenticators that are not approved by default in [gocql](https://github.com/gocql/gocql/blob/81b8263d9fe526782a588ef94d3fa5c6148e5d67/conn.go#L27) #2093
  1695  * [ENHANCEMENT] Cassandra Storage: added `max_retries`, `retry_min_backoff` and `retry_max_backoff` configuration options to enable retrying recoverable errors. #2054
  1696  * [ENHANCEMENT] Allow to configure HTTP and gRPC server listen address, maximum number of simultaneous connections and connection keepalive settings.
  1697    * `-server.http-listen-address`
  1698    * `-server.http-conn-limit`
  1699    * `-server.grpc-listen-address`
  1700    * `-server.grpc-conn-limit`
  1701    * `-server.grpc.keepalive.max-connection-idle`
  1702    * `-server.grpc.keepalive.max-connection-age`
  1703    * `-server.grpc.keepalive.max-connection-age-grace`
  1704    * `-server.grpc.keepalive.time`
  1705    * `-server.grpc.keepalive.timeout`
  1706  * [ENHANCEMENT] PostgreSQL: Bump up `github.com/lib/pq` from `v1.0.0` to `v1.3.0` to support PostgreSQL SCRAM-SHA-256 authentication. #2097
  1707  * [ENHANCEMENT] Cassandra Storage: User no longer need `CREATE` privilege on `<all keyspaces>` if given keyspace exists. #2032
  1708  * [ENHANCEMENT] Cassandra Storage: added `password_file` configuration options to enable reading Cassandra password from file. #2096
  1709  * [ENHANCEMENT] Configs API: Allow GET/POST configs in YAML format. #2181
  1710  * [ENHANCEMENT] Background cache writes are batched to improve parallelism and observability. #2135
  1711  * [ENHANCEMENT] Add automatic repair for checkpoint and WAL. #2105
  1712  * [ENHANCEMENT] Support `lastEvaluation` and `evaluationTime` in `/api/v1/rules` endpoints and make order of groups stable. #2196
  1713  * [ENHANCEMENT] Skip expired requests in query-frontend scheduling. #2082
  1714  * [ENHANCEMENT] Add ability to configure gRPC keepalive settings. #2066
  1715  * [ENHANCEMENT] Experimental TSDB: Export TSDB Syncer metrics from Compactor component, they are prefixed with `cortex_compactor_`. #2023
  1716  * [ENHANCEMENT] Experimental TSDB: Added dedicated flag `-experimental.tsdb.bucket-store.tenant-sync-concurrency` to configure the maximum number of concurrent tenants for which blocks are synched. #2026
  1717  * [ENHANCEMENT] Experimental TSDB: Expose metrics for objstore operations (prefixed with `cortex_<component>_thanos_objstore_`, component being one of `ingester`, `querier` and `compactor`). #2027
  1718  * [ENHANCEMENT] Experimental TSDB: Added support for Azure Storage to be used for block storage, in addition to S3 and GCS. #2083
  1719  * [ENHANCEMENT] Experimental TSDB: Reduced memory allocations in the ingesters when using the experimental blocks storage. #2057
  1720  * [ENHANCEMENT] Experimental Memberlist KV: expose `-memberlist.gossip-to-dead-nodes-time` and `-memberlist.dead-node-reclaim-time` options to control how memberlist library handles dead nodes and name reuse. #2131
  1721  * [BUGFIX] Alertmanager: fixed panic upon applying a new config, caused by duplicate metrics registration in the `NewPipelineBuilder` function. #211
  1722  * [BUGFIX] Azure Blob ChunkStore: Fixed issue causing `invalid chunk checksum` errors. #2074
  1723  * [BUGFIX] The gauge `cortex_overrides_last_reload_successful` is now only exported by components that use a `RuntimeConfigManager`. Previously, for components that do not initialize a `RuntimeConfigManager` (such as the compactor) the gauge was initialized with 0 (indicating error state) and then never updated, resulting in a false-negative permanent error state. #2092
  1724  * [BUGFIX] Fixed WAL metric names, added the `cortex_` prefix.
  1725  * [BUGFIX] Restored histogram `cortex_configs_request_duration_seconds` #2138
  1726  * [BUGFIX] Fix wrong syntax for `url` in config-file-reference. #2148
  1727  * [BUGFIX] Fixed some 5xx status code returned by the query-frontend when they should actually be 4xx. #2122
  1728  * [BUGFIX] Fixed leaked goroutines in the querier. #2070
  1729  * [BUGFIX] Experimental TSDB: fixed `/all_user_stats` and `/api/prom/user_stats` endpoints when using the experimental TSDB blocks storage. #2042
  1730  * [BUGFIX] Experimental TSDB: fixed ruler to correctly work with the experimental TSDB blocks storage. #2101
  1731  
  1732  ### Changes to denormalised tokens in the ring
  1733  
  1734  Cortex 0.4.0 is the last version that can *write* denormalised tokens. Cortex 0.5.0 and above always write normalised tokens.
  1735  
  1736  Cortex 0.6.0 is the last version that can *read* denormalised tokens. Starting with Cortex 0.7.0 only normalised tokens are supported, and ingesters writing denormalised tokens to the ring (running Cortex 0.4.0 or earlier with `-ingester.normalise-tokens=false`) are ignored by distributors. Such ingesters should either switch to using normalised tokens, or be upgraded to Cortex 0.5.0 or later.
  1737  
  1738  ### Known issues
  1739  
  1740  - The gRPC streaming for ingesters doesn't work when using the experimental TSDB blocks storage. Please do not enable `-querier.ingester-streaming` if you're using the TSDB blocks storage. If you want to enable it, you can build Cortex from `master` given the issue has been fixed after Cortex `0.7` branch has been cut and the fix wasn't included in the `0.7` because related to an experimental feature.
  1741  
  1742  ### Annotated config file breaking changes
  1743  
  1744  In this section you can find a config file diff showing the breaking changes introduced in Cortex `0.7`. You can also find the [full configuration file reference doc](https://cortexmetrics.io/docs/configuration/configuration-file/) in the website.
  1745  
  1746   ```diff
  1747  ### Root level config
  1748  
  1749   # "configdb" has been moved to "alertmanager > store > configdb".
  1750  -[configdb: <configdb_config>]
  1751  
  1752   # "config_store" has been renamed to "configs".
  1753  -[config_store: <configstore_config>]
  1754  +[configs: <configs_config>]
  1755  
  1756  
  1757  ### `distributor_config`
  1758  
  1759   # The support to hook an external billing system has been removed.
  1760  -[enable_billing: <boolean> | default = false]
  1761  -billing:
  1762  -  [maxbufferedevents: <int> | default = 1024]
  1763  -  [retrydelay: <duration> | default = 500ms]
  1764  -  [ingesterhostport: <string> | default = "localhost:24225"]
  1765  
  1766  
  1767  ### `querier_config`
  1768  
  1769   # "ingestermaxquerylookback" has been renamed to "query_ingesters_within".
  1770  -[ingestermaxquerylookback: <duration> | default = 0s]
  1771  +[query_ingesters_within: <duration> | default = 0s]
  1772  
  1773  
  1774  ### `queryrange_config`
  1775  
  1776  results_cache:
  1777    cache:
  1778       # "defaul_validity" has been renamed to "default_validity".
  1779  -    [defaul_validity: <duration> | default = 0s]
  1780  +    [default_validity: <duration> | default = 0s]
  1781  
  1782     # "cache_split_interval" has been deprecated in favor of "split_queries_by_interval".
  1783  -  [cache_split_interval: <duration> | default = 24h0m0s]
  1784  
  1785  
  1786  ### `alertmanager_config`
  1787  
  1788  # The "store" config block has been added. This includes "configdb" which previously
  1789  # was the "configdb" root level config block.
  1790  +store:
  1791  +  [type: <string> | default = "configdb"]
  1792  +  [configdb: <configstore_config>]
  1793  +  local:
  1794  +    [path: <string> | default = ""]
  1795  
  1796  
  1797  ### `storage_config`
  1798  
  1799  index_queries_cache_config:
  1800     # "defaul_validity" has been renamed to "default_validity".
  1801  -  [defaul_validity: <duration> | default = 0s]
  1802  +  [default_validity: <duration> | default = 0s]
  1803  
  1804  
  1805  ### `chunk_store_config`
  1806  
  1807  chunk_cache_config:
  1808     # "defaul_validity" has been renamed to "default_validity".
  1809  -  [defaul_validity: <duration> | default = 0s]
  1810  +  [default_validity: <duration> | default = 0s]
  1811  
  1812  write_dedupe_cache_config:
  1813     # "defaul_validity" has been renamed to "default_validity".
  1814  -  [defaul_validity: <duration> | default = 0s]
  1815  +  [default_validity: <duration> | default = 0s]
  1816  
  1817   # "min_chunk_age" has been removed in favor of "querier > query_store_after".
  1818  -[min_chunk_age: <duration> | default = 0s]
  1819  
  1820  
  1821  ### `configs_config`
  1822  
  1823  -# "uri" has been moved to "database > uri".
  1824  -[uri: <string> | default = "postgres://postgres@configs-db.weave.local/configs?sslmode=disable"]
  1825  
  1826  -# "migrationsdir" has been moved to "database > migrations_dir".
  1827  -[migrationsdir: <string> | default = ""]
  1828  
  1829  -# "passwordfile" has been moved to "database > password_file".
  1830  -[passwordfile: <string> | default = ""]
  1831  
  1832  +database:
  1833  +  [uri: <string> | default = "postgres://postgres@configs-db.weave.local/configs?sslmode=disable"]
  1834  +  [migrations_dir: <string> | default = ""]
  1835  +  [password_file: <string> | default = ""]
  1836  ```
  1837  
  1838  ## 0.6.1 / 2020-02-05
  1839  
  1840  * [BUGFIX] Fixed parsing of the WAL configuration when specified in the YAML config file. #2071
  1841  
  1842  ## 0.6.0 / 2020-01-28
  1843  
  1844  Note that the ruler flags need to be changed in this upgrade. You're moving from a single node ruler to something that might need to be sharded.
  1845  Further, if you're using the configs service, we've upgraded the migration library and this requires some manual intervention. See full instructions below to upgrade your PostgreSQL.
  1846  
  1847  * [CHANGE] The frontend component now does not cache results if it finds a `Cache-Control` header and if one of its values is `no-store`. #1974
  1848  * [CHANGE] Flags changed with transition to upstream Prometheus rules manager:
  1849    * `-ruler.client-timeout` is now `ruler.configs.client-timeout` in order to match `ruler.configs.url`.
  1850    * `-ruler.group-timeout`has been removed.
  1851    * `-ruler.num-workers` has been removed.
  1852    * `-ruler.rule-path` has been added to specify where the prometheus rule manager will sync rule files.
  1853    * `-ruler.storage.type` has beem added to specify the rule store backend type, currently only the configdb.
  1854    * `-ruler.poll-interval` has been added to specify the interval in which to poll new rule groups.
  1855    * `-ruler.evaluation-interval` default value has changed from `15s` to `1m` to match the default evaluation interval in Prometheus.
  1856    * Ruler sharding requires a ring which can be configured via the ring flags prefixed by `ruler.ring.`. #1987
  1857  * [CHANGE] Use relative links from /ring page to make it work when used behind reverse proxy. #1896
  1858  * [CHANGE] Deprecated `-distributor.limiter-reload-period` flag. #1766
  1859  * [CHANGE] Ingesters now write only normalised tokens to the ring, although they can still read denormalised tokens used by other ingesters. `-ingester.normalise-tokens` is now deprecated, and ignored. If you want to switch back to using denormalised tokens, you need to downgrade to Cortex 0.4.0. Previous versions don't handle claiming tokens from normalised ingesters correctly. #1809
  1860  * [CHANGE] Overrides mechanism has been renamed to "runtime config", and is now separate from limits. Runtime config is simply a file that is reloaded by Cortex every couple of seconds. Limits and now also multi KV use this mechanism.<br />New arguments were introduced: `-runtime-config.file` (defaults to empty) and `-runtime-config.reload-period` (defaults to 10 seconds), which replace previously used `-limits.per-user-override-config` and `-limits.per-user-override-period` options. Old options are still used if `-runtime-config.file` is not specified. This change is also reflected in YAML configuration, where old `limits.per_tenant_override_config` and `limits.per_tenant_override_period` fields are replaced with `runtime_config.file` and `runtime_config.period` respectively. #1749
  1861  * [CHANGE] Cortex now rejects data with duplicate labels. Previously, such data was accepted, with duplicate labels removed with only one value left. #1964
  1862  * [CHANGE] Changed the default value for `-distributor.ha-tracker.prefix` from `collectors/` to `ha-tracker/` in order to not clash with other keys (ie. ring) stored in the same key-value store. #1940
  1863  * [FEATURE] Experimental: Write-Ahead-Log added in ingesters for more data reliability against ingester crashes. #1103
  1864    * `--ingester.wal-enabled`: Setting this to `true` enables writing to WAL during ingestion.
  1865    * `--ingester.wal-dir`: Directory where the WAL data should be stored and/or recovered from.
  1866    * `--ingester.checkpoint-enabled`: Set this to `true` to enable checkpointing of in-memory chunks to disk.
  1867    * `--ingester.checkpoint-duration`: This is the interval at which checkpoints should be created.
  1868    * `--ingester.recover-from-wal`: Set this to `true` to recover data from an existing WAL.
  1869    * For more information, please checkout the ["Ingesters with WAL" guide](https://cortexmetrics.io/docs/guides/ingesters-with-wal/).
  1870  * [FEATURE] The distributor can now drop labels from samples (similar to the removal of the replica label for HA ingestion) per user via the `distributor.drop-label` flag. #1726
  1871  * [FEATURE] Added flag `debug.mutex-profile-fraction` to enable mutex profiling #1969
  1872  * [FEATURE] Added `global` ingestion rate limiter strategy. Deprecated `-distributor.limiter-reload-period` flag. #1766
  1873  * [FEATURE] Added support for Microsoft Azure blob storage to be used for storing chunk data. #1913
  1874  * [FEATURE] Added readiness probe endpoint`/ready` to queriers. #1934
  1875  * [FEATURE] Added "multi" KV store that can interact with two other KV stores, primary one for all reads and writes, and secondary one, which only receives writes. Primary/secondary store can be modified in runtime via runtime-config mechanism (previously "overrides"). #1749
  1876  * [FEATURE] Added support to store ring tokens to a file and read it back on startup, instead of generating/fetching the tokens to/from the ring. This feature can be enabled with the flag `-ingester.tokens-file-path`. #1750
  1877  * [FEATURE] Experimental TSDB: Added `/series` API endpoint support with TSDB blocks storage. #1830
  1878  * [FEATURE] Experimental TSDB: Added TSDB blocks `compactor` component, which iterates over users blocks stored in the bucket and compact them according to the configured block ranges. #1942
  1879  * [ENHANCEMENT] metric `cortex_ingester_flush_reasons` gets a new `reason` value: `Spread`, when `-ingester.spread-flushes` option is enabled. #1978
  1880  * [ENHANCEMENT] Added `password` and `enable_tls` options to redis cache configuration. Enables usage of Microsoft Azure Cache for Redis service. #1923
  1881  * [ENHANCEMENT] Upgraded Kubernetes API version for deployments from `extensions/v1beta1` to `apps/v1`. #1941
  1882  * [ENHANCEMENT] Experimental TSDB: Open existing TSDB on startup to prevent ingester from becoming ready before it can accept writes. The max concurrency is set via `--experimental.tsdb.max-tsdb-opening-concurrency-on-startup`. #1917
  1883  * [ENHANCEMENT] Experimental TSDB: Querier now exports aggregate metrics from Thanos bucket store and in memory index cache (many metrics to list, but all have `cortex_querier_bucket_store_` or `cortex_querier_blocks_index_cache_` prefix). #1996
  1884  * [ENHANCEMENT] Experimental TSDB: Improved multi-tenant bucket store. #1991
  1885    * Allowed to configure the blocks sync interval via `-experimental.tsdb.bucket-store.sync-interval` (0 disables the sync)
  1886    * Limited the number of tenants concurrently synched by `-experimental.tsdb.bucket-store.block-sync-concurrency`
  1887    * Renamed `cortex_querier_sync_seconds` metric to `cortex_querier_blocks_sync_seconds`
  1888    * Track `cortex_querier_blocks_sync_seconds` metric for the initial sync too
  1889  * [BUGFIX] Fixed unnecessary CAS operations done by the HA tracker when the jitter is enabled. #1861
  1890  * [BUGFIX] Fixed ingesters getting stuck in a LEAVING state after coming up from an ungraceful exit. #1921
  1891  * [BUGFIX] Reduce memory usage when ingester Push() errors. #1922
  1892  * [BUGFIX] Table Manager: Fixed calculation of expected tables and creation of tables from next active schema considering grace period. #1976
  1893  * [BUGFIX] Experimental TSDB: Fixed ingesters consistency during hand-over when using experimental TSDB blocks storage. #1854 #1818
  1894  * [BUGFIX] Experimental TSDB: Fixed metrics when using experimental TSDB blocks storage. #1981 #1982 #1990 #1983
  1895  * [BUGFIX] Experimental memberlist: Use the advertised address when sending packets to other peers of the Gossip memberlist. #1857
  1896  * [BUGFIX] Experimental TSDB: Fixed incorrect query results introduced in #2604 caused by a buffer incorrectly reused while iterating samples. #2697
  1897  
  1898  ### Upgrading PostgreSQL (if you're using configs service)
  1899  
  1900  Reference: <https://github.com/golang-migrate/migrate/tree/master/database/postgres#upgrading-from-v1>
  1901  
  1902  1. Install the migrate package cli tool: <https://github.com/golang-migrate/migrate/tree/master/cmd/migrate#installation>
  1903  2. Drop the `schema_migrations` table: `DROP TABLE schema_migrations;`.
  1904  2. Run the migrate command:
  1905  
  1906  ```bash
  1907  migrate  -path <absolute_path_to_cortex>/cmd/cortex/migrations -database postgres://localhost:5432/database force 2
  1908  ```
  1909  
  1910  ### Known issues
  1911  
  1912  - The `cortex_prometheus_rule_group_last_evaluation_timestamp_seconds` metric, tracked by the ruler, is not unregistered for rule groups not being used anymore. This issue will be fixed in the next Cortex release (see [2033](https://github.com/cortexproject/cortex/issues/2033)).
  1913  
  1914  - Write-Ahead-Log (WAL) does not have automatic repair of corrupt checkpoint or WAL segments, which is possible if ingester crashes abruptly or the underlying disk corrupts. Currently the only way to resolve this is to manually delete the affected checkpoint and/or WAL segments. Automatic repair will be added in the future releases.
  1915  
  1916  ## 0.4.0 / 2019-12-02
  1917  
  1918  * [CHANGE] The frontend component has been refactored to be easier to re-use. When upgrading the frontend, cache entries will be discarded and re-created with the new protobuf schema. #1734
  1919  * [CHANGE] Removed direct DB/API access from the ruler. `-ruler.configs.url` has been now deprecated. #1579
  1920  * [CHANGE] Removed `Delta` encoding. Any old chunks with `Delta` encoding cannot be read anymore. If `ingester.chunk-encoding` is set to `Delta` the ingester will fail to start. #1706
  1921  * [CHANGE] Setting `-ingester.max-transfer-retries` to 0 now disables hand-over when ingester is shutting down. Previously, zero meant infinite number of attempts. #1771
  1922  * [CHANGE] `dynamo` has been removed as a valid storage name to make it consistent for all components. `aws` and `aws-dynamo` remain as valid storage names.
  1923  * [CHANGE/FEATURE] The frontend split and cache intervals can now be configured using the respective flag `--querier.split-queries-by-interval` and `--frontend.cache-split-interval`.
  1924    * If `--querier.split-queries-by-interval` is not provided request splitting is disabled by default.
  1925    * __`--querier.split-queries-by-day` is still accepted for backward compatibility but has been deprecated. You should now use `--querier.split-queries-by-interval`. We recommend a to use a multiple of 24 hours.__
  1926  * [FEATURE] Global limit on the max series per user and metric #1760
  1927    * `-ingester.max-global-series-per-user`
  1928    * `-ingester.max-global-series-per-metric`
  1929    * Requires `-distributor.replication-factor` and `-distributor.shard-by-all-labels` set for the ingesters too
  1930  * [FEATURE] Flush chunks with stale markers early with `ingester.max-stale-chunk-idle`. #1759
  1931  * [FEATURE] EXPERIMENTAL: Added new KV Store backend based on memberlist library. Components can gossip about tokens and ingester states, instead of using Consul or Etcd. #1721
  1932  * [FEATURE] EXPERIMENTAL: Use TSDB in the ingesters & flush blocks to S3/GCS ala Thanos. This will let us use an Object Store more efficiently and reduce costs. #1695
  1933  * [FEATURE] Allow Query Frontend to log slow queries with `frontend.log-queries-longer-than`. #1744
  1934  * [FEATURE] Add HTTP handler to trigger ingester flush & shutdown - used when running as a stateful set with the WAL enabled.  #1746
  1935  * [FEATURE] EXPERIMENTAL: Added GCS support to TSDB blocks storage. #1772
  1936  * [ENHANCEMENT] Reduce memory allocations in the write path. #1706
  1937  * [ENHANCEMENT] Consul client now follows recommended practices for blocking queries wrt returned Index value. #1708
  1938  * [ENHANCEMENT] Consul client can optionally rate-limit itself during Watch (used e.g. by ring watchers) and WatchPrefix (used by HA feature) operations. Rate limiting is disabled by default. New flags added: `--consul.watch-rate-limit`, and `--consul.watch-burst-size`. #1708
  1939  * [ENHANCEMENT] Added jitter to HA deduping heartbeats, configure using `distributor.ha-tracker.update-timeout-jitter-max` #1534
  1940  * [ENHANCEMENT] Add ability to flush chunks with stale markers early. #1759
  1941  * [BUGFIX] Stop reporting successful actions as 500 errors in KV store metrics. #1798
  1942  * [BUGFIX] Fix bug where duplicate labels can be returned through metadata APIs. #1790
  1943  * [BUGFIX] Fix reading of old, v3 chunk data. #1779
  1944  * [BUGFIX] Now support IAM roles in service accounts in AWS EKS. #1803
  1945  * [BUGFIX] Fixed duplicated series returned when querying both ingesters and store with the experimental TSDB blocks storage. #1778
  1946  
  1947  In this release we updated the following dependencies:
  1948  
  1949  - gRPC v1.25.0  (resulted in a drop of 30% CPU usage when compression is on)
  1950  - jaeger-client v2.20.0
  1951  - aws-sdk-go to v1.25.22
  1952  
  1953  ## 0.3.0 / 2019-10-11
  1954  
  1955  This release adds support for Redis as an alternative to Memcached, and also includes many optimisations which reduce CPU and memory usage.
  1956  
  1957  * [CHANGE] Gauge metrics were renamed to drop the `_total` suffix. #1685
  1958    * In Alertmanager, `alertmanager_configs_total` is now `alertmanager_configs`
  1959    * In Ruler, `scheduler_configs_total` is now `scheduler_configs`
  1960    * `scheduler_groups_total` is now `scheduler_groups`.
  1961  * [CHANGE] `--alertmanager.configs.auto-slack-root` flag was dropped as auto Slack root is not supported anymore. #1597
  1962  * [CHANGE] In table-manager, default DynamoDB capacity was reduced from 3,000 units to 1,000 units. We recommend you do not run with the defaults: find out what figures are needed for your environment and set that via `-dynamodb.periodic-table.write-throughput` and `-dynamodb.chunk-table.write-throughput`.
  1963  * [FEATURE] Add Redis support for caching #1612
  1964  * [FEATURE] Allow spreading chunk writes across multiple S3 buckets #1625
  1965  * [FEATURE] Added `/shutdown` endpoint for ingester to shutdown all operations of the ingester. #1746
  1966  * [ENHANCEMENT] Upgraded Prometheus to 2.12.0 and Alertmanager to 0.19.0. #1597
  1967  * [ENHANCEMENT] Cortex is now built with Go 1.13 #1675, #1676, #1679
  1968  * [ENHANCEMENT] Many optimisations, mostly impacting ingester and querier: #1574, #1624, #1638, #1644, #1649, #1654, #1702
  1969  
  1970  Full list of changes: <https://github.com/cortexproject/cortex/compare/v0.2.0...v0.3.0>
  1971  
  1972  ## 0.2.0 / 2019-09-05
  1973  
  1974  This release has several exciting features, the most notable of them being setting `-ingester.spread-flushes` to potentially reduce your storage space by upto 50%.
  1975  
  1976  * [CHANGE] Flags changed due to changes upstream in Prometheus Alertmanager #929:
  1977    * `alertmanager.mesh.listen-address` is now `cluster.listen-address`
  1978    * `alertmanager.mesh.peer.host` and `alertmanager.mesh.peer.service` can be replaced by `cluster.peer`
  1979    * `alertmanager.mesh.hardware-address`, `alertmanager.mesh.nickname`, `alertmanager.mesh.password`, and `alertmanager.mesh.peer.refresh-interval` all disappear.
  1980  * [CHANGE] --claim-on-rollout flag deprecated; feature is now always on #1566
  1981  * [CHANGE] Retention period must now be a multiple of periodic table duration #1564
  1982  * [CHANGE] The value for the name label for the chunks memcache in all `cortex_cache_` metrics is now `chunksmemcache` (before it was `memcache`) #1569
  1983  * [FEATURE] Makes the ingester flush each timeseries at a specific point in the max-chunk-age cycle with `-ingester.spread-flushes`. This means multiple replicas of a chunk are very likely to contain the same contents which cuts chunk storage space by up to 66%. #1578
  1984  * [FEATURE] Make minimum number of chunk samples configurable per user #1620
  1985  * [FEATURE] Honor HTTPS for custom S3 URLs #1603
  1986  * [FEATURE] You can now point the query-frontend at a normal Prometheus for parallelisation and caching #1441
  1987  * [FEATURE] You can now specify `http_config` on alert receivers #929
  1988  * [FEATURE] Add option to use jump hashing to load balance requests to memcached #1554
  1989  * [FEATURE] Add status page for HA tracker to distributors #1546
  1990  * [FEATURE] The distributor ring page is now easier to read with alternate rows grayed out #1621
  1991  
  1992  ## 0.1.0 / 2019-08-07
  1993  
  1994  * [CHANGE] HA Tracker flags were renamed to provide more clarity #1465
  1995    * `distributor.accept-ha-labels` is now `distributor.ha-tracker.enable`
  1996    * `distributor.accept-ha-samples` is now `distributor.ha-tracker.enable-for-all-users`
  1997    * `ha-tracker.replica` is now `distributor.ha-tracker.replica`
  1998    * `ha-tracker.cluster` is now `distributor.ha-tracker.cluster`
  1999  * [FEATURE] You can specify "heap ballast" to reduce Go GC Churn #1489
  2000  * [BUGFIX] HA Tracker no longer always makes a request to Consul/Etcd when a request is not from the active replica #1516
  2001  * [BUGFIX] Queries are now correctly cancelled by the query-frontend #1508