  • The DevOps 2.2 Toolkit
  • Viktor Farcic
  • 2021-06-24 18:46:21

Which tool should we choose?

All the tools we listed are (or were) good on their own merits. They differ in many aspects and are similar in others.

Nagios and Sensu served us well in the past. They were designed in a different era and are based on principles that are considered obsolete today. They work well with static clusters and with monolithic applications and services running in predefined locations. The metrics they store (or the lack of them) are not suitable for more complex decision making. We would have a hard time using them as a means to accomplish our goal of operating a scheduler like Docker Swarm running in an auto-scalable cluster. Among the solutions we explored, they are the first ones we should discard. That pair is out; three tools are left to choose from.

The dot-separated metrics format used by Graphite is limiting. Excluding elements of a metric with asterisks (*) is often inadequate for proper filtering, grouping, and other operations. Its query language, when compared with those of InfluxDB and Prometheus, is the main reason we'll discard it.
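To make the limitation more concrete, here is a minimal Python sketch. It does not use Graphite or Prometheus at all; it only imitates the two naming styles. The metric names, the label names (cluster, service, host), and the helper functions match_dotted and match_labels are all hypothetical and exist only for illustration.

```python
from collections import Counter
from fnmatch import fnmatchcase

# Hypothetical Graphite-style names: <cluster>.<service>.<host>.<metric>
dotted = [
    "prod.api.node-1.cpu",
    "prod.api.node-2.cpu",
    "prod.db.node-1.cpu",
    "test.api.node-1.cpu",
]

# The same four series described with labels (hypothetical label names).
labeled = [
    {"name": "cpu", "cluster": "prod", "service": "api", "host": "node-1"},
    {"name": "cpu", "cluster": "prod", "service": "api", "host": "node-2"},
    {"name": "cpu", "cluster": "prod", "service": "db", "host": "node-1"},
    {"name": "cpu", "cluster": "test", "service": "api", "host": "node-1"},
]


def match_dotted(pattern, names):
    """Match segment by segment, so an asterisk never crosses a dot."""
    parts = pattern.split(".")
    matched = []
    for name in names:
        segments = name.split(".")
        if len(segments) == len(parts) and all(
            fnmatchcase(seg, part) for seg, part in zip(segments, parts)
        ):
            matched.append(name)
    return matched


def match_labels(series, **wanted):
    """Keep the series whose labels equal every requested value."""
    return [s for s in series if all(s.get(k) == v for k, v in wanted.items())]


# Dotted names: we must know that "service" is the second segment
# and skip every other position with an asterisk.
api_dotted = match_dotted("*.api.*.cpu", dotted)

# Labels: we name the dimension we care about and ignore the rest.
api_labeled = match_labels(labeled, service="api")

# Grouping by a dimension is a lookup by label name, not by position.
per_service = Counter(s["service"] for s in labeled)
```

With dotted names, every query depends on the position of each segment, and adding a new dimension changes every existing pattern. With labels, the filter and the grouping refer to a dimension by name.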

We're left with InfluxDB and Prometheus as finalists and are facing only minor differences.

InfluxDB and Prometheus are similar in many ways, so the choice is not going to be an easy one. Truth be told, we cannot make a wrong decision. Whichever of the two we choose, the choice will be based on slight differences.

If we did not limit ourselves to open source solutions as the only candidates, the InfluxDB enterprise version could be the winner due to its scalability. However, we will discard it in favor of Prometheus. Prometheus provides a more complete solution. More importantly, it is slowly becoming the de facto standard, at least when working with schedulers. It is a preferred solution in Kubernetes. Docker (and therefore Swarm) is soon going to expose its metrics in Prometheus format. That, in itself, is the tipping point that should make us lean slightly more towards Prometheus.
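For readers who have not seen the Prometheus format, the following sketch parses a few lines written in the Prometheus text exposition style. The sample metrics (demo_containers, demo_up) are made up for illustration and are not actual Docker metric names. The parser is deliberately simplified: it assumes no timestamps and no escaped quotes inside label values.

```python
import re

# Hypothetical sample in the Prometheus text exposition style.
SAMPLE = """\
# HELP demo_containers Number of containers by state (made-up metric).
# TYPE demo_containers gauge
demo_containers{state="running"} 3
demo_containers{state="stopped"} 1
demo_up 1
"""

# metric_name, optional {labels}, then a single value.
LINE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)$')
LABEL = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="([^"]*)"')


def parse(text):
    """Return a list of (name, labels, value) tuples.

    Simplified: comment lines are skipped, and lines with timestamps
    or escaped quotes are not handled.
    """
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        match = LINE.match(line)
        if match is None:
            continue
        name, raw_labels, value = match.groups()
        labels = dict(LABEL.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


samples = parse(SAMPLE)
running = [v for n, lbl, v in samples
           if n == "demo_containers" and lbl.get("state") == "running"]
```

Each sample is a metric name, an optional set of labels, and a value. Any system that exposes metrics in this shape can be scraped by Prometheus without an extra translation layer.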

The decision is made. We'll use Prometheus to store metrics, to query them, and to trigger alerts.
