官术网_书友最值得收藏!

ML nodes

First and foremost, since Elasticsearch is, by nature, a distributed multi-node solution, it is only natural that the ML feature of the Elastic Stack works as a native plugin that obeys many of the same operational concepts. As described in the documentation, ML can be enabled on any or all nodes, but it is a best practice in a production system to have dedicated ML nodes. This is helpful to optimize the types of resources specifically required by ML. Unlike data nodes that are involved in a fair amount of I/O load due to indexing and searching, ML nodes are more compute and memory intensive. With this knowledge, you can size the hardware appropriately for dedicated ML nodes.

One key thing to note—the ML algorithms do not run in the JVM. They are C++-based executables that will use the RAM that is left over from whatever is allocated for the Java Virtual Machine (JVM) heap. When running a job, the main process that invokes the analysis (called autodetect) can be seen in the process list:



View of top processes when a ML job is running

There will be one autodetect process for every actively running ML job. In multi-node setups, ML will distribute the jobs to each of the ML-enabled nodes to balance the load of the work.

主站蜘蛛池模板: 泾阳县| 大渡口区| 水城县| 汨罗市| 新野县| 右玉县| 五原县| 正定县| 简阳市| 贡嘎县| 乐山市| 江都市| 林周县| 从化市| 冕宁县| 永年县| 定安县| 建昌县| 溆浦县| 锡林郭勒盟| 博野县| 繁昌县| 开阳县| 依安县| 安化县| 姚安县| 石嘴山市| 甘南县| 双辽市| 高邑县| 都兰县| 朝阳区| 天镇县| 贵德县| 丘北县| 金川县| 临清市| 宜黄县| 陆丰市| 黄梅县| 抚松县|