官术网_书友最值得收藏!

Deploying cloud-based machine learning pipelines

We observe that only a small fraction of real-world ML systems are composed of ML code (the small black box in the figure shown here). However, the infrastructure surrounding this ML code is vast and complex. There are many services offered by cloud providers to provide and manage the infrastructure required for modern machine learning applications:

The following figure illustrates a typical machine learning pipeline at a conceptual level. However, real-life ML pipelines are a lot more complicated with several models being trained, tuned, combined, and so on.

The following figure shows core elements of a typical machine learning application split into two parts—the modeling including model training and deployed model (used on streaming data to output the results):

Typically, the data scientists experiment or do their modeling work in Python and/or R. Their work is then re-implemented in Java/Scala before deployment in a production environment. The enterprise production environments often consist of web servers, application servers, databases, middleware, and so on. The conversion of prototypical models to production-ready models, typically results in additional design and development efforts that lead to delays in rolling out updated models. However, with cloud services and platforms now available, this effort is reduced substantially as we will see in Chapter 8, Designing a Big Data Application.

主站蜘蛛池模板: 磴口县| 固镇县| 乐清市| 伊春市| 镇巴县| 十堰市| 迁西县| 建德市| 兴业县| 牡丹江市| 文化| 资源县| 玉溪市| 亳州市| 易门县| 贡觉县| 福安市| 长阳| 广南县| 泗阳县| 寻甸| 夏津县| 太谷县| 甘肃省| 忻城县| 略阳县| 同心县| 垦利县| 上林县| 遵义市| 达孜县| 青川县| 肃宁县| 且末县| 长顺县| 东平县| 修文县| 钦州市| 南昌县| 酉阳| 上饶县|