官术网_书友最值得收藏!

Traditional machine learning architecture

Structured data, such as transactional, customers, analytical, and market data, usually resides within a local relational database. Given a query language, such as SQL, we can query the data used for processing, as shown in the workflow in the preceding diagram. Usually, all the data can be stored in memory and further processed with a machine learning library such as Weka, Java-ML, or MALLET.

A common practice in the architecture design is to create data pipelines, where different steps in the workflow are split. For instance, in order to create a client data record, we might have to scrap the data from different data sources. The record can be then saved in an intermediate database for further processing.

To understand how the high-level aspects of big data architecture differ, let's first clarify when data is considered big.

主站蜘蛛池模板: 萨嘎县| 疏勒县| 黄梅县| 曲靖市| 如皋市| 乌兰县| 五大连池市| 通许县| 皮山县| 宁波市| 红原县| 安仁县| 石门县| 汉中市| 呼伦贝尔市| 巫溪县| 玛曲县| 沙雅县| 额敏县| 马关县| 绥阳县| 当雄县| 永福县| 公安县| 赤壁市| 鄂托克旗| 布尔津县| 绵竹市| 明水县| 兴安盟| 玉树县| 莎车县| 武城县| 西丰县| 易门县| 山丹县| 东乌| 永城市| 年辖:市辖区| 灵丘县| 孟村|