官术网_书友最值得收藏!

Data abstraction

IoT devices generate mountains of data that must be captured, aggregated, and processed by analytic systems. Preprocessing of IoT-collected data often occurs at the edge, where an initial filter is applied leaving only filtered data to be passed to a data analytic system in the fog or in the cloud.

Preprocessing also includes the classification of data objects. Classification can be done based on the types and/or sensitivities of the data. Metadata is added, which includes tags that represent the security sensitivity and other attributes of the data or the sources that collected the data. For example, any sensitive data that requires confidentiality protections should be tagged as such. At this stage, both data and metadata should be digitally signed.

Data is cleaned and de-duplicated next. The cleansing process includes corrections that must be made based on bad data. Clean data is then input into data models where it can be produced into products and visualizations.

A key consideration within the data life cycle is the need for data lineage assurance. Data lineage tracks the origin of data and the transformations and actions that were applied to that data over time. Data lineage tools can visually represent data flows and movements across a system. There are a number of data lineage tools on the market today. Apache Falcon is an open source data lineage tool that can be applied to IoT systems. You can learn more about Apache Falcon here: https://falcon.apache.org/.

主站蜘蛛池模板: 徐州市| 宽甸| 隆昌县| 大厂| 乌兰察布市| 浪卡子县| 清镇市| 明溪县| 新营市| 鹤壁市| 永年县| 贵阳市| 南投县| 利津县| 宁津县| 南汇区| 故城县| 云浮市| 闵行区| 惠东县| 阿瓦提县| 二连浩特市| 灯塔市| 济宁市| 金溪县| 溧阳市| 珲春市| 康保县| 曲水县| 施甸县| 策勒县| 武定县| 沅江市| 康保县| 拜城县| 梁河县| 福贡县| 大足县| 那坡县| 鱼台县| 哈巴河县|