Data mining

In Chapter 1, Transitioning from Data Developer to Data Scientist, we said that with data mining, one is usually more absorbed in the data relationships (or the potential relationships between points of data, sometimes referred to as variables) and in cognitive analysis.

To further define the term: data mining is sometimes referred to more simply as knowledge discovery, or even just discovery. It involves analyzing data from new or different viewpoints and summarizing it into valuable insights that can be used to increase revenue, cut costs, or both.

Using software dedicated to data mining is just one of several analytical approaches to data mining. Although there are tools dedicated to this purpose (such as IBM Cognos BI and Planning Analytics, Tableau, SAS, and so on), data mining is all about the analysis process of finding correlations or patterns among dozens of fields in the data, and that can be accomplished effectively using tools such as MS Excel or any number of open source technologies.

A common approach to data mining is to create custom scripts using tools such as R or Python. In this way, the data scientist can customize the logic and processing to the exact needs of the project.
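As a rough illustration of what such a custom script might look like, the following Python sketch scans every pair of fields in a small table and reports the pairs whose Pearson correlation is strong. The field names, the sample values, the 0.8 threshold, and the helper names pearson and strongest_correlations are all invented for this example; they are assumptions, not part of any particular tool or project.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A constant field has no defined correlation; treat it as 0 here.
        return 0.0
    return cov / sqrt(var_x * var_y)


def strongest_correlations(table, threshold=0.8):
    """List (field_a, field_b, r) for every field pair with |r| >= threshold."""
    results = []
    for a, b in combinations(table, 2):
        r = pearson(table[a], table[b])
        if abs(r) >= threshold:
            results.append((a, b, r))
    # Strongest relationships first.
    return sorted(results, key=lambda item: abs(item[2]), reverse=True)


# Hypothetical sample data: one list of observations per field.
sales = {
    "ad_spend": [10, 20, 30, 40, 50],
    "revenue": [110, 205, 310, 390, 510],
    "returns": [5, 3, 6, 2, 4],
}

for a, b, r in strongest_correlations(sales):
    print(f"{a} vs {b}: r = {r:.2f}")
```

Because the logic lives in a script, the threshold, the correlation measure, or the way fields are paired can be changed to suit the project, which is the kind of customization described above. A strong correlation only flags a relationship worth investigating; it does not by itself show that one field drives the other.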