
Summary

This chapter focused on some rather boring but important tasks that we usually do every day. Importing data is among the first steps of every data science project, so mastering data analysis should start with learning how to load data into the R session efficiently.

But efficiency is an ambiguous term in this sense: loading data should be quick from a technical point of view so as not to waste our time, although coding for long hours to speed up the importing process does not make much sense either.

The chapter gave a general overview of the most popular options for reading text files, interacting with databases, and querying subsets of data in R. You should now be able to deal with the most commonly used data sources, and you can probably also choose which data source would be the ideal candidate for your projects and then run the benchmarks on your own, as we did previously.

The next chapter will extend this knowledge further by providing use cases for fetching data from the Web and different APIs. This simply means that you will be able to use public data in your projects, even if you do not yet have it in binary dataset files or on database backends.
