官术网_书友最值得收藏!

Why open data?

Many books on machine learning use datasets that come with the language install (such as R or Hadoop) or point to public repositories that have considerable visibility in the data science community. The most common ones are Kaggle (especially the Titanic competition) and the UC Irvine's datasets. While these are great datasets and give a common denominator, this book will expose you to datasets that come from government entities. The notion of getting data from government and hacking for social good is typically called open data. I believe that open data will transform how the government interacts with its citizens and will make government entities more efficient and transparent. Therefore, we will use open datasets in this book and hopefully you will consider helping out with the open data movement.

主站蜘蛛池模板: 筠连县| 邓州市| 大丰市| 邳州市| 庆安县| 安国市| 册亨县| 吉林市| 凌云县| 台中市| 南漳县| 武鸣县| 宁津县| 临城县| 安丘市| 铜川市| 襄城县| 邳州市| 和平县| 温宿县| 宝清县| 鄂州市| 清涧县| 峨眉山市| 玉田县| 大丰市| 北辰区| 广水市| 化州市| 郴州市| 礼泉县| 富顺县| 平昌县| 沭阳县| 舒兰市| 石楼县| 海淀区| 元朗区| 新和县| 临湘市| 磐安县|