官术网_书友最值得收藏!

Data gathering

We need to obtain data and organize it appropriately for the current problem (in our example, this could mean building a dataset linking users to songs they've listened to in the past). Depending on the size of the data, we might pick different technologies for storing the data. For example, it might be fine to train on a local machine using scikit-learn if we're working through a few million records. However, if the data doesn't fit on a single computer, then we must consider AWS solutions such as S3 for storage and Apache Spark, or SageMaker's built-in algorithms for model building.

主站蜘蛛池模板: 修水县| 罗江县| 宣威市| 玉树县| 个旧市| 东明县| 济南市| 葫芦岛市| 河间市| 张北县| 玉田县| 安顺市| 弥勒县| 永济市| 湘乡市| 金山区| 沙洋县| 杨浦区| 曲麻莱县| 视频| 蓬莱市| 蓝田县| 灯塔市| 东莞市| 呼玛县| 木兰县| 乌拉特前旗| 正阳县| 雅安市| 科技| 定边县| 类乌齐县| 兴海县| 林州市| 通山县| 随州市| 松阳县| 江西省| 青冈县| 横山县| 千阳县|