
Loading the data

This is the same data that we used in Chapter 2, Linear Regression. I've prepared it exactly as before; the only difference is that the features and response are saved as an RData file. You can download that from GitHub: https://github.com/PacktPublishing/Advanced-Machine-Learning-with-R/blob/master/Data/amesDL.RData.

Once you have that in your working directory, load it into the environment:

> load("amesDL.RData")

Notice that you now have four new objects:

  • trained: The training data features
  • tested: The testing data features
  • train_logy: The log of home sale prices for the training data (the response)
  • test_logy: The log of home sale prices for the testing data (the response)
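
A quick way to confirm what a file contained is to capture the value that load() returns, which is a character vector of the restored object names. The sketch below uses a throwaway file and made-up objects (toy_x and toy_y are hypothetical) rather than amesDL.RData, so the names and sizes are illustrative only:

```r
# Toy stand-ins for the real objects; names and values are hypothetical
toy_x <- data.frame(a = 1:3, b = c(0.5, 1.5, 2.5))
toy_y <- c(11.2, 12.1, 11.8)

# Save both objects to a temporary .RData file, then remove them
tmp <- tempfile(fileext = ".RData")
save(toy_x, toy_y, file = tmp)
rm(toy_x, toy_y)

# load() restores the objects and returns their names
loaded <- load(tmp)
print(loaded)

# Check the size of what was restored
dims <- dim(toy_x)
n_y <- length(toy_y)
```

With the real file, the same pattern applied to load("amesDL.RData") should list the four objects described above.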

It is essential to center and scale the data for a neural network (in the prior exercise, all features were either zero or one, which is acceptable). The caret package provides a function for this task. Let's use the training data to compute the mean and standard deviation values, which we will then apply to both the training and test data:

> prep <- caret::preProcess(trained, method = c("center", "scale"))

> trainT <- predict(prep, trained)

This gives us our transformed training data. However, Keras will not accept a data frame as input; it needs an array (here, a matrix) for both the features and the response. This is an easy fix with the data.matrix() function:

> train_logy <- data.matrix(train_logy)

> trainT <- data.matrix(trainT)
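
To see what data.matrix() does, the sketch below converts a small made-up data frame (toy_df and its columns are hypothetical, not part of the Ames data) and checks the result. Numeric columns pass through unchanged, while factor columns are replaced by their internal integer codes, which is worth knowing if any non-numeric features remain:

```r
# Hypothetical data frame with one numeric and one factor column
toy_df <- data.frame(
  sqft   = c(1200, 1850, 990),
  garage = factor(c("no", "yes", "no"))
)

# Convert the data frame to a numeric matrix
toy_mat <- data.matrix(toy_df)

is.matrix(toy_mat)    # a matrix, no longer a data frame
is.numeric(toy_mat)   # every cell is numeric
dim(toy_mat)          # same rows and columns as toy_df
toy_mat[, "garage"]   # factor levels replaced by integer codes
```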

Now, you can just repeat these steps with the test data features:

> testT <- predict(prep, tested)

> testT <- data.matrix(testT)
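
The reason for fitting prep on the training features alone is that the test features must be transformed with the training means and standard deviations, not their own. Here is a minimal base R sketch of that idea, using made-up numbers (toy_train and toy_test are hypothetical) and scale() in place of caret:

```r
# Hypothetical training and test features
toy_train <- data.frame(x1 = c(1, 2, 3, 4), x2 = c(10, 20, 30, 40))
toy_test  <- data.frame(x1 = c(2, 5),       x2 = c(25, 15))

# Learn the centering and scaling values from the training data only
mu    <- colMeans(toy_train)
sigma <- apply(toy_train, 2, sd)

# Apply the same training statistics to both sets
train_scaled <- scale(toy_train, center = mu, scale = sigma)
test_scaled  <- scale(toy_test,  center = mu, scale = sigma)

colMeans(train_scaled)       # zero for every column (up to rounding)
apply(train_scaled, 2, sd)   # one for every column
colMeans(test_scaled)        # not zero here: test data uses training stats
```

Scaling the test set with its own statistics would leak information about the test distribution into the evaluation, which is why the same fitted object is reused for both sets.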

It's about to get interesting.
