官术网_书友最值得收藏!

  • Java Data Science Cookbook
  • Rushdi Shams
  • 125字
  • 2021-07-09 18:44:25

Chapter 1. Obtaining and Cleaning Data

In this chapter, we will cover the following recipes:

  • Retrieving all file names from hierarchical directories using Java
  • Retrieving all file names from hierarchical directories using Apache Commons IO
  • Reading contents from text files all at once using Java 8
  • Reading contents from text files all at once using Apache Commons IO
  • Extracting PDF text using Apache Tika
  • Cleaning ASCII text files using Regular Expressions
  • Parsing Comma Separated Value files using Univocity
  • Parsing Tab Separated Value files using Univocity
  • Parsing XML files using JDOM
  • Writing JSON files using JSON.simple
  • Reading JSON files using JSON.simple
  • Extracting web data from a URL using JSoup
  • Extracting web data from a website using Selenium Webdriver
  • Reading table data from MySQL database
主站蜘蛛池模板: 宿松县| 奉新县| 舟曲县| 绥芬河市| 垦利县| 马鞍山市| 民县| 荆门市| 新昌县| 澜沧| 安阳县| 息烽县| 德化县| 鄂伦春自治旗| 台州市| 得荣县| 米林县| 大兴区| 分宜县| 和顺县| 苏尼特右旗| 都江堰市| 乌兰县| 茂名市| 鲁山县| 华宁县| 剑阁县| 襄垣县| 皋兰县| 蕉岭县| 辽阳县| 阜南县| 社会| 虎林市| 防城港市| 环江| 东山县| 汉源县| 保山市| 深水埗区| 于田县|