- Java Data Science Cookbook
- Rushdi Shams
- 125字
- 2021-07-09 18:44:25
Chapter 1. Obtaining and Cleaning Data
In this chapter, we will cover the following recipes:
- Retrieving all file names from hierarchical directories using Java
- Retrieving all file names from hierarchical directories using Apache Commons IO
- Reading contents from text files all at once using Java 8
- Reading contents from text files all at once using Apache Commons IO
- Extracting PDF text using Apache Tika
- Cleaning ASCII text files using Regular Expressions
- Parsing Comma Separated Value files using Univocity
- Parsing Tab Separated Value files using Univocity
- Parsing XML files using JDOM
- Writing JSON files using JSON.simple
- Reading JSON files using JSON.simple
- Extracting web data from a URL using JSoup
- Extracting web data from a website using Selenium
Webdriver
- Reading table data from MySQL database
推薦閱讀
- 數(shù)據(jù)庫(kù)原理及應(yīng)用教程(第4版)(微課版)
- Developing Mobile Games with Moai SDK
- Hadoop大數(shù)據(jù)實(shí)戰(zhàn)權(quán)威指南(第2版)
- 網(wǎng)站數(shù)據(jù)庫(kù)技術(shù)
- 新基建:數(shù)據(jù)中心創(chuàng)新之路
- 云數(shù)據(jù)中心網(wǎng)絡(luò)與SDN:技術(shù)架構(gòu)與實(shí)現(xiàn)
- Python數(shù)據(jù)分析與挖掘?qū)崙?zhàn)(第3版)
- Chef Essentials
- 探索新型智庫(kù)發(fā)展之路:藍(lán)迪國(guó)際智庫(kù)報(bào)告·2015(上冊(cè))
- Power BI智能數(shù)據(jù)分析與可視化從入門(mén)到精通
- 從實(shí)踐中學(xué)習(xí)sqlmap數(shù)據(jù)庫(kù)注入測(cè)試
- 區(qū)塊鏈+:落地場(chǎng)景與應(yīng)用實(shí)戰(zhàn)
- 數(shù)據(jù)指標(biāo)體系:構(gòu)建方法與應(yīng)用實(shí)踐
- 數(shù)據(jù)分析思維:產(chǎn)品經(jīng)理的成長(zhǎng)筆記
- 數(shù)據(jù)庫(kù)原理及應(yīng)用:SQL Server 2016