
Unstructured

Unstructured data is data that has no defined organization and, in particular, does not break down into strictly defined columns of specific types. It covers many kinds of information, such as photos and graphic images, videos, streaming sensor data, web pages, PDF files, PowerPoint presentations, emails, blog entries, wikis, and word processing documents.

While pandas does not manipulate unstructured data directly, it provides a number of facilities to extract structured data from unstructured sources. As a specific example that we will examine, pandas has tools to retrieve web pages and extract specific pieces of content into a DataFrame.
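As a rough illustration of the web page case, the sketch below uses `pd.read_html`, which scans HTML for `<table>` elements and returns a list of DataFrames. The HTML string, city names, and numbers are made up for this example and stand in for a page that would normally be downloaded. It also assumes an HTML parser such as `lxml` (or `beautifulsoup4` with `html5lib`) is installed, since `read_html` depends on one.

```python
from io import StringIO

import pandas as pd

# Hypothetical HTML standing in for a retrieved web page: free-form
# content (heading, paragraph) wrapped around one tabular fragment.
html = """
<html><body>
<h1>Regional summary</h1>
<p>Some free-form text surrounding the table.</p>
<table>
  <tr><th>city</th><th>population</th></tr>
  <tr><td>Springfield</td><td>30000</td></tr>
  <tr><td>Shelbyville</td><td>25000</td></tr>
</table>
</body></html>
"""

# read_html returns one DataFrame per <table> it finds in the document.
tables = pd.read_html(StringIO(html))
df = tables[0]

print(df)
```

Only the table ends up in the DataFrame; the heading and paragraph are ignored. For a live page, a URL can be passed to `read_html` in place of the `StringIO` object.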
