官术网_书友最值得收藏!

Recipe 1 – installing OpenRefine

In this recipe, you will learn where to look in order to download the latest release of OpenRefine and how to get it running on your favorite operating system.

First things first: start by downloading OpenRefine from http://openrefine.org/. OpenRefine was previously known as Freebase Gridworks, then as Google Refine for a few years. Since October 2012, the project has been taken over by the community, which makes OpenRefine really open. OpenRefine 2.6 is the first version carrying the new branding. If you are interested in the development version, you can also check https://github.com/OpenRefine.

OpenRefine is based on the Java environment, which makes it platform-independent. Just make sure that you have an up-to-date version of Java running on your machine (available from http://java.com/download) and follow the following instructions, depending on your operating system:

Windows

  1. Download the ZIP archive.
  2. Unzip and extract the contents of the archive to a folder of your choice.
  3. To launch OpenRefine, double-click on openrefine.exe.

Mac

  1. Download the DMG file.
  2. Open the disk image and drag the OpenRefine icon into the Applications folder.
  3. Double-click on the icon to start OpenRefine.

Linux

  1. Download the gzipped tarball.
  2. Extract the folder to your home directory.
  3. In a terminal, enter ./refine to start.

It should be noted that, by default, OpenRefine will allocate only 1 GB of RAM to Java. While this is sufficient to handle small datasets, it soon becomes restrictive when dealing with larger collections of data. In Recipe 7 – going for more memory, we will detail how to allow OpenRefine to allocate more memory, an operation that also differs from one OS to the other.

主站蜘蛛池模板: 普格县| 南平市| 大洼县| 石狮市| 内黄县| 岳池县| 云梦县| 桂林市| 桂阳县| 禹城市| 池州市| 重庆市| 嘉定区| 三台县| 峡江县| 平陆县| 九龙城区| 天镇县| 申扎县| 临邑县| 阜城县| 象山县| 色达县| 大安市| 莱西市| 封开县| 皋兰县| 达孜县| 大新县| 卓尼县| 凤山市| 宽城| 新闻| 青龙| 出国| 乌兰县| 肇州县| 左权县| 崇义县| 龙泉市| 怀来县|