WikiData Processing

Accessing the Data

Download latest-all.json.bz2 from https://dumps.wikimedia.org/wikidatawiki/entities/.

wget is well suited to downloading a file this large: -c resumes a partial download and -t 0 retries without limit.

$ wget -c -t 0 https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.json.bz2
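
Before importing, it can be useful to look at a few records without decompressing the whole archive. The following is a minimal sketch, assuming the dump keeps the usual Wikidata layout of a JSON array with one entity per line (a "[" line first, a "]" line last, and a trailing comma after each entity). The function name iter_entities is illustrative and is not part of any tool used in this post.

```python
import bz2
import json
from typing import Iterator


def iter_entities(path: str) -> Iterator[dict]:
    """Yield one entity dict per line of a bz2-compressed Wikidata JSON dump.

    Assumes the dump is a JSON array written one entity per line.
    """
    # bz2.open in text mode decompresses lazily, so memory use stays small.
    with bz2.open(path, mode="rt", encoding="utf-8") as dump:
        for line in dump:
            # Drop surrounding whitespace and the comma that separates entities.
            line = line.strip().rstrip(",")
            # Skip the array brackets and any blank lines.
            if line in ("[", "]", ""):
                continue
            yield json.loads(line)


# Example use (reads only the first three entities):
# import itertools
# for entity in itertools.islice(iter_entities("latest-all.json.bz2"), 3):
#     print(entity["id"], entity.get("type"))
```

Because the function is a generator, stopping after a handful of entities avoids reading the rest of the file.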

Installing Neo4j on Ubuntu

From Repositories (Not recommended)

This method requires superuser (root) privileges, which is inconvenient.

Step 1. (Optional) If there is no Java runtime environment on the machine, install the default JRE and JDK first.

$ sudo apt-get update
$ sudo apt-get install default-jre
$ sudo apt-get install default-jdk

Step 2. Install Neo4j.

To use the repository, add it to the list of sources:

$ wget -O - https://debian.neo4j.org/neotechnology.gpg.key | sudo apt-key add -
$ echo 'deb https://debian.neo4j.org/repo stable/' | sudo tee /etc/apt/sources.list.d/neo4j.list
$ sudo apt-get update

To install the latest Neo4j Community Edition:

$ sudo apt-get install neo4j

User Installation

Extract the files from neo4j-community-3.5.8-unix.tar.gz:

$ tar zxvf neo4j-community-3.5.8-unix.tar.gz

Add the following lines to ~/.bashrc, then source it:

# neo4j
export NEO4J_HOME="/home/fyb/neo4j-community-3.5.8"
export PATH=$PATH:$NEO4J_HOME/bin

To allow remote access, edit the configuration file:

$ vi neo4j-community-3.5.8/conf/neo4j.conf

change #dbms.connector.http.listen_address=:7474
to dbms.connector.http.listen_address=0.0.0.0:7474

change #dbms.connector.bolt.listen_address=:7687
to dbms.connector.bolt.listen_address=0.0.0.0:7687
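
The same change can be scripted instead of made by hand in vi. The following is a rough sketch, assuming neo4j.conf contains the two settings as plain key=value lines, optionally commented out with a leading "#". The function name enable_remote_access is made up for this example. It only transforms text, so reading and writing the file is left to the caller.

```python
import re

# Settings to uncomment and bind to all interfaces, matching the manual edit.
LISTEN_ADDRESSES = {
    "dbms.connector.http.listen_address": "0.0.0.0:7474",
    "dbms.connector.bolt.listen_address": "0.0.0.0:7687",
}


def enable_remote_access(conf_text: str) -> str:
    """Return conf_text with the HTTP and Bolt listen addresses set to 0.0.0.0."""
    for key, value in LISTEN_ADDRESSES.items():
        # Match the setting whether or not it is commented out.
        pattern = re.compile(
            r"^#?[ \t]*" + re.escape(key) + r"=.*$", re.MULTILINE
        )
        conf_text, count = pattern.subn(f"{key}={value}", conf_text)
        if count == 0:
            # Setting is absent (an assumption-breaking case): append it.
            conf_text = conf_text.rstrip("\n") + f"\n{key}={value}\n"
    return conf_text


# Example use (path taken from the step above):
# path = "neo4j-community-3.5.8/conf/neo4j.conf"
# with open(path, encoding="utf-8") as f:
#     updated = enable_remote_access(f.read())
# with open(path, "w", encoding="utf-8") as f:
#     f.write(updated)
```

Binding to 0.0.0.0 exposes both ports on every network interface, so it is worth keeping a backup of the original file before overwriting it.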

Start the service (<NEO4J_HOME> is the top-level neo4j-community-3.5.8 directory). Running neo4j console keeps the server in the foreground:

$ neo4j console
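
Once the server is running, one way to confirm that the remote-access setting took effect is to try a TCP connection to the HTTP and Bolt ports from another machine. The following is a small sketch using only the standard library. The helper name port_is_open and the host name in the usage comment are placeholders, and the ports are the ones configured above.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection raises OSError on refusal, timeout, or DNS failure.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example use (hypothetical host name; replace with the server's address):
# for port in (7474, 7687):
#     print(port, port_is_open("neo4j-host.example", port))
```

A successful connection only shows that something is listening on the port. It does not check that Neo4j itself is answering requests.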

Installing Node.js

Node.js v12.x:

$ curl -sL https://deb.nodesource.com/setup_12.x | sudo -E bash -
$ sudo apt-get install -y nodejs

Importing

We use wikidata-neo4j-importer to import Wikidata into Neo4j:

https://github.com/findie/wikidata-neo4j-importer

Some dependencies need to be installed first:

$ npm install neo4j-driver async n-readlines cli-color slugify

Now we can run index.js to import Wikidata into Neo4j:

$ node index.js

Bibliography

http://debian.neo4j.org/

https://github.com/nodesource/distributions/blob/master/README.md#deb

https://neo4j.com/download-thanks/?edition=community&release=3.5.8&flavour=unix
