项目作者: panchicore

项目描述 :
Indexing GDELT database into Elasticsearch, entire database including the -each 15 minutes- real time events
高级语言: JavaScript
项目地址: git://github.com/panchicore/es-gdelt.git
创建时间: 2017-06-19T12:09:53Z
项目社区:https://github.com/panchicore/es-gdelt

开源协议:

下载


GDELT on Elasticsearch.

TODO: explain HISTORIC and REALTIME.

Environment Variables

KEY: Explain here.

GDELT Data Files

Explain python downloaders here.

TODO

  • Process KGK
  • Process Mentions
  • Automaticaly create gdelt index and schema

Run it

Make sure you have a healthy elasticsearch node ready.

  1. Install python requirements with pip install -r config/requirements.txt
  2. Set the env vars: https://github.com/panchicore/es-gdelt/blob/master/config/enviroment_variables.sample.sh, use source enviroment_variables.sample.sh
  3. Install the elasticsearch index: python gdelt_create_index.py
  4. Collect real time gdelt records: gdelt_realtime_downloader.py