项目作者: VikramTiwari

项目描述 :
samples for dataflow
高级语言: Python
项目地址: git://github.com/VikramTiwari/dataflow-samples.git
创建时间: 2017-04-14T07:04:28Z
项目社区:https://github.com/VikramTiwari/dataflow-samples

开源协议:

下载


Dataflow/Apache Beam Python Samples

Requirements

  • An active Google Cloud project
  • Dataflow API activated
  • python 2.7

Setup

Using python 2.7, pip and virtualenv

  1. # clone the project
  2. git clone https://github.com/VikramTiwari/dataflow-samples.git dataflow-samples
  3. # switch to the directory
  4. cd dataflow-sample
  5. # make sure it's 2.7
  6. python --version
  7. # this will create a virtual environment without affecting anything on your system
  8. virtualenv dataflow
  9. # activate virual environment
  10. source dataflow/bin/activate
  11. # install necessary packages
  12. pip install --upgrade setuptools apache-beam
  13. pip install --upgrade apache-beam[gcp]

Samples

To run samples, follow the instructions at the top of each file

Google Cloud - Dataflow Console

Hello World on Google Cloud