项目作者: jinpeng

项目描述 :
Apache Flink tutorial code.
高级语言: Java
项目地址: git://github.com/jinpeng/flink-tutor.git
创建时间: 2018-10-06T16:28:54Z
项目社区:https://github.com/jinpeng/flink-tutor

开源协议:

下载


flink-tutor

Apache Flink tutorial code.
Using the following techniques:

  • Apache Flink 1.7.2 (Scala 2.11)
  • Gradle 5.3
  • Gradle Kotlin DSL

Flink features:

  • Load dataset from CSV files
  • Filter dataset
  • GroupBy and reduce
  • Partition to sort
  • Passing parameters by command line arguments

Gradle Kotlin DSL features:

  • Java application plugins
  • Java source and target comapatibility
  • Respositories of Aliyun for Chinese developers
  • Task for run multiple Java applications besides default run
  • Passing arguments to Java applications

Run:

  1. $ gradle clean build
  2. $ gradle run --args="--input ../ml-latest-small/movies.csv --output filter-output"
  3. $ gradle averageRating --args="--movies ../ml-latest-small/movies.csv --ratings ../ml-latest-small/ratings.csv"

Datasets:
Movielens data downloaded from:
https://grouplens.org/datasets/movielens/