项目作者: redvg

项目描述 :
Monte Carlo simulations with PySpark on GCP Cloud Dataproc clusters
高级语言: Python
项目地址: git://github.com/redvg/dataproc-pyspark-monte-carlo.git
创建时间: 2018-08-09T09:17:16Z
项目社区:https://github.com/redvg/dataproc-pyspark-monte-carlo

开源协议:

下载


dataproc-pyspark-monte-carlo

Monte Carlo simulations with PySpark on GCP Cloud Dataproc clusters
as per https://cloud.google.com/solutions/monte-carlo-methods-with-hadoop-spark

Dataproc

via cloud shell: \
chmod u+x init.sh \
./init.sh \
creates cluster, uploads job to bucket

PySpark

simulates portfolio growth under gaussian distribution

Screenshot