项目作者: France1

项目描述 :
Tutorials and example code to prepare for Cloudera "CCA Spark and Hadoop Developer" Certification
高级语言: Java
项目地址: git://github.com/France1/Hadoop-Spark-Training.git
创建时间: 2020-01-25T13:20:03Z
项目社区:https://github.com/France1/Hadoop-Spark-Training

开源协议:

下载


Hadoop-Spark-Training

Tutorials and example code to prepare for Cloudera “CCA Spark and Hadoop Developer” Certification

All the examples and exercises are run using Cloudera QuickStart Docker Image. It is recommended to increase the RAM used by Docker in Preferences->Advanced to at least 8 GiB as well as the number of CPUs (I am currently using 6).

To start the Docker image run

  1. docker run --hostname=quickstart.cloudera --privileged=true -it \
  2. -v $(pwd):/cloudera -p 8888:8888 -p 80:80 \
  3. cloudera/quickstart /usr/bin/docker-quickstart

Cloudera Live VM can be accessed at localhost:80 and Hue at localhost:8888 using admin for both username and password.

To upgrade to Spark 2.4

  1. ./cloudera/upgrade_spark.sh
  2. source ~/.bash_profile

Other references