Project author: gaurprabhakar94

Project description:
Basic Hadoop MapReduce Codes for data analysis
Language: Python
Repository: git://github.com/gaurprabhakar94/Hadoop-and-MapReduce.git
Created: 2017-07-31T16:54:59Z
Project page: https://github.com/gaurprabhakar94/Hadoop-and-MapReduce


Hadoop-and-MapReduce

In this short course offered by Udacity, I learnt the fundamentals of MapReduce and Apache Hadoop to start making sense of Big Data in the real world!

Every assignment folder has a ‘testfile’ that acts as a local subset of the main dataset on the server.
The mapper and reducer scripts are first tested on this file and then run on the main dataset.
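
The local test step can be sketched in a single Python process. This is a minimal illustration, not code from the assignments: the record layout (tab-separated `key<TAB>amount`), the sample data, and the function names `mapper`, `reducer`, and `run_local` are all assumptions made for the example. It mimics the map, sort, and reduce stages on a handful of lines, the same way a small test file would be used before submitting a job.

```python
from itertools import groupby


def mapper(lines):
    """Emit (key, value) pairs from hypothetical 'key<TAB>amount' records."""
    for line in lines:
        fields = line.strip().split("\t")
        if len(fields) != 2:
            continue  # skip malformed records instead of failing the job
        key, amount = fields
        try:
            yield key, float(amount)
        except ValueError:
            continue  # skip records whose amount is not numeric


def reducer(pairs):
    """Sum the values for each key; the pairs must already be sorted by key."""
    for key, group in groupby(pairs, key=lambda kv: kv[0]):
        yield key, sum(value for _, value in group)


def run_local(lines):
    """Simulate map -> sort -> reduce in one process for a small test input."""
    return list(reducer(sorted(mapper(lines))))


# Hypothetical stand-in for a few lines of a test file.
sample = ["toys\t10.5", "books\t4.0", "toys\t2.5", "bad line"]
result = run_local(sample)
print(result)
```

The `sorted` call stands in for the shuffle-and-sort phase that Hadoop performs between the map and reduce stages, which is why the reducer can rely on all values for a key arriving together.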

The comments in each mapper script describe the problem statement it addresses.