Hadoop Progaming
Guys,
Here you will find some sample MapReduce programs to process different types of files like Text,PDF,CSV,Log,XML,Doc/Docx,XLS/XLSX etc..
You can also find a single programe which can read any kind of file types specified above .
You can also find how to write custom Data Types and custom Partitioners in Map Reduce .
In trendfinder folder you will find how to work with multiple Mappers and Reducers .
Here we are processing twitter data based on the occurence of the tweet.
It contains a program to count the number of views on a per-hour basis on a particular web-site.
It contains how to write our own Custome FileInPutFormat,RecordReader classes in hadoop inorder to process PDF files .
In this package you will find how to connect to HDFS from Java , Writting a file in to HDFS using Java and how to display the contents of the directory in HDFS from Java Programing .
You can reach me for any suggestions/clarifications on : revanthkumar95@gmail.com
Feel free to share any insights or constructive criticism. Cheers!!