Assignment 15A – Hadoop

Assignment #15 – Hadoop

 

Objectives

Learn to setup and use Hadoop tools

Assignment

You will follow the tutorial at http://hortonworks.com/blog/hortonworks-sandbox-azure/

This will get Hadoop and a few tools up and running in an Azure account. You get one month free (and only one month) so only do this when you are working through this to turn in.

There are tons of tutorials on Hadoop and its tools – we will use Hive (a query language) and baseball statistics – it is at http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-hive/  (you may want to do the Pig tutorial first at http://hortonworks.com/hadoop-tutorial/hello-world-an-introduction-to-hadoop-hcatalog-hive-and-pig/ )

In the end you will get a screen of the maximum hits for the players – capture this screen for 1900-1910 (at a minimum) and submit.

 

Information

This is all done through the tutorials at the links in the assignment

Estimated Completion Time

 

Supporting Lectures 

All included in tutorials (please also read comments where version changes in screen captures are noted)

Questions and Answers

 

External Resources

 

Grading Criteria