Tasks and conclusion
Post-training tasks:
- Try setting up your own three-node Hadoop cluster.
- A VM-based solution can be found here
- Write a simple Spark or MapReduce (MR) job of your choice to understand how analytics are generated from data.
- A sample dataset can be found here
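
As a starting point for the second task, the shape of a MapReduce job can be sketched in plain Python before moving to a real cluster. The example below is only an illustration: it runs locally without Hadoop or Spark, the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical, and the input lines are made up. A real job would express the same map and reduce steps through the framework's own API.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every word in a line, as a mapper would."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group values by key, mimicking the framework's shuffle step."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Sum the counts for a single key, as a reducer would."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce over an iterable of text lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Made-up input; replace with lines read from the sample dataset.
    sample = ["hadoop stores data", "spark processes data"]
    for word, count in sorted(word_count(sample).items()):
        print(word, count)
```

Word count is the usual first job because each phase is easy to inspect. Once it works, the same structure extends to other aggregations, such as summing a numeric column per key instead of counting words.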