This tutorial talks about various resources that you can use to leard about hadoop and map reduce. It also talks about how you can think about learning about Big data as a subject in totality
Studying Hadoop or MapReduce can be a daunting task if you get your hand dirty at the start.
Some of the prerequisites for learning Hadoop are having a good experience in Java. Good Analytical skills help a lot as well and final secret sauce for being successful is – you need to be motivated to self learn lot of things in the bigdata arena.
For Learning Hadoop ,I followed the schedule as follows :
- Start with very basics of MR with code.google.com/edu/parallel/dsd-tutorial.html code.google.com/edu/parallel/mapreduce-tutorial.html
- Then go for the first two lectures in http://www.cs.washington.edu/education/courses/cse490h/08au/lectures.htm A very good course intro to MapReduce and Hadoop.
- Read the seminal paper labs.google.com/papers/mapreduce.html and its improvements in the updated version http://www.cs.washington.edu/education/courses/cse490h/08au/readings/communications200801-dl.pdf
- Then go for all the other videos in the U.Washington link given above.
- Try youtubing the terms Map reduce and hadoop to find videos by ORielly and Google RoundTable for good overview of the future of Hadoop and MapReduce
- Then off to the most important videos –
Cloudera Videos
http://www.cloudera.com/resources/?media=Video
and
Google MiniLecture Series
code.google.com/edu/submissions/mapreduce-minilecture/listing.html
Along with all the Multimedia above we need good written material
Documents:
- Architecture diagrams at hadooper.blogspot.com are good to have on your wall
- Hadoop: The definitive guide goes more into the nuts and bolts of the whole system where as Hadoop in Action is a good read with lots of teaching examples to learn the concepts of hadoop. Pro Hadoop is not for beginners
- pdfs of the documentation from Apache Foundation
hadoop.apache.org/common/docs/current/
and hadoop.apache.org/common/docs/stable/
will help you learn as to how model your problem into a MR solution in order to gain the advantages of Hadoop in total.- HDFS paper by Yahoo! Research is also a good read in order to gain in depth knowledge of hadoop
- Subscribe to the User Mailing List of Commons, MapReduce and HDFS in order to know problems, solutions and future solutions.
- Try the http://developer.yahoo.com/hadoop/tutorial/module1.html link for beginners to expert path to Hadoop
In Addition following 2 books are good resources:
- Hadoop – Definitive Guide
- Hadoop in Action
For Any Queries …
Contact Apache, Google, Bing, Yahoo!
thnaks….
LikeLike
Hi Varad,
I don’t have any experience in Java. Still can I able to learn Hadoop?
LikeLike
Hello I’m a network admin Windows side looking into getting into Hadoop training I have no knowledge except reading about it as well no programming skills, I see Java is required and understanding of it . What Java course u recommend I take ( intro to java programming or some other one in java)? Would java be good only then I can take Hadoop Administration course or I need Linux experenince as well? If u can please let me know thanks.
LikeLike
Hi Mehul,
MapReduce and HDFS is not about Language, but more about conceptually understanding how distributed components work. If you are good with Computer Science, and algorithms, language won’t be a difficult thing to pick up for Hadoop. Also, as hadoop supports hadoop-streaming, you are not bound by the lanuage. I’ll prefer Python as its easier to learn and has good learning resources around it (http://www.codecademy.com/tracks/python).
If you want more tutorials on hadoop internals, you would be interested in looking at the following links –
A presentation I gave in the colleges at Solapur – You’d be interested in going through it – http://www.slideshare.net/VaradMeru/big-data-hadoop-nosql-and-more
Please find the links of the blog articles I’ve written. Its in the logical flow –
Hadoop Setup on a Single Node (for Dev)
http://www.orzota.com/single-node-hadoop-setup-2/
Eclipse Setup for programming
http://www.orzota.com/eclipse-setup-for-hadoop-development/
Step-by-Step MapReduce Programming
http://www.orzota.com/step-by-step-mapreduce-programming/
Pig for Beginners
http://www.orzota.com/pig-for-beginners/
Hive for Beginners
http://www.orzota.com/hive-for-beginners/
LikeLike
Hi,
I recently started my job search, and heard about hadoop in recent times. Before hearing about Hadoop I thought of getting trained in Java. Now, I’m in a dilemma to choose between Java Developer/ Hadoop. I need some advice regarding this… If we know java can we get ourself trained in Hadoop and Do Hadoop really need hands on experience in Java. I just know oops concepts in Java..
Please help me out
LikeLike
Hi Varad,
I don’t have experience in any computer language, can I still be able to learn Hadoop? I know it will be very difficult.
LikeLike
hello….sir
i had one question for u…
can any one answer this question …..
why we are using Writables in hadoop for data transformation …..through networks …?
already we have a serialization in java ….?
any one can…?
LikeLike
Hello Sir I’m a System admin on Windows side and i am looking into getting into Hadoop.
I have no very less programming skills, I dont know much about Java
Can I take Hadoop Administration course or I need to learn Jave & Linux First?
LikeLike
Reblogged this on Romanegloo: the Flying Torch and commented:
Great place to start learning Hadoop
LikeLike
Hi,
I recently started my job search, and heard about hadoop in recent times. Before hearing about Hadoop I thought of getting trained in Java. Now, I’m in a dilemma to choose between Java Developer/ Hadoop. I need some advice regarding this I had worked on SAP and Tally ealier
LikeLike
Hi Prasad,
One does not need to be a Java programmer to program MapReduce Jobs. You can use any language of choice and write your MapReduce jobs. Look at https://www.udacity.com/course/intro-to-hadoop-and-mapreduce–ud617 for example. A Course on MapReduce and only uses Python.
LikeLike