Hadoop Tutorial : Installing Hadoop on a Single Node Cluster – A Walkthrough

Hadoop Tutorial : Installing Hadoop on a Single Node Cluster – A Walkthrough

This article attempts to give a step by step walk through for creating a single Node Hadoop Cluster. It is an hands on tutorial so that even a novice user can follow the steps and create the Hadoop Cluster.

Continue reading “Hadoop Tutorial : Installing Hadoop on a Single Node Cluster – A Walkthrough”

Advertisements

Research Paper : Parallel Computing Solutions – Hadoop Mapreduce

Data Export using Sqoop

This was a research paper that we submitted to ICAPADS-2012 an IEEE – Institute of High performance  distributed computing conference . It talks about a map reduce based solution to maze traversal problem which is applicable in many practical problems.

Continue reading “Research Paper : Parallel Computing Solutions – Hadoop Mapreduce”

Sqoop Tutorial : Hadoop : Importing data from RDBMS to HDFS

Sqoop Tutorial : Hadoop : Importing data from RDBMS to HDFS

In this article we will go through a very important technique – importing data from SQL table to HDFS. We will do so on a sample database say ‘bigdata’ and a sample table say ’employee’ containing employee data.

We will do this in 3 parts. Part 1 will be in scope of this article. We will look at the next parts in subsequent article

Continue reading “Sqoop Tutorial : Hadoop : Importing data from RDBMS to HDFS”

Hadoop Tutorial : Custom Record Reader with TextInputFormat

Hadoop Tutorial : Custom Record Reader with TextInputFormat

In this hadoop tutorial we will have a look at the modification to our previous program wordcount with our own custom mapper and reducer by implementing a concept called as custom record reader. Before we attack the problem let us look at some theory required to understand the topic.

Continue reading “Hadoop Tutorial : Custom Record Reader with TextInputFormat”