Mahout Tutorial : Introduction & Setting up Mahout

This article introduces Mahout, Apache's scalable machine learning library, and walks you through its installation step by step.

Hadoop Tutorial : Installing Hadoop on a Single Node Cluster – A Walkthrough

This article gives a step-by-step walkthrough for creating a single-node Hadoop cluster. It is a hands-on tutorial, so even a novice user can follow the steps and create the cluster.

Hadoop Tutorial : Map Reduce Introduction and Internal Data flow

This tutorial covers the MapReduce programming paradigm, which is widely used in Big Data analytics. We will also run through an example step by step to understand the mechanisms involved.

Hadoop Tutorial : Intro to Big Data and HDFS

This tutorial gives you an overall understanding of Big Data in analytics and the various strategies around it. It also covers HDFS, the most important component of Hadoop, in detail.

Hadoop Tutorial : Custom Record Reader with TextInputFormat

In this Hadoop tutorial we modify our previous wordcount program, which uses our own custom mapper and reducer, by implementing a concept called a custom record reader. Before we attack the problem, let us look at the theory required to understand the topic.
