In this tutorial we will discuss you how to install Spark on Ubuntu VM. Spark do not have particular dependency on Hadoop or other tools. But if you are planning to use Spark with Hadoop then you should follow my Part-1, Part-2 and Part-3 tutorial which covers installation of Hadoop and Hive. Install Java and… Read More »
In this part we will discuss how to install HIVE on Hadoop HDFS file system.
In this part we will discuss how to install a new data node on existing Hadoop setup. Follow step by step guide in video tutorial.
In this guide we will discuss how to install Hadoop HDFS on a single node cluster with Google Cloud Virtual Machine. Follow video tutorial below. To copy various commands, you can come back on this page. Prepare new server Create a new VM in google cloud with Ubuntu as base image. Create an instance with… Read More »
This article will guide you on how to install Apache Maven on Ubuntu. Same instructions could be followed for other Linux distributions as well.
This article will show how you can install Apache Oozie on hadoop 2.8 single node cluster. Oozie is a workflow scheduler system to manage Apache Hadoop jobs. I assume, you have followed previous articles on how to setup hadoop single node cluster or have a Hadoop server already running. Apache Maven should be installed first.… Read More »