Big Data

Hadoop Commands with Examples

Hadoop Commands with Examples

Apache Hadoop is an open-source project that provides a new method for storing and processing large amounts of data. The Java-based software architecture is designed for distributed storage and processing of very large data sets on commodity-based computer clusters. If you want to know more about Hadoop click here. In this article, we will look…

Install Apache Pig in Ubuntu

Install Apache Pig in Ubuntu

Introduction Apache Pig is a platform or a tool that is developed originally at Facebook and is used to perform MapReduce tasks on huge datasets. It is basically used to carry out the operations on top of Hadoop. Executing Pig commands for MapReduce tasks is fairly easy to perform and easy to understand for the…

How to Install Hadoop in Ubuntu

How to Install Hadoop in Ubuntu

Introduction Apache Hadoop is an open-source software about which you can learn more here. In this tutorial, we will learn how to install Hadoop in Ubuntu. Here, we are using a cloud platform (Amazon Web Service to be particular), but you can follow the same steps for your local system (Ubuntu) as well. Also, this…

Install Apache Hive 2.x in CentOS

Install Apache Hive 2.x in CentOS

Introduction Apache Hive is a data warehouse software project that provides data query and analysis on top of Apache Hadoop. Hive provides a SQL-like interface for querying data stored in Hadoop-integrated databases and file systems. In order to execute SQL applications and queries over larger datasets, standard SQL queries must be used in the MapReduce…

What is Hadoop? Introduction, Modules, Architecture

What is Hadoop? Introduction, Modules, Architecture

Hadoop Introduction Hadoop is an Apache open-source platform for storing, processing, and analyzing massive amounts of data in a distributed manner through large clusters of commodity hardware. Apache Hadoop is written in Java and is basically used for batch processing. Large data sets are spread through clusters of commodity computers, and applications developed with Hadoop…