Hadoop

How to Import Data from MySQL to HDFS Using Sqoop

Apache Sqoop is a tool in Hadoop ecosystem which is used to import/export data between RDBMS and HDFS. This data is in structured format and has a schema. There are multiple cases where you want to analyze some data in [...]

Linoxide 3:00 am

How to Run Hadoop MapReduce Program on Ubuntu 16.04

In this blog, I will show you how to run a MapReduce program. MapReduce is one of the core part of Apache Hadoop, it is the processing layer of Apache Hadoop. So before I show you how to run a [...]

Linoxide 3:00 am

How to Setup Single Node Hadoop Cluster Using Docker

In this article, I will show you how to setup a single node hadoop cluster using Docker. Before I start with the setup, let me briefly remind you what Docker and Hadoop are. Docker is a software containerization platform where [...]

Linoxide 3:00 am

Awesome ! Hadoop HDFS Commands Cheat Sheet

HDFS is now an Apache Hadoop subproject. An HDFS instance contains a vast amount of servers and each store a part of file system. A typical file size in HDFS would be in gigabytes or terabytes in size hence applications [...]

Linoxide 3:00 am

How to Install Apache Sqoop on Ubuntu 16.04

Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. For example : MySQL, Oracle, Microsoft SQL Server. You can import and export data between relational databases and hadoop. You [...]

Linoxide 3:00 am

30 Most Frequently Used Hadoop HDFS Shell Commands

In this tutorial, we will walk you through the Hadoop Distributed File System (HDFS) commands you will need to manage files on HDFS. HDFS command is used most of the times when working with Hadoop File System. It includes various shell-like commands [...]

Linoxide 3:00 am

How to Setup Hadoop Multi-Node Cluster on Ubuntu

In this tutorial, we will learn how to setup a multi-node hadoop cluster on Ubuntu 16.04. A hadoop cluster which has more than 1 datanode is a multi-node hadoop cluster, hence, the goal of this tutorial is to get 2 [...]

Linoxide 3:00 am