Category Hadoop

MapR: The Magic Which Drives Hadoop

When Google faced the trouble of analyzing huge datasets, such as web access logs and the page rank of a website. They used to employ an algorithmic approach which used to take up a lot of time and had to…

Hadoop Distributed File System

The Hadoop Distributed File System (HDFS) was developed following the distributed file system design principles. Running on commodity hardware, HDFS is extremely fault-tolerant and robust, unlike any other distributed systems.

Hadoop Architecture

Apache Hadoop is a java based open source software. Basically, it’s a framework which is used to execute batch processing jobs on huge clusters. It is designed so that it can be scaled from a single server to hundreds of…

Format aborted in /app/hadoop/tmp/dfs/name

While startng the Hadoop i was just trying to format the filesystem. For that i have executed below command. bin/hadoop namenode -format But i am getting below error while doing this. Could you please let me know what could be…

Hadoop Interview Question

What is the difference between start-all.sh and start-dfs.sh in Hadoop   There are different scripts in bin dir in Hadoop which is used to launch Hadoop DFS and Hadoop Map/Reduce Daemons. start-dfs.sh – Starts the Hadoop DFS daemons (Namenode and Datanodes)…