Hadoop Distributed File System

The Hadoop Distributed File System (HDFS) was developed following distributed file system design principles. It runs on commodity hardware and is designed to be highly fault-tolerant and robust, which sets it apart from many other distributed file systems. […]

Hadoop Architecture

Apache Hadoop is Java-based, open source software. It is a framework used to run batch processing jobs on very large clusters. It is designed to scale from a single server to hundreds [...]

Some Hadoop Use Cases

The Big Data movement has been impressive, but also exhausting for its potential beneficiaries. That said, every business owner needs to understand the impact of this technology on their returns. Hadoop has proved itself to be [...]

Format aborted in /app/hadoop/tmp/dfs/name

While starting Hadoop, I was trying to format the filesystem. For that, I ran the command below:
bin/hadoop namenode -format
But I get the error below when I do this. Could you please let me know what the reason might be?
Warning: $HADOOP_HOME is deprecated.
13/08/10 13:52:56 INFO namenode.NameNode: STARTUP_MSG:
/************************************************************
STARTUP_MSG: [...]

Hadoop Interview Question

What is the difference between start-all.sh and start-dfs.sh in Hadoop?
The bin directory in Hadoop contains several scripts that are used to launch the Hadoop DFS and Hadoop Map/Reduce daemons:
start-dfs.sh - Starts the Hadoop DFS daemons (NameNode and DataNodes)
stop-dfs.sh - Stops the Hadoop DFS daemons (NameNode and DataNodes)
start-all.sh - Starts all Hadoop daemons (NameNode, DataNodes, [...]

