Section 1
Introduction
How to unzip .gz files into a new directory in Hadoop?
Scenario Based Question
How does the Hadoop NameNode failover process work?
Scenario Based Question
How can we initiate a manual failover when automatic failover is configured?
When not to use Hadoop?
Is there a simple Hadoop command that can change the name of a file?
When to use Hadoop?
Scenario Based Question
Section 2
Can multiple files in HDFS use different block sizes?
Scenario Based Question
Hadoop is said to be highly scalable; how well does it scale?
What platforms and Java versions does Hadoop run on?
What kind of hardware scales best for Hadoop?
Is there an easy way to see the status and health of a cluster?
Scenario Based Question
Scenario Based Question
Scenario Based Question
Scenario Based Question
Section 3
I am seeing connection refused in the logs. How do I troubleshoot this?
Does Hadoop require SSH?
What does "NFS: Cannot create lock on (some dir)" mean?
Scenario Based Question
Scenario Based Question
Scenario Based Question
Scenario Based Question
Scenario Based Question
Scenario Based Question
Scenario Based Question
Section 4
What is the purpose of the Secondary NameNode?
Scenario Based Question
How do I set up a Hadoop node to use multiple volumes?
Scenario Based Question
Does HDFS make block boundaries between records?
Do wildcard characters work correctly in FsShell?
What does "file could only be replicated to 0 nodes, instead of 1" mean?
Scenario Based Question
What happens when two clients try to write into the same HDFS file?
How to limit a DataNode's disk usage?
Section 5
Scenario Based Question
Scenario Based Question
On an individual DataNode, how do you balance the blocks on the disk?
Scenario Based Question
What is the difference between hadoop fs -put and hadoop fs -copyFromLocal?
Scenario Based Question
How to check HDFS directory size?
Scenario Based Question
On what concept does the Hadoop framework work?
What is Hadoop Streaming?
Section 6
Explain the process of inter-cluster data copying.
Scenario Based Question
Differentiate between structured and unstructured data.
Explain the difference between NameNode, Backup Node and Checkpoint NameNode.
How can you overwrite the replication factors in HDFS?
What is the process to change the files at arbitrary locations in HDFS?
Explain the indexing process in HDFS.
What is rack awareness, and on what basis is data stored in a rack?
What happens to a NameNode that has no data?
Scenario Based Question
Section 7
Scenario Based Question
Whenever a client submits a Hadoop job, who receives it?
What do you understand by edge nodes in Hadoop?
What are real-time industry applications of Hadoop?
In what modes can Hadoop be run?
Explain the major difference between an HDFS block and an InputSplit.
What are the most common Input Formats in Hadoop?
What is Speculative Execution in Hadoop?
What is Fault Tolerance?
What is a heartbeat in HDFS?
Section 8
How to keep an HDFS cluster balanced?
How to deal with small files in Hadoop?
Scenario Based Question
What types of problems can MapReduce solve?
What is the difference between Hadoop MapReduce and Google MapReduce?
How to get the input file name in the mapper in a Hadoop program?
Scenario Based Question
Scenario Based Question
Scenario Based Question
Can you set the number of map tasks in MapReduce?
Section 9
If your MapReduce job launches 20 tasks for 1 job, can you limit it to 10 tasks?
Scenario Based Question
What is Shuffling and Sorting in Hadoop MapReduce?
How do I submit extra content (jars, static files, etc.) for a MapReduce job to use
How do I get my MapReduce Java program to read the cluster's set configuration?
Explain what happens when Hadoop spawned 50 tasks for a job and one of the task
What is OutputCommitter?
What is RecordReader in MapReduce?
What is a MapReduce Combiner?
What do you understand by the term Straggler?
Section 10
What are the identity mapper and identity reducer?
What is the role of a MapReduce partitioner?
When should you use a reducer?
What steps do you follow in order to improve the performance of a MapReduce job?
What is the purpose of the shuffling and sorting phase in the reducer in MapReduce
Scenario Based Question
What do you understand by compute and storage nodes?
Is it possible to rename the output file?
What is the default input type in MapReduce?
How is reporting controlled in Hadoop?
Section 11
Scenario Based Question
How do Map/Reduce InputSplits handle record boundaries correctly?
Scenario Based Question
Can we search files using wildcards?
What is the difference between Hadoop and RDBMS?
Can reducers communicate with each other?
What is a TaskInstance?
What are the primary phases of a Reducer?
Scenario Based Question
How do you gracefully stop a running job?
Section 12
How do I limit task slot usage?
How to increase the number of slots used?
Scenario Based Question
What is the process of changing the split size if there is limited storage space
Is it important for Hadoop MapReduce jobs to be written in Java?
What is the relationship between Job and Task in Hadoop?
When is it suggested to use a combiner in a MapReduce job?
Explain the differences between a combiner and a reducer.
Where is Mapper output stored?
Is it possible to split 100 lines of input as a single split in MapReduce?
Section 13
List the configuration parameters that have to be specified when running a MapReduce job
Scenario Based Question
When is it not recommended to use the MapReduce paradigm for large-scale data?
What is the fundamental difference between a MapReduce split and an HDFS block?
What happens when a DataNode fails during the write process?
Scenario Based Question
How is data split in Hadoop?
Explain the basic parameters of the mapper and reducer functions.
Preview - Apache Hadoop and Mapreduce Interview Questions and Answers