Question - Can you list the limitations of Hadoop?
Answer -
Hadoop is considered a very important Big Data management tool. However, like any other tool, it has limitations of its own. The points below describe the classic Hadoop 1.x architecture:
- In Hadoop, you can configure only one NameNode.
- Hadoop is suitable only for the batch processing of a large amount of data.
- Hadoop can run only MapReduce jobs.
- Hadoop supports only one NameNode and one namespace per cluster.
- Hadoop does not support horizontal scaling of the NameNode.
- The NameNode's metadata has to be checkpointed to the Secondary NameNode periodically (hourly by default).
- Hadoop can support only up to 4000 nodes per cluster.
- In Hadoop, a single component, the JobTracker, performs the majority of the work: managing Hadoop resources, scheduling jobs, monitoring jobs, rescheduling jobs, etc.
- Because so much depends on this one component, the JobTracker is a single point of failure in Hadoop.
- Real-time data processing is not possible with Hadoop.
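
To make the batch-only, MapReduce-only points above more concrete, here is a minimal sketch of the map, shuffle, and reduce phases in plain Python. It does not use Hadoop's API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `run_batch_job`) are made up for illustration, and the word-count task is only an assumed example. The point it tries to show is that the whole input has to be available before the job runs, and results appear only after the final phase finishes, which is why this model suits batch work rather than real-time processing.

```python
from collections import defaultdict


def map_phase(records):
    """Map step: emit a (word, 1) pair for every word in every input line."""
    for line in records:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle step: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce step: collapse each key's values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def run_batch_job(records):
    """Run the three phases in order over a complete, fixed batch of input."""
    return reduce_phase(shuffle_phase(map_phase(records)))


if __name__ == "__main__":
    # Hypothetical input batch; a real job would read its input from HDFS.
    batch = ["Hadoop stores data", "Hadoop processes data in batches"]
    print(run_batch_job(batch))
```

Anything that cannot be expressed as these phases over a fixed input, such as reacting to individual events as they arrive, does not fit this model well.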