What is a partition in Hive and why is partitioning required in Hive

Related Subjects

Machine Learning
Deep Learning
Data Science
Artificial Intelligence

All Subject

Machine Learning Interview Questions and Answers | Deep Learning Interview Questions and Answers | Data Science Interview Questions and Answers | Artificial Intelligence Interview Questions and Answers

Question - What is a partition in Hive and why is partitioning required in Hive

Answer -

Partition is a process for grouping similar types of data together based on columns or partition keys. Each table can have one or more partition keys to identify a particular partition.

Partitioning provides granularity in a Hive table. It reduces the query latency by scanning only relevant partitioned data instead of the entire data set. We can partition the transaction data for a bank based on month — January, February, etc. Any operation regarding a particular month, say February, will only have to scan the February partition, rather than the entire table data.

Comment(S)

Show all Coment

NCERT Solutions

Share your email for latest updates

Name:

Email:

Hadoop Interview Questions and Answers

Related Subjects

Comment(S)

Leave a Comment

NCERT Solutions

Share your email for latest updates

Latest News

10000+ interview questions in different categories

Freshers and experienced

Testimonial

NCERT Questions Answers

Halpura.com

Our partners