• +91 9723535972
  • info@interviewmaterial.com

Big Data Interview Questions and Answers

Question - What are three common input formats in Hadoop?

Answer -

Hadoop supports multiple input formats, which determine the shape of the data when it is collected into the Hadoop platform. The following input formats are three of the most common:

  • Text. This is the default input format. Each line within a file is treated as a separate record. The records are saved as key/value pairs, with the line of text treated as the value.
  • Key-Value Text. This input format is similar to the Text format, breaking each line into separate records. Unlike the Text format, which treats the entire line as the value, the Key-Value Text format breaks the line itself into a key and a value, using the tab character as a separator.
  • Sequence File. This format reads binary files that store sequences of user-defined key-value pairs as individual records.
  • Hadoop supports other input formats as well, so you also should have a good understanding of them, in addition to the ones described here.

Comment(S)

Show all Coment

Leave a Comment




NCERT Solutions

 

Share your email for latest updates

Name:
Email:

Our partners