Question
a.
Pig
b.
Hive
c.
HCatalog
d.
Impala
Posted under Hadoop
Engage with the Community - Add Your Comment
Confused About the Answer? Ask for Details Here.
Know the Explanation? Add it Here.
Q. Sally in data processing uses __________ to cleanse and prepare the data.
Similar Questions
Discover Related MCQs
Q. For ___________ partitioning jobs, simply specifying a custom directory is not good enough.
View solution
Q. ___________ property allows us to specify a custom dir location pattern for all the writes, and will interpolate each variable.
View solution
Q. HCatalog maintains a cache of _________ to talk to the metastore.
View solution
Q. On the write side, it is expected that the user pass in valid _________ with data correctly.
View solution
Q. A float parameter, defaults to 0.0001f, which means we can deal with 1 error every __________ rows.
View solution
Q. _________________ property allow users to override the expiry time specified.
View solution
Q. ____________ is used with Pig scripts to write data to HCatalog-managed tables.
View solution
Q. Hive does not have a data type corresponding to the ____________ type in Pig.
View solution
Q. _______________ method is used to include a projection schema, to specify the output fields.
View solution
Q. The first call on the HCatOutputFormat must be ____________
View solution
Q. ___________ is the type supported for storing values in HCatalog tables.
View solution
Q. The output descriptor for the table to be written is created by calling ____________
View solution
Q. Which of the following Hive commands is not supported by HCatalog?
View solution
Q. Mahout provides ____________ libraries for common and primitive Java collections.
View solution
Q. _________ does not restrict contributions to Hadoop based implementations.
View solution
Q. Mahout provides an implementation of a ______________ identification algorithm which scores collocations using log-likelihood ratio.
View solution
Q. The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.
View solution
Q. The _________ collocation identifier is integrated into the process that is used to create vectors from sequence files of text keys and values.
View solution
Q. ____________ generates NGrams and counts frequencies for ngrams, head and tail subgrams.
View solution
Q. A key of type ___________ is generated which is used later to join ngrams with their heads and tails in the reducer phase.
View solution
Suggested Topics
Are you eager to expand your knowledge beyond Hadoop? We've curated a selection of related categories that you might find intriguing.
Click on the categories below to discover a wealth of MCQs and enrich your understanding of Computer Science. Happy exploring!