Q. Apache _________ provides direct queries on self-describing and semi-structured data in files.
a. Drill
b. Mahout
c. Oozie
d. All of the mentioned
Posted under Hadoop
Similar Questions
Q. Drill provides a __________ like internal data model to represent and process data.
Q. Drill also provides intuitive extensions to SQL to work with _______ data types.
Q. The Apache Crunch Java library provides a framework for writing, testing, and running ___________ pipelines.
Q. For Scala users, there is the __________ API, which is built on top of the Java APIs.
Q. The Crunch APIs are modeled after _________, the library that Google uses for building data pipelines on top of its own implementation of MapReduce.
Q. Crunch was designed for developers who understand __________ and want to use MapReduce effectively.
Q. Hive, Pig, and Cascading all use a _________ data model.
Q. A __________ represents a distributed, immutable collection of elements of type T.
Q. ___________ executes the pipeline as a series of MapReduce jobs.
Q. __________ represent the logical computations of your Crunch pipelines.
Q. PCollection, PTable, and PGroupedTable all support a __________ operation.
Q. Crunch uses Java serialization to serialize the contents of all of the ______ in a pipeline definition.
Q. The inline DoFn that splits a line into words is an inner class ____________
Q. DoFns provide direct access to the __________ object that is used within a given Map or Reduce task via the getContext method.
Q. The top-level ___________ package contains three of the most important specializations in Crunch.
Q. The Avros class also has a _____ method for creating PTypes for POJOs using Avro’s reflection-based serialization mechanism.
Q. The ______________ class defines a configuration parameter named LINES_PER_MAP that controls how the input file is split.
Q. The ________ class allows developers to exercise precise control over how data is partitioned, sorted, and grouped by the underlying execution engine.
Q. Which of the following projects is an interface definition language for Hadoop?
Q. __________ is used as a remote procedure call (RPC) framework at Facebook.