When you configure a MapReduce job, the inputs can include:
A. A single file
B. Paths to one or more directories
C. A file pattern (e.g., mypath/*.csv)
D. All of the above
A large bank was planning to offload existing data from a data warehouse into Hadoop and use SQL queries to access historical data. Which one of the following statements is true for using HiveQL?
A. It supports four logical operators in query predicates: IN, NOT IN, EXISTS, and NOT EXISTS
B. It does not support nested sub-queries
C. Hive supports all ANSI SQL 2011 syntax
D. All of the above
PCI compliance requirements allow the use of real customer data during testing and development only when:
A. The data is only processed in memory
B. Customer data is never allowed in testing or development
C. The data is never stored longer than 24 hours in the test system
D. The data is only stored in volatile storage that expires on power loss
What are the available document formats beside PDF and MS Word when export a redacted document using Optim Review Tool?
A. TIFF, and CSF
B. TIFF, and PNG
C. JPEG, and PNG
D. Plain Text, and CSV
What is Flume?
A. A distributed filesystem
B. A platform for executing MapReduce jobs
C. A programming language that translates high-level queries into map tasks and reduce tasks
D. A service for moving large amounts of data around a cluster soon after the data is produced.
Extracting structured data from various database into a "sandbox" location without writing code can be performed using which tool include with BigInsights?
A. Flume
B. Data Click
C. DataStage
D. Big SQL Load
Use of Bulk Load in HBase for loading large volume of data will result in which of the following?
A. It will use less CPU but will use more network resource
B. It will use less network resource but more CPU
C. It will behave same way as using HBase API for loading large volume of data
D. None of the above
When embedding SPSS models within InfoSphere Streams, what SPSS product must be installed on the same machine with InfoSphere Streams?
A. SPSS Modeler
B. SPSS Solution Publisher
C. SPSS Accelerator for InfoSphere Streams
D. None, the SPSS software runs remotely to the Streams machine
Which of the following statements regarding Sqoop is TRUE? (Choose two.)
A. All columns in a table must be imported
B. Sqoop bypasses MapReduce for enhanced performance
C. Each row from a source table is represented as a separate record in HDFS
D. When using a password file, the file containing the password must reside in HDFS
E. Multiple options files can be specified when invoking Sqoop from the command line
Which of the following techniques is NOT employed by Big SQL to improve performance?
A. Query Optimization
B. Predicate Push down
C. Compression efficiency
D. Load data into DB2 and return the data
Nowadays, the certification exams become more and more important and required by more and more enterprises when applying for a job. But how to prepare for the exam effectively? How to prepare for the exam in a short time with less efforts? How to get a ideal result and how to find the most reliable resources? Here on Vcedump.com, you will find all the answers. Vcedump.com provide not only IBM exam questions, answers and explanations but also complete assistance on your exam preparation and certification application. If you are confused on your C2090-101 exam preparations and IBM certification application, do not hesitate to visit our Vcedump.com to find your solutions here.