Exam Details

  • Exam Code: E20-007
  • Exam Name: Data Science and Big Data Analytics
  • Certification: EMC Certifications
  • Vendor: EMC
  • Total Questions: 198 Q&As
  • Last Updated: Mar 30, 2025

EMC EMC Certifications E20-007 Questions & Answers

  • Question 61:

    A Data Scientist is assigned to build a model from a reporting data warehouse. The warehouse contains data collected from many sources and transformed through a complex, multi-stage ETL process. What is a concern the data scientist should have about the data?

    A. It is too processed

    B. It is not structured

    C. It is not normalized

    D. It is too centralized

  • Question 62:

    Refer to the exhibit.

    You are building a decision tree. In this exhibit, four variables are listed with their respective values of info-gain.

    Based on this information, on which attribute would you expect the next split to be in the decision tree?

    A. Credit Score

    B. Age

    C. Income

    D. Gender
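
    As background for this question: greedy decision-tree learners such as ID3 usually split next on the attribute with the largest information gain. The exhibit's values are not reproduced here, so the sketch below uses a small hypothetical dataset (the rows and the names income, gender, and default are assumptions for illustration only) to show how info-gain is computed and compared.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def info_gain(rows, attribute, target):
    """Entropy of the target minus the weighted entropy after splitting on attribute."""
    base = entropy([r[target] for r in rows])
    groups = {}
    for r in rows:
        groups.setdefault(r[attribute], []).append(r[target])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder

# Hypothetical toy data, NOT the values from the exhibit.
rows = [
    {"income": "high", "gender": "F", "default": "no"},
    {"income": "high", "gender": "M", "default": "no"},
    {"income": "low", "gender": "F", "default": "yes"},
    {"income": "low", "gender": "M", "default": "yes"},
]

gains = {a: info_gain(rows, a, "default") for a in ("income", "gender")}
best = max(gains, key=gains.get)  # attribute a greedy learner would split on next
print(gains, best)
```

    In this toy data the income attribute separates the classes perfectly while gender does not separate them at all, so a greedy learner would pick income.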

  • Question 63:

    Consider these itemsets:

    (hat, scarf, coat)

    (hat, scarf, coat, gloves)

    (hat, scarf, gloves)

    (hat, gloves)

    (scarf, coat, gloves)

    What is the confidence of the rule (gloves -> hat)?

    A. 75%

    B. 60%

    C. 66%

    D. 80%
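
    The confidence of a rule X -> Y is the number of itemsets containing both X and Y divided by the number of itemsets containing X. A minimal sketch of that arithmetic on the five itemsets listed in the question (the function name confidence is an assumption for illustration):

```python
# The five itemsets from the question.
itemsets = [
    {"hat", "scarf", "coat"},
    {"hat", "scarf", "coat", "gloves"},
    {"hat", "scarf", "gloves"},
    {"hat", "gloves"},
    {"scarf", "coat", "gloves"},
]

def confidence(transactions, antecedent, consequent):
    """Fraction of transactions containing the antecedent that also contain the consequent."""
    with_antecedent = [t for t in transactions if antecedent <= t]
    with_both = [t for t in with_antecedent if consequent <= t]
    return len(with_both) / len(with_antecedent)

conf = confidence(itemsets, {"gloves"}, {"hat"})
print(f"{conf:.0%}")
```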

  • Question 64:

    A call center for a large electronics company handles an average of 35,000 support calls a day. The head of the call center would like to optimize the staffing of the call center during the rollout of a new product due to recent customer complaints of long wait times. You have been asked to create a model to optimize call center costs and customer wait times.

    The goals for this project include:

    1. Relative to the release of a product, how does the call volume change over time?

    2. How to best optimize staffing based on the call volume for the newly released product, relative to old products.

    3. Historically, what time of day does the call center need to be most heavily staffed?

    4. Determine the frequency of calls by both product type and customer language.

    Which goals are suitable to be completed with MapReduce?

    A. Goals 2 and 4

    B. Goals 1 and 3

    C. Goals 1, 2, 3, 4

    D. Goals 2, 3, 4
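
    To make the MapReduce pattern concrete, here is a minimal single-process sketch of a frequency count by product type and customer language, as in goal 4. The record layout, the field names, and the helper names map_call, shuffle, and reduce_counts are all assumptions for illustration; a real job would run on a framework such as Hadoop rather than in one Python process.

```python
from collections import defaultdict
from itertools import chain

# Hypothetical call records; field names are assumptions for illustration.
calls = [
    {"product": "phone", "language": "en"},
    {"product": "phone", "language": "es"},
    {"product": "tablet", "language": "en"},
    {"product": "phone", "language": "en"},
]

def map_call(record):
    """Map step: emit one ((product, language), 1) pair per call."""
    yield (record["product"], record["language"]), 1

def shuffle(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_counts(key, values):
    """Reduce step: sum the counts for one key."""
    return key, sum(values)

mapped = chain.from_iterable(map_call(c) for c in calls)
frequencies = dict(reduce_counts(k, v) for k, v in shuffle(mapped).items())
print(frequencies)
```

    Each call is processed independently in the map step and the per-key sums are independent in the reduce step, which is what lets this kind of aggregation be spread across many machines.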

  • Question 65:

    In R, functions like plot() and hist() are known as what?

    A. generic functions

    B. virtual methods

    C. virtual functions

    D. generic methods

  • Question 66:

    Which data asset is an example of quasi-structured data?

    A. Webserver log

    B. XML data file

    C. Database table

    D. News article

  • Question 67:

    What is the reason for using LOESS?

    A. Fits a smoothed curve to scatterplot data; providing a general idea of the data's behavior

    B. Significance test for the correlation between two variables

    C. Plots a continuous variable versus a discrete variable; comparing distributions across classes

    D. Runs after a one-way ANOVA; determining which population has the highest mean value

  • Question 68:

    In a Student's t-test, what is the meaning of the p-value?

    A. It is the area under the appropriate tails of the Student's distribution

    B. It is the "power" of the Student's t-test

    C. It is the mean of the distribution for the null hypothesis

    D. It is the mean of the distribution for the alternate hypothesis
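
    As a numerical illustration of how a p-value relates to the t distribution, the sketch below integrates the Student's t density with Simpson's rule and reports the area in both tails beyond an observed statistic. The function names and the example inputs (t = 2.228 with 10 degrees of freedom) are assumptions chosen for illustration; in practice a statistics library would normally be used instead.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def two_sided_p_value(t_stat, df, steps=2000):
    """Area in both tails beyond |t_stat|, via Simpson's rule on [0, |t_stat|].

    steps must be even for Simpson's rule.
    """
    upper = abs(t_stat)
    h = upper / steps
    total = t_pdf(0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_half = total * h / 3      # area between 0 and |t_stat|
    return 2 * (0.5 - central_half)   # the density is symmetric, so double one tail

p = two_sided_p_value(2.228, df=10)
print(round(p, 3))
```

    The value 2.228 is the familiar two-sided 5% critical value for 10 degrees of freedom, so the computed tail area should come out close to 0.05.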

  • Question 69:

    How are window functions different from regular aggregate functions?

    A. Rows retain their separate identities and the window function can access more than the current row.

    B. Rows are grouped into an output row and the window function can access more than the current row.

    C. Rows retain their separate identities and the window function can only access the current row.

    D. Rows are grouped into an output row and the window function can only access the current row.
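
    The contrast between the two kinds of function can be seen by running both on the same table. This is a minimal sketch using Python's built-in sqlite3 module; the sales table and its rows are hypothetical, and the window query assumes the bundled SQLite is version 3.25 or newer, which is when window functions were added.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 10), ("east", 20), ("west", 5)],  # hypothetical rows
)

# Regular aggregate: rows are collapsed into one output row per group.
aggregated = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()

# Window function: every input row is kept, and each row's result is
# computed over all rows in its partition, not just the current row.
windowed = conn.execute(
    "SELECT region, amount, SUM(amount) OVER (PARTITION BY region) "
    "FROM sales ORDER BY region, amount"
).fetchall()

print(aggregated)
print(windowed)
conn.close()
```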

  • Question 70:

    Which graphical representation shows the distribution and multiple summary statistics of a continuous variable for each value of a corresponding discrete variable?

    A. box and whisker plot

    B. dotplot

    C. scatterplot

    D. binplot
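
    For reference, a box-and-whisker plot is typically drawn from a five-number summary (minimum, lower quartile, median, upper quartile, maximum) computed separately for each level of the discrete variable. The sketch below computes those summaries with the standard library; the data and the function name five_number_summary are assumptions for illustration, and plotting tools may use slightly different quartile conventions or treat outliers separately.

```python
import statistics

# Hypothetical data: continuous values for each level of a discrete variable.
data = {
    "A": [1, 2, 3, 4, 5, 6, 7],
    "B": [10, 20, 30, 40, 50],
}

def five_number_summary(values):
    """Minimum, lower quartile, median, upper quartile, and maximum."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)

summaries = {group: five_number_summary(vals) for group, vals in data.items()}
for group, summary in summaries.items():
    print(group, summary)
```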
