Question 6 of 59
A decision tree is being built. An internal node is being evaluated for partitioning on variables A and B. The entropy of the internal node is 0.8. The entropy for each of the variables is as follows:

Variable A: 0.5 -

Variable B: 0.4 -
Which variable will be used to partition the data and what is the information gain?
Correct Answer: B

Question 7 of 59
In association rules, given items X and Y, what does lift measure?
Correct Answer: D

Question 8 of 59
What are categorized as cluster and workflow management tools for Hadoop?
Correct Answer: D

Question 9 of 59
Consider the following text:
“Stop!” he shouted. “Don’t go there!”
What set of words result from using a tokenizer for punctuation on the text?
Correct Answer: D

Question 10 of 59
Which phase of the data analytic lifecycle includes conducting project sponsor interviews and drafting a problem statement?
Correct Answer: D