Data Science MCQs focuses on “Caret”.
1. Which of the following function is a wrapper for different lattice plots to visualize the data?
a) levelplot
b) featurePlot
c) plotsample
d) none of the mentioned
Answer: b
Explanation: featurePlot is used for data visualization in caret.
2. Point out the wrong statement.
a) In every situation, the data generating mechanism can create predictors that only have a single unique value
b) Predictors might have only a handful of unique values that occur with very low frequencies
c) The function findLinearCombos uses the QR decomposition of a matrix to enumerate sets of linear combinations
d) All of the mentioned
Answer: a
Explanation: In some situations, the data generating mechanism can create predictors that only have a single unique value.
3. Which of the following function can be used to identify near zero-variance variables?
a) zeroVar
b) nearVar
c) nearZeroVar
d) all of the mentioned
Answer: c
Explanation: The saveMetrics argument can be used to show the details and usually defaults to FALSE.
4. Which of the following function can be used to flag predictors for removal?
a) searchCorrelation
b) findCausation
c) findCorrelation
d) none of the mentioned
Answer: c
Explanation: Some models thrive on correlated predictors.
5. Point out the correct statement.
a) findLinearColumns will also return a vector of column positions can be removed to eliminate the linear dependencies
b) findLinearCombos will return a list that enumerates dependencies
c) the function findLinearRows can be used to generate a complete set of row variables from one factor
d) none of the mentioned
Answer: b
Explanation: For each linear combination, it will incrementally remove columns from the matrix and test to see if the dependencies have been resolved.
6. Which of the following can be used to impute data sets based only on information in the training set?
a) postProcess
b) preProcess
c) process
d) all of the mentioned
Answer: b
Explanation: This can be done with K-nearest neighbors.
7. The function preProcess estimates the required parameters for each operation.
a) True
b) False
Answer: a
Explanation: predict.preProcess is used to apply them to specific data sets.
8. Which of the following can also be used to find new variables that are linear combinations of the original set with independent components?
a) ICA
b) SCA
c) PCA
d) None of the mentioned
Answer: a
Explanation: ICA stands for independent component analysis.
9. Which of the following function is used to generate the class distances?
a) preprocess.classDist
b) predict.classDist
c) predict.classDistance
d) all of the mentioned
Answer: b
Explanation: By default, the distances are logged.
10. The preProcess class can be used for many operations on predictors.
a) True
b) False
Answer: a
Explanation: Operations include centering and scaling.