Data Science Interview Questions and Answers for experienced focuses on “Reading from Web and API”.
1. Which of the following package is used to connect MySQL RDBMS with R?
a) RMySQL vignette
b) MySQL vignette
c) RSQL vignette
d) None of the mentioned
Answer: a
Explanation: This package contains meta information and index.
2. Point out the correct statement.
a) HDF5 is a hierarchical format
b) HDF5 does not support range of different data types
c) HDF5 is used for storing small datasets
d) None of the mentioned
Answer: a
Explanation: HDF5 is used for storing large datasets.
3. Which of the following is used to extract data from HTML code of websites?
a) Webscraping
b) Webdredging
c) Webcleaning
d) All of the mentioned
Answer: a
Explanation: Webscraping is a great way to get data.
4. Which of the following function is used to read data off the webpages?
a) read.web
b) read.Lines
c) read.Line
d) all of the mentioned
Answer: b
Explanation: read.Lines function will extract the web page data.
5. Point out the wrong statement.
a) hdf5 can be used to reading/writing from disc in Python
b) rhdf5 is an interface for hdf5 format
c) maximum size of an HDF5 dataset is fixed when it is created
d) all of the mentioned
Answer: b
Explanation: hdf5 can be used to reading/writing from disc in R.
6. Which of the following package is used for reading HTML and XML data?
a) httr
b) http
c) httx
d) all of the mentioned
Answer: a
Explanation: httr contains tools for Working with URLs and HTTP.
7. httr package does not work well with facebook and twitter API.
a) True
b) False
Answer: b
Explanation: Most modern APIs use something like oauth.
8. Which of the following request can be issued from httr package?
a) GET
b) PUT
c) DELETE
d) All of the mentioned
Answer: d
Explanation: Authentication is necessary for issuing a request.
9. Which of the following package loads data from SPSS?
a) read.spss(SPSS)
b) read.oct(SPSS)
c) read.xpot(SPSS)
d) all of the mentioned
Answer: a
Explanation: SPSS is a comprehensive and flexible statistical analysis and data management solution.
10. Which of the following package is used for reading GIS data?
a) rdgal
b) rgeos
c) raster
d) all of the mentioned
Answer: d
Explanation: A geographic information system is a system designed to capture, store, manipulate, analyze, manage, and present all types of spatial or geographical data.