0 of 88 questions completed
You have already completed the quiz before. Hence you can not start it again.
Quiz is loading…
You must sign in or sign up to start the quiz.
You must first complete the following:
0 of 88 questions answered correctly
Time has elapsed
You have reached 0 of 0 point(s), (0)
Earned Point(s): 0 of 0, (0)
0 Essay(s) Pending (Possible Point(s): 0)
In a future society, a machine is used to predict a crime before it occurs. If you were responsible for tuning this machine, what evaluation metric would you want to maximize to ensure no innocent people (people not about to commit a crime) are imprisoned (where crime is the positive label)?
Consider the machine from the previous question. If you were
responsible for tuning this machine, what evaluation metric
would you want to maximize to ensure all criminals (people
about to commit a crime) are imprisoned (where crime is the
After the data are appropriately processed, transformed, and stored, what is a good starting point for data mining?
The ultimate purpose of analytics is to communicate findings to stakeholders to formulate policy or
Select the correct statement.
Select the correct statement about the Data Preparation stage
Select the correct statement about what data scientists and database administrators (DBAs) do during the Data Preparation stage
Select the correct statement about the Data Preparation stage of the data science methodology
The Data Requirements stage of the data science methodology involves identifying the necessary data content, formats and sources for initial data collection
In the Data Collection stage, techniques such as descriptive statistics and visualization can be applied to the data set, to assess the content, quality, and initial insights about the data
Select the correct statement about the Feedback stage of the data science methodology
Select the correct sentence about the data science methodology explained in the course
What do data scientists typically use for exploratory analysis of data and to get acquainted with it?
Data scientists may frequently return to a previous stage to make adjustments, as they learn more about the data and the modelling
Data scientists may use either a “top-down” approach or a “bottom-up” approach to data science. These two approaches refer to:
what is the type of the following: 1.0
What is the result of the following code segment: type(int(12.3))
What is the result of the following code segment: int(3.99}
What is the result of the following code segment: int(True)
What is the type of the variable x after the following: x=2/2
In Python, if you executed name= ‘Lizz’, what would be the output of print(name[0:2])?
In Python, what is the result of the following operation: ‘1’+’2′
Consider the string Name=”ABCOE“, what is the result of the following operation Name.find(“B”)
what is the result of the following: “ABC”.replace(“AB”, “ab”)
What is the syntax to obtain the first element of the tuple:
Consider the tuple A=((11, 12),[21,22]), that contains a tuple and list. What is the result of the following operation A:
Consider the tuple A=((11,12),[21,22]), that contains a tuple and list. What is the result of the following operation A:
What is the result of the following operation: ‘A,B,C,D’.split(‘,‘)
Consider the following list: A=[“hard rock”,10,1.2]
what will list A contain after the following command is run: del(A[O])
What is the result of the following: len((“disco”,10))?
Matplotlib was created by
The Backend, Artist, and Scripting Layers are the three layers that make up the Matplotlib architecture.
What is a Python library?
What is the primary instrument or data-structure in Pandas?
What task does the following lines of code perform?
1 path=‘C:\Windows\_\ automobile.csv’
Consider the following lines of code, what is the name of the column that contains the target values:
How would you cast each element in the column “price” to an integer?
What task do the following lines of code perform:
with open(‘Example2.txt’,‘r’) as readfile:
with open(‘Example3.txt’,‘w’) as writefile:
for line in readfile:
How would you rename column name from “highway-mpg” to “highway-L/100km”
The following code is an example of:
df[‘length’] = df[‘length’]/df[‘length’].max()
select the appropriate table for the following line of code
Stamen Terrain, Stamen Toner, and Map box Bright, are three tile styles of Folium maps.
If you are interested in generating a map of Spain to visualize its hill shading and natural vegetation, which of the following lines of code will create the right map for you?
A choropleth map is a thematic map in which areas are shaded or patterned in proportion to the measurement of the statistical variable being displayed on the map
Which of the choices below will create the following regression line plot, given a pandasdataframe,
A regression plot is a great way to visualize data in relation to a whole, or to highlight progress against a given threshold.
A word cloud is a depiction of the meaningful words in some textual data, where the more a specific
word appears in the text, bigger and bolder it appears in the word cloud.
Bar charts are less confusing than pie charts and should be your first attempt when creating a visual to explore a dataset.
What is the correct combination of function and parameter to create a box plot in Matplotlib?
A bubble plot is a variation of the scatter plot that displays three dimensions of data
Area plots are unstacked by default.
The following code uses the artist layer to create a stacked area plot of the data in the pandas
The following code will create a horizontal bar chart of the data in a pandasdataframe, question
what model would have the lowest training error
What is the maximum value of R^2 that can be obtained
“Regression” technique in Machine Learning is a group of algorithms that are used for:
Select the correct function to calculate the mean squared error between yhat and y
Consider the following equation:
what is the parameter b_O (b subscript 0)
Consider the dataframe df, what method provides the summary statistics?
If we have 10 columns and 100 samples how large is the output of df.corr()
What is the Pearson correlation between variables X and Y, if X=Y:
Supervised learning deals with unlabeled data, while unsupervised learning deals with labelled data.
Multiple Linear Regression is proper for:
Which one IS NOT a sample of classification problem?
Which one is TRUE about kNN algorithm?
Which of the following is an application of clustering?
What is the meaning of “Cold start” in collaborative filtering?
Choose correct statements about convolutional layer:
Which of the following is an example of clustering?
Modeling the features of an unlabeled dataset to find hidden structure is known as
Training a model using categorically labelled data to predict labels for new data is known as —-
Training a model using labelled data where the labels are continuous quantities to predict labels for new data is known as
The key purpose of splitting the dataset into training and test sets is:
Data Science is a
Which bracketing style does Python use for tuples?
String slicing is
A complex model with many parameters is most likely to suffer from:
A simple model with few parameters is most likely to suffer from:
A model with many parameters that fits training data very well but does poorly on test data is considered to be
Consider the following dataset:
Let us train a decision tree with this data. Let’s call this tree T1. What feature will we split on at the root?
It’s always better to remove missing data points (i.e., rows) as opposed to removing missing features (i.e., columns).
Which of the following is true for decision trees?
Which of the following is NOT an ensemble method?
If you have a few images of different types of plankton labeled with their species name, what would you expect to perform better predictions:
If you have lots of images of different types of plankton labeled with their species name, and lots of computational resources, what would you expect to perform better predictions:
Among the three points, which two are closest to each other in terms of having
the smallest Euclidean distance?
Among the three points, which two are closest to each other in terms of having
the largest cosine similarity (or equivalently, smallest cosine distance)?
For positive features, cosine similarity is always between 0 and 1