Data100 Final Prep

Assessment

Quiz

Other

University

Medium

Created by Devan Becker

Used 375+ times

18 questions

1.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

What is the target variable?

The thing we're trying to predict

The values of the y-variable in the test set.

The x-variables

🎯

2.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

What's the difference between a validation set and the test set?

The validation set is used to train the model.

Only the validation set is used to compare models.

The validation set is used many times, test set is used once.
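
For reference, a minimal sketch of how the three sets might be produced. The function name train_val_test_split, the 60/20/20 proportions, and the seed are assumptions made for illustration; they are not taken from the course material.

```python
import random

def train_val_test_split(rows, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle the rows once, then cut them into three disjoint lists."""
    rng = random.Random(seed)          # fixed seed so the split is reproducible
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_frac)
    n_val = int(len(shuffled) * val_frac)
    test = shuffled[:n_test]                    # set aside, scored at the very end
    val = shuffled[n_test:n_test + n_val]       # reused while comparing models
    train = shuffled[n_test + n_val:]           # the only rows models are fit on
    return train, val, test

rows = list(range(100))                # stand-in for 100 observations
train, val, test = train_val_test_split(rows)
print(len(train), len(val), len(test))
```

The three lists never share a row, which is what lets the validation and test scores act as out-of-sample checks.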

3.

MULTIPLE CHOICE QUESTION

30 sec • 1 pt

Which of the sets do you not use to compare RMSE across models?

Training

Validation

Testing

4.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

Which is not an assumption that we make with the train-val-test approach?

Our sample "looks like" the population

We are likely going to collect more data at some point

The model we fit to the training set is a perfect representation of the population.

5.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

We use the validation set to:

Find the best model parameters

Find the model parameters with the best out-of-sample predictions

Find the modelling approach that leads to the best predictions
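
As an illustration of the workflow this question is about, here is a small sketch that fits two candidate models on the training rows, compares them by validation RMSE, and scores only the chosen one on the test rows. The simulated data, the two candidates (mean_only and straight_line), and the split sizes are all assumptions chosen for the example.

```python
import math
import random

def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def fit_mean(xs, ys):
    """Candidate 1: always predict the training mean of y."""
    mean_y = sum(ys) / len(ys)
    return lambda x: mean_y

def fit_line(xs, ys):
    """Candidate 2: simple least-squares line y = a + b * x."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    intercept = my - slope * mx
    return lambda x: intercept + slope * x

# Simulated data (an assumption for the example): y = 1 + 2x + noise.
rng = random.Random(1)
data = []
for _ in range(300):
    x = rng.uniform(0, 10)
    data.append((x, 1.0 + 2.0 * x + rng.gauss(0, 1)))

train, val, test = data[:180], data[180:240], data[240:]

def score(model, rows):
    return rmse([y for _, y in rows], [model(x) for x, _ in rows])

# Fit every candidate on the training rows only.
candidates = {"mean_only": fit_mean, "straight_line": fit_line}
fitted = {name: fit([x for x, _ in train], [y for _, y in train])
          for name, fit in candidates.items()}

# Compare candidates on the validation rows, then touch the test rows once.
val_rmse = {name: score(model, val) for name, model in fitted.items()}
best = min(val_rmse, key=val_rmse.get)
test_rmse = score(fitted[best], test)
print(best, round(val_rmse[best], 2), round(test_rmse, 2))
```

The test rows play no part in choosing between candidates; they are only used for the final score of the selected approach.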

6.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

What is information leakage?

When any information from the test set is used in training.

When confidential information is leaked to the public.

A reason to call an information plumber.

When there is some information that was lost.
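
One common way leakage creeps in is preprocessing. The sketch below, with made-up numbers and hypothetical helper names (fit_scaler, transform), contrasts a standardisation step whose mean and standard deviation are computed from the training rows only with one that also peeks at the test rows.

```python
import statistics

train_x = [1.0, 2.0, 3.0, 4.0, 5.0]   # made-up training feature values
test_x = [50.0, 60.0]                 # made-up test feature values

def fit_scaler(values):
    """Return the (mean, standard deviation) used to standardise a feature."""
    return statistics.mean(values), statistics.pstdev(values)

def transform(values, scaler):
    mean, sd = scaler
    return [(v - mean) / sd for v in values]

# Leaky: the scaler's statistics depend on the test rows.
leaky_scaler = fit_scaler(train_x + test_x)

# Clean: statistics come from the training rows and are reused for the test rows.
clean_scaler = fit_scaler(train_x)
train_scaled = transform(train_x, clean_scaler)
test_scaled = transform(test_x, clean_scaler)

print("leaky mean:", round(leaky_scaler[0], 2))
print("clean mean:", clean_scaler[0])
```

In the leaky version the training features change depending on what is in the test set, so the test score no longer reflects data the model has never seen.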

7.

MULTIPLE CHOICE QUESTION

1 min • 1 pt

We suspect that there might be an interaction between two continuous features x1 and x2. What type of plot should we make?

A scatterplot of y versus x1, coloured by the levels of x2

A scatterplot of y versus x1 * x2

A scatterplot of x1 versus x2

A boxplot of y versus x1 and x2
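
A numeric companion to this question, offered as a sketch only: with an interaction, the slope of y against x1 changes with the level of x2. The code simulates data with an x1 * x2 term (the coefficients and noise level are assumptions), splits the rows at the median of x2, and fits a separate line in each half.

```python
import random

rng = random.Random(0)
points = []
for _ in range(400):
    x1 = rng.uniform(0, 1)
    x2 = rng.uniform(0, 1)
    # Assumed data-generating process with an interaction term 3 * x1 * x2.
    y = 1.0 + x1 + 3.0 * x1 * x2 + rng.gauss(0, 0.1)
    points.append((x1, x2, y))

def slope(pairs):
    """Least-squares slope of y on x for a list of (x, y) pairs."""
    mx = sum(x for x, _ in pairs) / len(pairs)
    my = sum(y for _, y in pairs) / len(pairs)
    num = sum((x - mx) * (y - my) for x, y in pairs)
    den = sum((x - mx) ** 2 for x, _ in pairs)
    return num / den

# Split the rows into "low x2" and "high x2" groups at the median of x2.
cut = sorted(x2 for _, x2, _ in points)[len(points) // 2]
low = [(x1, y) for x1, x2, y in points if x2 < cut]
high = [(x1, y) for x1, x2, y in points if x2 >= cut]

slope_low = slope(low)
slope_high = slope(high)
print(round(slope_low, 2), round(slope_high, 2))
```

Clearly different slopes in the two groups are the numeric counterpart of non-parallel trends in a plot of y against x1 with the points grouped by x2.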
