Question 1

Fill in the blank: The best fit line is the line that fits the data best by minimizing some _____.

Accepted Answer

loss function (CORRECT)

Question 2

What is the sum of the squared differences between each observed value and the associated predicted value?

Accepted Answer

Sum of squared residuals (CORRECT)
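
As a small worked illustration of this answer, the sketch below computes the sum of squared residuals for a handful of points. The numbers and the function name `sum_squared_residuals` are hypothetical and chosen only to make the arithmetic easy to follow.

```python
# Hypothetical observed values and model predictions (illustrative numbers only).
observed = [3.0, 5.0, 7.5, 9.0]
predicted = [2.5, 5.5, 7.0, 9.5]


def sum_squared_residuals(y, y_hat):
    """Add up (observed - predicted)^2 over all data points."""
    return sum((yi - yhi) ** 2 for yi, yhi in zip(y, y_hat))


ssr = sum_squared_residuals(observed, predicted)
print(ssr)  # 1.0, since each residual is +/-0.5 and 4 * 0.25 = 1.0
```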

Question 3

What does the circumflex symbol, or "hat" (^), indicate when used over a coefficient?

Accepted Answer

The coefficient is an estimate or predicted value (CORRECT)

Question 4

How does a data professional determine if a linearity assumption is met?

Accepted Answer

They confirm whether data on the X-Y coordinate falls along a straight line. (CORRECT)

Question 5

Which of the following statements accurately describes the normality assumption?

Accepted Answer

The normality assumption can only be confirmed after a model is built. (CORRECT)

Question 6

A data professional is using a scatterplot to plot residuals and predicted values from a regression model to check for homoscedasticity. What does this scenario represent?

Accepted Answer

Random cloud (CORRECT)

Question 7

What type of visualization uses a series of scatterplots that show the relationships between pairs of variables?

Accepted Answer

Scatterplot matrix (CORRECT)

Question 8

What is the area surrounding a regression line, which describes the uncertainty around the predicted outcome at every value of X?

Accepted Answer

Confidence band (CORRECT)
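
The quiz does not give a formula for the confidence band. Under the usual simple linear regression assumptions, one common textbook form of the pointwise band for the mean response at a given x is sketched below. The symbols (n data points, significance level α, residual standard error s) are assumptions introduced here for illustration.

```latex
\hat{y}(x) \;\pm\; t_{1-\alpha/2,\;n-2}\; s\,
\sqrt{\frac{1}{n} + \frac{(x-\bar{x})^{2}}{\sum_{i=1}^{n}(x_i-\bar{x})^{2}}},
\qquad
s = \sqrt{\frac{\sum_{i=1}^{n}(y_i-\hat{y}_i)^{2}}{n-2}}
```

The square-root term grows as x moves away from the mean of the observed X values, which is why the band is typically narrowest near the center of the data and wider toward the edges.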

Question 9

Fill in the blank: R squared measures the _____ in the dependent variable, Y, that is explained by the independent variable, X.

Accepted Answer

proportion of variation (CORRECT)
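
To make "proportion of variation" concrete, here is a minimal sketch that computes R squared as one minus the ratio of the residual sum of squares to the total sum of squares. The data and the predictions (taken from a hypothetical fitted line 2.2 + 0.6x for x = 1 to 5) are illustrative assumptions, not course data.

```python
def r_squared(y, y_hat):
    """R^2 = 1 - (sum of squared residuals) / (total sum of squares)."""
    y_mean = sum(y) / len(y)
    ss_res = sum((yi - yhi) ** 2 for yi, yhi in zip(y, y_hat))
    ss_tot = sum((yi - y_mean) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


# Hypothetical observed values and predictions from the line 2.2 + 0.6x, x = 1..5.
y = [2.0, 4.0, 5.0, 4.0, 5.0]
y_hat = [2.8, 3.4, 4.0, 4.6, 5.2]

r2 = r_squared(y, y_hat)
print(round(r2, 3))  # 0.6, i.e. about 60% of the variation in Y is explained
```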

Question 10

Which linear regression evaluation metric is sensitive to large errors?

Accepted Answer

Mean squared error (MSE) (CORRECT)
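
The sketch below illustrates why MSE is described as sensitive to large errors: squaring magnifies a single large residual far more than taking its absolute value does. Mean absolute error (MAE) is used here only as a point of comparison, and the error values are hypothetical.

```python
def mse(errors):
    """Mean squared error: average of squared residuals."""
    return sum(e ** 2 for e in errors) / len(errors)


def mae(errors):
    """Mean absolute error: average of absolute residuals."""
    return sum(abs(e) for e in errors) / len(errors)


# Hypothetical residuals: the second set replaces one small error with a large one.
small_errors = [1.0, -1.0, 1.0, -1.0]
with_outlier = [1.0, -1.0, 1.0, -10.0]

print(mse(small_errors), mae(small_errors))  # 1.0 1.0
print(mse(with_outlier), mae(with_outlier))  # 25.75 3.25
```

A single large error raises MAE from 1.0 to 3.25 but raises MSE from 1.0 to 25.75.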

Question 11

Which of the following are best practices when communicating linear regression results? Select all that apply.

Accepted Answer

Make the findings quickly understood without technical terms. Provide measures of uncertainty around estimated results. Use data visualizations to present the results. (CORRECT)

Question 12

Which of the following statements accurately describe coefficients and p-values for regression model interpretation? Select all that apply.

Accepted Answer

Coefficients determine how changes in the independent variables are associated with changes in the dependent variable. P-values demonstrate whether coefficients are statistically significant. (CORRECT)

Question 13

A data professional determines the best fit line by calculating the difference between observed values and the predicted value of a regression line. What is this calculation?

Accepted Answer

Residual (CORRECT)

Question 14

In linear regression, what mathematical technique is used to calculate the best fit line?

Accepted Answer

Ordinary least squares (CORRECT)
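
For simple linear regression, ordinary least squares has a closed-form solution for the intercept and slope estimates (beta zero hat and beta one hat). The sketch below applies that formula to a small hypothetical dataset. The function name `fit_ols` and the data are assumptions made for illustration; in practice a library such as statsmodels or scikit-learn would normally do this.

```python
def fit_ols(x, y):
    """Return (beta0_hat, beta1_hat) that minimize the sum of squared residuals."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    # Slope: co-variation of x and y divided by the variation of x.
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    beta1_hat = sxy / sxx
    # Intercept: forces the line through the point (x_mean, y_mean).
    beta0_hat = y_mean - beta1_hat * x_mean
    return beta0_hat, beta1_hat


# Hypothetical data points.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

beta0_hat, beta1_hat = fit_ols(x, y)
print(round(beta0_hat, 3), round(beta1_hat, 3))  # 2.2 0.6
```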

Question 15

A data professional testing for linear regression assumptions plots their dependent variable against their independent variable and notices that the graph appears as a repeating waveform. Which model assumption does this invalidate?

Accepted Answer

Linearity (CORRECT)

Question 16

Fill in the blank: A scatterplot matrix is a series of scatterplots that show the _____ between pairs of variables.

Accepted Answer

relationships (CORRECT)

Question 17

A data professional at a toy manufacturer checks model assumptions while working on a project about potential new game concepts. They find no clear pattern in their scatterplot and can confirm constant variance along the values of the dependent variable. What does this scenario describe?

Accepted Answer

Homoscedasticity (CORRECT)

Question 18

Fill in the blank: A confidence band is the area surrounding a line that describes the uncertainty around the predicted outcome at every value of _____.

Accepted Answer

X (CORRECT)

Question 19

What is another term for R squared?

Accepted Answer

Coefficient of determination (CORRECT)

Question 20

Which of the following statements accurately describe running a randomized, controlled experiment? Select all that apply.

Accepted Answer

The differences between the control and treatment groups must be observable and measurable. To be successful, data professionals must control for every factor in the experiment. It is typically used when arguing for causation between variables. (CORRECT)

Question 21

Fill in the blank: _____ is the difference between observed values and the predicted values of a regression line.

Accepted Answer

Residual (CORRECT)

Question 22

A data professional minimizes the sum of squared residuals to estimate parameters in a linear regression model. What method are they using?

Accepted Answer

Ordinary least squares (CORRECT)

Question 23

A data analytics professional working for a storage facility checks model assumptions while determining optimal storage space sizes. They notice that the model's residuals appear in a cone-shaped pattern when plotted against the independent variable. Which model assumption does this invalidate?

Accepted Answer

Homoscedasticity (CORRECT)
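
A cone shape means the spread of the residuals changes as X changes. The sketch below is an informal numeric check, not a formal statistical test: it compares the variance of the residuals in the lower half of the X values with the variance in the upper half. The function name `spread_ratio` and the data are hypothetical. A ratio near 1 is consistent with constant variance, while a ratio far from 1 suggests a cone-like pattern worth inspecting in a residual plot.

```python
from statistics import pvariance


def spread_ratio(x, residuals):
    """Variance of residuals for the upper half of x divided by the lower half."""
    # Order the residuals by their x value, then split into two halves.
    ordered = [r for _, r in sorted(zip(x, residuals))]
    half = len(ordered) // 2
    low, high = ordered[:half], ordered[-half:]
    return pvariance(high) / pvariance(low)


# Hypothetical residuals that fan out as x increases (cone shape).
x = [1, 2, 3, 4, 5, 6, 7, 8]
cone_residuals = [0.1, -0.1, 0.2, -0.2, 1.0, -1.0, 2.0, -2.0]

# Hypothetical residuals with roughly constant spread (random cloud).
even_residuals = [0.5, -0.5, 0.4, -0.4, 0.5, -0.5, 0.4, -0.4]

ratio_cone = spread_ratio(x, cone_residuals)
ratio_even = spread_ratio(x, even_residuals)
print(ratio_cone > 10, abs(ratio_even - 1) < 0.01)
```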

Question 24

A data professional determines how much of the variation in the X variable explains the variation in the Y variable. Which model evaluation metric enables this determination?

Accepted Answer

R squared (CORRECT)

Question 25

Fill in the blank: A scatterplot _____ is a series of scatterplots that show the relationships between pairs of variables.

Accepted Answer

matrix (CORRECT)

Question 26

Which of the following statements accurately describe a randomized, controlled experiment? Select all that apply.

Accepted Answer

The differences between the control and treatment groups must be observable and measurable. It is a study design that randomly assigns participants into an experimental group or a control group. To be successful, data professionals must control for every factor in the experiment. (CORRECT)

Question 27

In linear regression, what mathematical technique is used to calculate beta zero hat and beta one hat?

Accepted Answer

Ordinary least squares (CORRECT)

Question 28

Fill in the blank: A scatterplot matrix is a series of scatterplots that show the relationships between pairs of _____.

Accepted Answer

variables (CORRECT)

Question 29

What is the difference between observed or actual values and the predicted values of a regression line?

Accepted Answer

Residual (CORRECT)

Question 30

Fill in the blank: A _____ is the area surrounding a line that describes the uncertainty around the predicted outcome at every value of X.

Accepted Answer

confidence band (CORRECT)

Question 31

What measures the proportion of variation in the dependent variable Y explained by the independent variable X?

Accepted Answer

R squared (CORRECT)

Question 32

Fill in the blank: A scatterplot _____ is a series of scatterplots that show the relationships between pairs of variables.

Accepted Answer

matrix (CORRECT)

Question 33

Fill in the blank: A _____ is the area surrounding a line that describes the uncertainty around the predicted outcome at every value of X.

Accepted Answer

confidence band (CORRECT)

Question 34

Fill in the blank: A confidence band is the area surrounding a line that describes the _____ around the predicted outcome at every value of X.

Accepted Answer

uncertainty (CORRECT)

Question 35

What term describes the difference between observed or actual values and the predicted values of the regression line?

Accepted Answer

Residuals (CORRECT)

Question 36

There are four assumptions of simple linear regression, including linearity, normality, and independent observations. What is the fourth assumption?

Accepted Answer

Homoscedasticity (CORRECT)

Question 37

In a linear regression model, what is the area surrounding the regression line that describes the uncertainty around the predicted outcome at every value of X?

Accepted Answer

Confidence band (CORRECT)

COURSE 5: REGRESSION ANALYSIS: SIMPLIFY COMPLEX DATA RELATIONSHIPS

Module 2: Simple Linear Regression

GOOGLE ADVANCED DATA ANALYTICS PROFESSIONAL CERTIFICATE

Complete Coursera Study Guide

TABLE OF CONTENTS

INTRODUCTION – Simple Linear Regression

Learning Objectives

PRACTICE QUIZ: TEST YOUR KNOWLEDGE: FOUNDATIONS OF LINEAR REGRESSION (Questions 1 to 3)

PRACTICE QUIZ: TEST YOUR KNOWLEDGE: ASSUMPTIONS AND CONSTRUCTION IN PYTHON (Questions 4 to 7)

PRACTICE QUIZ: TEST YOUR KNOWLEDGE: EVALUATE A LINEAR REGRESSION MODEL (Questions 8 to 10)

PRACTICE QUIZ: TEST YOUR KNOWLEDGE: INTERPRET LINEAR REGRESSION RESULTS (Questions 11 and 12)

QUIZ: MODULE 2 CHALLENGE (Questions 13 to 37)

CONCLUSION – Simple Linear Regression
