Verification and reporting come directly before the data-cleaning process

Course 4 – Process Data from Dirty to Clean Quiz Answers

Week 4: Verify and Report on your Cleaning Results

GOOGLE DATA ANALYTICS PROFESSIONAL CERTIFICATION

Complete Study Guide

Verify and Report on your Cleaning Results INTRODUCTION

Verifying and reporting your data cleaning is an important part of Google’s Data Analytics Professional Certificate program on Coursera. In this course, you will learn the processes involved with verifying and reporting the results of your data cleaning, as well as their benefits. Through hands-on exercises, you’ll become familiar with using real-world datasets to practice verifying and reporting the results of your data-cleaning efforts.

You’ll also develop skills in writing effective summaries that enable others to understand better why you performed certain tasks and decisions made throughout the process. Understanding how to verify and report on your cleaning results is key for ensuring that your analysis can be trusted by those who use it later down the line.

Learning Objectives

  • Describe the process involved in verifying the results of cleaning data
  • Describe what is involved in manually cleaning data
  • Discuss the elements and importance of data-cleaning reports
  • Describe the benefits of documenting data cleaning process

Test your knowledge on manual data cleaning

1. Making sure data is properly verified is an important part of the data-cleaning process. Which of the following tasks are involved in this verification? Select all that apply.

  • Considering whether the data is credible and appropriate for the project. (Correct)
  • Manually fixing any errors found in the data. (Correct)
  • Rechecking the data-cleaning effort. (Correct)
  • Asking stakeholders to check and confirm the data is clean.

Correct: The verification process confirms that data cleaning was well executed and the resulting data is accurate and reliable. To verify data, analysts recheck the data-cleaning effort, manually fix errors, and consider whether the data is credible and appropriate for the project.

2. Fill in the blank: To count the total number of spreadsheet values within a specified range, a data analyst uses the _____ function.

  • COUNTA (Correct)
  • SUM
  • WHOLE
  • TOTAL

Correct: To count the total number of spreadsheet values within a specified range, a data analyst uses the COUNTA function.

3. A data analyst is cleaning a dataset with inconsistent formats and repeated cases. They use the TRIM function to remove extra spaces from string variables. What other tools can they use for data cleaning? Select all that apply.

  • Import data
  • Remove duplicates (Correct)
  • Protect sheet
  • Find and replace (Correct)

Correct: The analyst can use TRIM, remove duplicates, and find and replace for data cleaning.

4. To correct a typo in a database column, where should you insert a CASE statement in a query?

  • As an ORDER BY clause
  • As a GROUP BY clause
  • As a SELECT clause (Correct)
  • As a FROM clause

Correct: You should add a CASE statement as a SELECT clause. A CASE statement goes through one or more conditions and returns a value as soon as a condition is met. The typo would be a condition and the correction would be the returned value for the condition.

Test your knowledge on documenting the cleaning process

1. Why is it important for a data analyst to document the evolution of a dataset? Select all that apply.

  • To determine the quality of the data (Correct)
  • To identify best practices in the collection of data
  • To inform other users of changes (Correct)
  • To recover data-cleaning errors (Correct)

Correct: It is important to document the evolution of a dataset in order to recover data-cleaning errors, inform other users of changes, and determine the quality of the data.

2. Fill in the blank: While cleaning data, documentation is used to track _____. Select all that apply.

  • deletions (Correct)
  • errors (Correct)
  • bias
  • changes (Correct)

Correct: While cleaning data, documentation is used to track changes, deletions, and errors.

3. Documenting data-cleaning makes it possible to achieve what goals? Select all that apply.

  • Demonstrate to project stakeholders that you are accountable (Correct)
  • Visualize the results of your data analysis
  • Be transparent about your process (Correct)
  • Keep team members on the same page (Correct)

Correct: Documenting data-cleaning makes it possible to be transparent about your process, keep team members on the same page, and demonstrate to project stakeholders that you are accountable.

GOOGLE DATA ANALYTICS COURSERA ANSWERS AND STUDY GUIDE

Liking our content? Then don’t forget to add us to your bookmarks so you can find us easily!

Weekly Breakdown | Google Study Guides | Back to Top

Process Data from Dirty to Clean Weekly Challenge 4

1. The data collected for an analysis project has just been cleaned. What are the next steps for a data analyst? Select all that apply.

  • Certification
  • Reporting (Correct)
  • Verification (Correct)
  • Validation

Correct: Verification and reporting are the next steps for a data analyst after the data is cleaned.

2. What is the first step in the verification process?

  • Compare cleaned data with the original, uncleaned dataset and compare it to what is there now (Correct)
  • Create a chronological list of modifications made to the data
  • Determine the quality of the data
  • Inform others of your data-cleaning effort

Correct: The first step in the verification process is to compare cleaned data with the original, uncleaned dataset and compare it to what is there now.

3. Fill in the blank: TRIM is a function that removes _____ spaces in data. Select all that apply.

  • Trailing (Correct)
  • Leading (Correct)
  • repeated (Correct)
  • inner

Correct: TRIM is a function that removes leading, trailing, and repeated spaces in data.

4. While verifying cleaned data, a data analyst encounters a misspelled name. Which function can they use to determine if the error is repeated throughout the dataset?

  • CHECK
  • COUNTA (Correct)
  • COUNT
  • CASE

Correct: To determine if the error is repeated throughout the dataset, they can use COUNTA.

5. A WHEN statement considers one or more conditions and returns a value as soon as that condition is met.

  • True
  • False (Correct)

Correct: A CASE statement considers one or more conditions and returns a value as soon as that condition is met.

6. Fill in the blank: Documentation is the process of tracking _____ during data cleaning. Select all that apply.

  • inactivity
  • deletions (Correct)
  • changes (Correct)
  • additions (Correct)

Correct: Documentation is the process of tracking changes, additions, deletions, and errors during data cleaning.

7. Fill in the blank: While cleaning data, a data analyst can use a changelog to keep a chronological list of changes they make. They can refer to it during the _____ period if there are errors or questions.

  • verification (Correct)
  • visualization
  • presenting
  • documentation

Correct: While cleaning data, a data analyst can use a changelog to keep a chronological list of changes they make. They can refer to it during the verification period if there are errors or questions.

8. Reviewing version history is an effective way to view a changelog in SQL.

  • True
  • False (Correct)

Correct: Reviewing version history is an effective way to view a changelog in spreadsheets.

9. Fill in the blank: Once data is clean, a data analyst moves on to _____ and verification.

  • processing
  • publishing
  • reporting (Correct)
  • confirming

Correct: Once data is clean, a data analyst moves on to reporting and verification.

10. A data analyst is in the verification step. They consider the business problem, the goal, and the data involved in their analytics project. What scenario does this describe?

  • Visualizing the data
  • Seeing the big picture (Correct)
  • Reporting on the data
  • Considering the stakeholders

Correct: To see the big picture when verifying data cleaning, consider the business problem, the goal, and the data.

11. Which of the following functions automatically remove extra spaces when cleaning data?

  • SNIP
  • REMOVE
  • CLEAR
  • TRIM (Correct)

Correct: TRIM automatically removes extra spaces when cleaning data.

12. While verifying cleaned data, a data analyst encounters a misspelled name. Which function can they use to determine if the error is repeated throughout the dataset?

  • COUNTA (Correct)
  • COUNT
  • CHECK
  • CASE

Correct: To determine if the error is repeated throughout the dataset, they can use COUNTA.

13. A data analyst uses a changelog while cleaning data. What process does a changelog support?

  • Documentation (Correct)
  • Illumination
  • Disclosure
  • Examination

Correct: A changelog supports documentation.

14. Verification and reporting come directly before the data-cleaning process.

  • True
  • False (Correct)

Correct: Verification and reporting come after the data-cleaning process.

15. Which function removes leading, trailing, and repeated spaces in data?

  • TRIM (Correct)
  • CROP
  • TIDY
  • CUT

Correct: TRIM is a function that removes leading, trailing, and repeated spaces in data.

16. Which SQL tool considers one or more conditions, then returns a value as soon as a condition is met?

  • CASE (Correct)
  • WHEN
  • THEN
  • ELSE

Correct: CASE considers one or more conditions, then returns a value as soon as a condition is met.

17. Fill in the blank: A changelog contains a _____ list of modifications made to a project.

  • approximate
  • random
  • synchronized
  • chronological (Correct)

Correct: A data analyst uses a changelog to access the information needed. A changelog is a file that contains a chronological list of modifications made to a project.

18. A data analyst makes changes to SQL queries and uses these comments to create a changelog. This involves specifying the changes they made and why they made them.

  • True (Correct)
  • False

Correct: A data analyst making changes to SQL queries and using these comments to create a changelog involves specifying the changes they made and why they made them.

19. What is involved in seeing the big picture when verifying data cleaning? Select all that apply

  • Consider the business problem (Correct)
  • Consider the data (Correct)
  • Consider the goal (Correct)
  • Consider the reporting

Correct: To see the big picture when verifying data cleaning, consider the business problem, the goal, and the data.

20. Fill in the blank: TRIM is a function that removes _____ spaces in data. Select all that apply.

  • Leading (Correct)
  • Repeated (Correct)
  • inner
  • trailing (Correct)

Correct: TRIM is a function that removes leading, trailing, and repeated spaces in data.

21. What is the process of tracking changes, additions, deletions, and errors during data cleaning?

  • Documentation (Correct)
  • Cataloging
  • Recording
  • Observation

Correct: Documentation is the process of tracking changes, additions, deletions, and errors during data cleaning.

22. At what point during the analysis process does a data analyst use a changelog?

  • While cleaning the data (Correct)
  • While visualizing the data
  • While gathering the data
  • While reporting the data

Correct: A data analyst uses a changelog while cleaning data.

23. A data analyst is starting a large scale project. The project will be crucial to business success and the data analyst needs to keep the big picture at the forefront when verifying their data cleaning. What is the first step in the verification process?

  • Determine the quality of the data
  • Compare cleaned data with the original, uncleaned dataset and compare it to what is there now (CORRECT)
  • Create a chronological list of modifications made to the data
  • Inform others of the data-cleaning effort

24. During the verification process, you find that you missed a few leading spaces during data cleaning. What function can you use to eliminate these spaces?

  • TIDY
  • TRIM (CORRECT)
  • CROP
  • CUT

25. What tool can a data analyst use to figure out how many identical errors occur in a dataset?

  • CONFIRM
  • CASE
  • COUNT
  • COUNTA (CORRECT)

26. You find a few misspellings in your datatable and need to correct them when running a query. What function can you use when your set condition is met?

  • CASE (CORRECT)
  • THEN
  • WHEN
  • ELSE

27. A data analyst uses a changelog while cleaning their data. What data modifications should they track in the changelog?

  • Changes, resolutions, and deletions
  • Errors, deletions, and notes (CORRECT)
  • Errors, additions, and deletions
  • Additions, changes, and queries

28. Fill in the blank: A process to confirm that a data-cleaning effort was well-executed and the resulting data is accurate and reliable is known as _____.

  • manipulation
  • publishing
  • verification (CORRECT)
  • processing

29. What is the first step in the verification process?

  • Inform others of your data-cleaning effort
  • Compare cleaned data with the original, uncleaned dataset and compare it to what is there now (CORRECT)
  • Create a chronological list of modifications made to the data
  • Determine the quality of the data

30. During data cleaning, you find an error in a username where the ID number was accidentally joined to the user’s last name. You need to figure out if this username has been entered incorrectly more than once in your dataset. If you use a pivot table, what function can you use to determine the number of times this error occurs in your dataset?

  • COUNT
  • CASE
  • CHECK
  • COUNTA (CORRECT)

31.  Fill in the blank: A data analyst uses the CASE statement to consider one or more _____, then return a value.

  • changes
  • fields
  • identifications
  • conditions (CORRECT)

32.  Fill in the blank: While cleaning data, a data analyst can use a changelog to keep a chronological list of changes they make. They can refer to it during the _____ period if there are errors or questions.

  • documentation
  • presenting
  • verification (CORRECT)
  • visualization

33. A data analyst is reviewing modifications made to a SQL table and a spreadsheet. The data analyst will get similar results when using the changelogs for both data sources.

  • True (CORRECT)
  • False

34.  Fill in the blank: A data analyst finishes cleaning their data. The next step in the process is reporting and ____.

  • verification (CORRECT)
  • manipulation
  • replacing
  • processing

35. A data analyst is starting a large scale project that is crucial to business success. The data analyst needs to remember the big picture when verifying their data cleaning. What is involved when focusing on the big picture-view of the project? Select all that apply.

  • Consider the stakeholders
  • Consider the reporting
  • Consider the business problem (CORRECT)
  • Consider the goal (CORRECT)

36. Your manager points out an error in a product ID number in your dataset. The Product IDs can be numbers like 42 or text like “CAD-425”. Using a pivot table, what function can you use to find how many times this error occurs in the dataset?

  • CASE
  • CHECK
  • COUNT
  • COUNTA (CORRECT)

37. A data analyst is in the verification process and needs to verify the modifications that they have made to the data. What could the analyst reference to find the changes they made throughout data cleaning?

  • Changelog (CORRECT)
  • Metadata
  • Spreadsheet
  • Notepad

38. A data analyst uses the COUNTA function to count which of the following?

  • The total number of values within a specified range (CORRECT)
  • The total number of headers in a specific range
  • The specific numbers in a dataset
  • The total number of entries in a changelog

39. You’re working with a dataset that contains categorical variables. You notice that some of the strings are misspelled or are not capitalized. What function can you use to fix these errors when a condition is met?

  • CASE (CORRECT)
  • THEN
  • WHEN
  • ELSE

40. Fill in the blank: Documentation is the process of tracking _____ during data cleaning. Select all that apply.

  • inactivity
  • changes (CORRECT)
  • additions (CORRECT)
  • deletions (CORRECT)

41. Fill in the blank: As a data analyst, you should always create a _____ to track your additions, deletions, errors, and changes to a query.

  • notepad
  • spreadsheet
  • changelog (CORRECT)
  • database

42. In what step of the data-cleaning process do you find mistakes before you begin analyzing the data?

  • Publishing
  • Processing
  • Confirming
  • Verifying (CORRECT)

43. As a data analyst, you will need to keep the big picture in mind throughout any project when verifying data cleaning. What must the analyst do to take a big picture view of the project? Select all that apply.

  • Consider the reporting
  • Consider the goal (CORRECT)
  • Consider the business problem (CORRECT)
  • Consider the data (CORRECT)

Verify and Report on your Cleaning Results conclusion

In summary, data cleaning is an essential step in the data analysis process. It is important to report and verify your data cleaning in order to show that your data is ready for the next step. By taking this course on Coursera, you will learn about the processes involved with verifying and reporting data cleaning as well as their benefits. Join the learning experience today!