COURSE 3: GO BEYOND THE NUMBERS: TRANSLATE DATA INTO INSIGHTS

Module 1: Find and Share Stories Using Data

GOOGLE ADVANCED DATA ANALYTICS PROFESSIONAL CERTIFICATE

Complete Coursera Study Guide

INTRODUCTION – Find and Share Stories Using Data

Discover the art of uncovering compelling narratives hidden within data and effectively sharing them with your audience. Explore the significance of data cleaning, understanding its methods, and discovering how this process is pivotal in revealing impactful stories. Delve into the steps of the Exploratory Data Analysis (EDA) process, leveraging its power to rapidly gain insights into data. Conclude your learning journey by exploring diverse techniques for visualizing data, enabling the communication of key insights in a clear and engaging manner.

Learning Objectives

  • Recognize the importance of ethics and accessibility in visualizing data
  • Explain how EDA helps data professionals share stories from raw data sources
  • Recognize the importance of ethics and accessibility in sharing stories with data
  • Explain the process of exploratory data analysis (EDA) and the benefits of understanding data
  • Explain the importance of aligning EDA methods with business purposes while using the PACE framework
  • Identify the six different parts of the EDA process: discovering, structuring, cleaning, joining, validating, presenting
  • Explain how data analysis helps data professionals tell stories from raw data sources

PRACTICE QUIZ: TEST YOUR KNOWLEDGE: TELL STORIES WITH DATA

1. Fill in the blank: The presenting stage of exploratory data analysis involves sharing _____, which can include graphs, charts, diagrams, or dashboards.

  • databases
  • data visualizations (CORRECT)
  • data frames
  • datasets

Correct: The presenting stage of exploratory data analysis involves sharing data visualizations, which can include graphs, charts, diagrams, or dashboards.

2. During which exploratory data analysis practice might a data professional familiarize themself with the meaning of column headers in a dataset?

  • Discovering (CORRECT)
  • Validating
  • Joining
  • Structuring

Correct: A data professional might consider the meaning of column headers in a dataset during the discovering element of exploratory data analysis. This is when data professionals familiarize themselves with the data.

3. If sampled data is organized in such a way that it does not accurately represent its population as a whole, what problem will occur?

  • Biased data (CORRECT)
  • Unclean data
  • Disorganized data
  • Unfiltered data

Correct: If sampled data is organized in a way that does not accurately represent its population as a whole, it will be biased. Data bias occurs when a preference in favor of or against a person, group of people, or thing systematically skews data analysis results.

PRACTICE QUIZ: TEST YOUR KNOWLEDGE: HOW PACE INFORMS EDA AND DATA VISUALIZATIONS

1. What are the primary drivers of a data-driven story? Select all that apply.

  • Sales predictions
  • Stakeholder theories
  • Project goals (CORRECT)
  • Project purpose (CORRECT)

Correct: Data insights and the stories they help create are driven by a project’s purpose and its goals.

2. Fill in the blank: In order to help avoid _____ in the workplace, data professionals share the PACE plan with stakeholders and team members.

  • unnecessary meetings
  • competition
  • miscommunication (CORRECT)
  • unintentional bias

Correct: In order to help avoid miscommunication in the workplace, data professionals share the PACE plan with stakeholders and team members. PACE helps users communicate, solve problems, and make judgments quickly and efficiently.

3. Why is it important to maintain proper scale of a graph’s axes in a data visualization?

  • To take advantage of white space
  • To tell a more interesting data story
  • To change stakeholders’ minds
  • To avoid misrepresenting the data (CORRECT)

Correct: It is important to maintain proper scale of a graph’s axes in a data visualization to avoid skewing the data. Data visualizations with skewed data mislead the audience.

QUIZ: MODULE 1 CHALLENGE

1. Fill in the blank: The type of data being studied and the _____ guide the order of the six practices of exploratory data analysis.

  • size of the dataset
  • company mission
  • needs of the data team (CORRECT)
  • available hardware and software

Correct!

2. A data team leader at a clothing manufacturer reviews a dataset that will be used to decide where to open new retail stores. They conceptualize how their analytics team can most effectively use the dataset. Which exploratory data analysis process does this scenario describe?

  • Validating
  • Joining
  • Discovering (CORRECT)
  • Cleaning

Correct!

3. What are the goals of the structuring exploratory data analysis step? Select all that apply.

  • Correcting misspellings or other errors
  • Prepare data to be effectively modeled (CORRECT)
  • Make data easier to visualize and explain (CORRECT)
  • Group data in such a way that it accurately represents the dataset as a whole (CORRECT)

Correct!

4. Which of the following statements correctly compare data cleaning to data validation during exploratory data analysis? Select all that apply.

  • Cleaning is the process of confirming that no errors were introduced during validation.
  • Both data cleaning and data validation involve eliminating any misspellings in the data. (CORRECT)
  • Cleaning involves ensuring the data is useful. (CORRECT)
  • Validating involves verifying the data is of high quality. (CORRECT)

Correct!

5. Fill in the blank: A data professional discovers that their dataset does not have enough data. Therefore, they choose to add more data during the _____ process.

  • Joining (CORRECT)
  • structuring
  • validating
  • cleaning

Correct!

6. What steps may be involved with presenting data insights to others during exploratory data analysis? Select all that apply.

  • Remove written descriptions to save people time when viewing the visualizations
  • Share a cleaned dataset for additional analysis (CORRECT)
  • Ask team members or stakeholders for feedback (CORRECT)
  • Make the visualizations available to others for further modeling (CORRECT)

Correct!

7. What are some strategies that a data professional might use to help avoid miscommunication in the workplace? Select all that apply.

  • Provide audiences with raw data for their own exploration.
  • Share the PACE plan with all stakeholders. (CORRECT)
  • Present primary analysis with a working group to get feedback. (CORRECT)
  • Understand stakeholders’ most important goals before presenting to them. (CORRECT)

Correct!

8. A data professional works on a project that uses data from a study about farming in Africa. They consider how to use the PACE framework to perform exploratory data analysis practices effectively. Which of the following objectives will this help them achieve? Select all that apply.

  • Conform to client expectations by misrepresenting the data
  • Confirm that the data represents an appropriate number of African geographical regions (CORRECT)
  • Ensure ethical depictions of the farmers represented in the study (CORRECT)
  • Maintain focus on the project purpose (CORRECT)

Correct!

9. Fill in the blank: The exploratory data analysis process is_____, which means data professionals often work through the six practices multiple times.

  • Supplementary
  • Immutable
  • transitory
  • iterative (CORRECT)

Correct!

10. Fill in the blank: A data professional might add more context to the data during the _____ process by adding information from other data sources.

  • Joining (CORRECT)
  • structuring
  • cleaning
  • validating

Correct!

11. Fill in the blank: To avoid _____ in the workplace, data professionals can share initial data findings with a working group to get feedback before providing analyses to all stakeholders.

  • competition
  • silos
  • favoritism
  • miscommunication (CORRECT)

Correct!

12. A data professional at a financial investment company familiarizes themselves with a dataset for a new investment project. They consider the meaning of the column headers and how many total data points exist. Which exploratory data analysis process does this scenario describe?

  • Validating
  • Joining
  • Cleaning
  • Discovering (CORRECT)

Correct!

13. Fill in the blank: In exploratory data analysis, _____ is the process of augmenting a dataset by adding values from other sources.

  • cleaning
  • structuring
  • validating
  • joining (CORRECT)

Correct!

14. A data professional works on a project that uses data from a study about mental health in Europe. They consider how to use the PACE framework to perform exploratory data analysis practices effectively. Which of the following objectives will this help them achieve? Select all that apply.

  • Modify the data in order to meet all project deadlines
  • Ensure ethical depictions of the mental health subjects represented in the study (CORRECT)
  • Maintain focus on key priorities and project purpose (CORRECT)
  • Confirm that the data represents an appropriate number of European geographical regions (CORRECT)

Correct!

15. A data professional works in the research and development department of a high-tech firm. They receive a dataset that will be used when creating next year’s products. They review the data and consider key questions about it. Which exploratory data analysis process does this scenario describe?

  • Joining
  • Discovering (CORRECT)
  • Cleaning
  • Validating

Correct!

16. What procedures take place during the structuring exploratory data analysis step? Select all that apply.

  • Share data with stakeholders
  • Organize data columns based on the data within the dataset (CORRECT)
  • Transform raw data from the dataset (CORRECT)
  • Group data into categories that represent the dataset (CORRECT)

Correct!

17. What processes do data professionals perform during the structuring exploratory data analysis step? Select all that apply.

  • Create data visualizations
  • Transform raw data. (CORRECT)
  • Categorize data into categories representing the dataset (CORRECT)
  • Organize raw data (CORRECT)

Correct!

18. What are some best practices associated with visualizing data during exploratory data analysis? Select all that apply.

  • Design visualizations specifically to support your personal hypotheses.
  • Create visualizations that are ethical, accessible, and representative of the data. (CORRECT)
  • Ensure visualizations are guided by the story uncovered by the data. (CORRECT)
  • Use data visualizations throughout exploratory data analysis to better understand the data. (CORRECT)

Correct!

19. Fill in the blank: Exploratory data analysis is the process of investigating, organizing, and analyzing datasets and _____ their main characteristics.

  • Modifying
  • Summarizing (CORRECT)
  • preparing
  • augmenting

Correct!

20. A data professional works on a project that uses data from a study about teachers in Australia. They apply the PACE framework to perform exploratory data analysis practices effectively. Which of the following objectives will this help them achieve? Select all that apply.

  • Conform to stakeholder expectations by misrepresenting the data
  • Ensure ethical depictions of the teachers represented in the study (CORRECT)
  • Keep project priorities in order (CORRECT)
  • Confirm that the data represents an appropriate number of Australian geographical regions (CORRECT)

Correct!

21. Fill in the blank: A data professional discovers that their dataset does not have enough data. Therefore, they choose to add more data during the _____ process.

  • Validating
  • Structuring
  • cleaning
  • joining (CORRECT)

Correct!

22. Fill in the blank: To avoid miscommunication in the workplace, data professionals can share _____ with a working group to get early feedback.

  • metadata
  • changelogs
  • initial data findings (CORRECT)
  • raw data

Correct!

23. What may be involved with visualizing data during exploratory data analysis? Select all that apply.

  • Asking stakeholders to hold their comments until the final official presentation
  • Making data visualizations available to team members for further analysis or modeling (CORRECT)
  • Considering people with auditory impairments by providing captioned descriptions about the data (CORRECT)
  • Considering people with visual impairments by describing the data in detail (CORRECT)

Correct!

24. What is the process data professionals use to investigate, organize, and analyze datasets in order to summarize the data’s main characteristics?

  • Storytelling with data
  • Exploratory data analysis (CORRECT)
  • Data strategy
  • Data visualization

Correct: Exploratory data analysis, or EDA, is the process data professionals use to investigate, organize, and analyze datasets in order to summarize the data’s main characteristics.

25. When a data professional discusses a project plan and company goals with stakeholders, which element of the PACE model are they engaged in?

  • Analyze
  • Construct
  • Plan (CORRECT)
  • Execute

Correct: When a data professional discusses a project plan and company goals with stakeholders, they are engaged in the plan element of the PACE model. During planning, data professionals define the scope of the project and identify the informational needs of the organization.