several data collections across its many departments.

Course 3 – Prepare Data for Exploration Quiz Answers

Week 3: Databases: Where Data Lives

GOOGLE DATA ANALYTICS PROFESSIONAL CERTIFICATION

Complete Coursera Study Guide

DATABASES: WHERE DATA LIVES – INTRODUCTION

Databases are the backbone of data analysis. With a Google Analytics Certification from Coursera, you’ll be able to access the data that lives in databases and use it for your own purposes. You will learn how to extract, filter, and sort this data and work with metadata. All of these skills are necessary for effective data analysis and can help you gain insights into trends and other important information.

Databases offer a wealth of knowledge and understanding that is essential for any analyst’s toolkit. The Google Data Analytics certification from Coursera will equip you with the necessary skills to navigate through databases easily and efficiently so you can make sense of the vast amounts of data available today.

Test your knowledge on working with databases

1. Fill in the blank: A _____ is an identifier that references a database column in which each value is unique.

  • foreign key
  • relation
  • field
  • primary key (Correct)

Correct: A primary key is an identifier that references a column in which each value is unique. A foreign key is a field within a table that’s a primary key in the original table.

2. Fill in the blank: A relational database contains a series of _____ that can be connected to form relationships.

  • cells
  • spreadsheets
  • fields
  • tables (Correct)

Correct: A relational database contains a series of tables that can be connected to form relationships.

3. A key benefit of working with normalized databases is that they help lower data redundancy. Which of the following is an example of redundancy?

  • The same piece of data being stored in two different places (Correct)
  • A database that forms two or more relationships
  • Team members in different office locations working with the same data
  • A database containing two foreign keys

Correct: The same piece of data being stored in two different places is an example of redundancy.

Test your knowledge on metaData

1. A large company has several data collections across its many departments. What kind of metadata indicates exactly how many collections a piece of data lives in?

  • Structural (Correct)
  • Administrative
  • Descriptive
  • Representative

Correct: Structural metadata indicates exactly how many collections data lives in. It provides information about how a piece of data is organized and whether it’s part of one, or more than one, data collection.

2. The date and time a photo was taken is an example of which kind of metadata?

  • Structural
  • Representative
  • Descriptive
  • Administrative (Correct)

Correct: The date and time a photo was taken is an example of administrative metadata. Administrative metadata indicates the technical source and details for a digital asset.

3. A large metropolitan high school gives each of its students an ID number to differentiate them in its database. What kind of metadata are the ID numbers?

  • Administrative
  • Representative
  • Structural
  • Descriptive (Correct)

Correct: The ID numbers are descriptive metadata. Descriptive metadata describes a piece of data or can be used to identify it at any time.

4. A company needs to merge third-party data with its own data. Which of the following actions will help make this process successful? Select all that apply.

  • Replace the incoming data’s metadata with its own company metadata.
  • Use the metadata to standardize the data. (Correct)
  • Alter the company’s metadata to more closely reflect the incoming metadata.
  • Use the metadata to evaluate the third-party data’s quality and credibility. (Correct)

Correct: The company can use the metadata to standardize the data and evaluate the third-party data’s quality and credibility.

Test your knowledge on accessing data sources

1. A CSV file saves data in a table format. What does CSV stand for?

  • Compatible scientific variables
  • Calculated spreadsheet values
  • Comma-separated values (Correct)
  • Cell-structured variables

Correct: CSV stands for comma-separated values.

2. A data analyst wants to bring data from a CSV file into a spreadsheet. This is an example of what process?

  • Importing data (Correct)
  • Filing data
  • Editing data
  • Normalizing data

Correct: A data analyst bringing data from a CSV file into a spreadsheet is an example of importing data.

3. A CSV file makes it easier for data analysts to complete which tasks? Select all that apply.

  • Distinguish values from one another (Correct)
  • Import data to a new spreadsheet (Correct)
  • Manage multiple tabs within a worksheet
  • Examine a small subset of a large dataset (Correct)

Correct: A CSV file makes it easier for data analysts to examine a small part of a large dataset, import data to a new spreadsheet, and distinguish values from one another.

4. What is the process of showing only the data that meets a specified criteria while hiding the rest?

  • Filtering (Correct)
  • Inspecting
  • Sorting
  • Converting

Correct: Filtering is the process of showing only the data that meets a specified criteria while hiding the rest. Filtering is an extremely useful technique for data cleaning, and an essential tool in every data analyst’s toolkit.

Test your knowledge on sorting and filtering

1. What is the process for arranging data into a meaningful order to make it easier to understand, analyze, and visualize?

  • Reframing
  • Sorting (Correct)
  • Filtering
  • Prioritizing

Correct: Sorting is the process for arranging data into a meaningful order to make it easier to understand, analyze, and visualize.

2. A data analyst is reviewing a national database of real estate sales. They are only interested in sales of condominiums. How can the analyst narrow their scope?

  • Sort by condominium sales
  • Filter out condominium sales
  • Sort by non-condominium sales
  • Filter out non-condominium sales (Correct)

Correct: The analyst can narrow their scope by filtering out non-condominium sales. This will enable them to view only the data on condominium sales.

3. A data analyst works for a rental car company. They have a spreadsheet that lists car ID numbers and the dates cars were returned. How can they sort the spreadsheet to find the most recently returned cars?

  • By return date, in descending order (Correct)
  • By car numerical ID, in descending order
  • By return date, in ascending order
  • By car numerical ID, in ascending order

Correct: To sort the spreadsheet to quickly find the most recently returned cars, they should sort by return date, in descending order.

4. Fill in the blank: To keep a header row at the top of a spreadsheet, highlight the row and select _____ from the View menu.

  • Pin
  • Lock
  • Set
  • Freeze (Correct)

Correct: To keep a header row at the top of a spreadsheet, highlight the row and select freeze from the View menu.

Working with large datasets in Sql

1. Run another query on your table:

SELECT

     end_station_name

FROM

    `bigquery-public-data.london_bicycles.cycle_hire`

WHERE

     rental_id = 57635395;

At what station did the bike trip with rental_id 57635395 end?

  • Southwark Street, Bankside
  • Tower Gardens, Tower
  • Notting Hill Gate Station, Notting Hill
  • East Village, Queen Elizabeth Olympic Park (Correct)

Correct: The address listed under the end_station_name column for Row 1 of your results table was East Village, Queen Elizabeth Olympic Park. To find this, you successfully ran a query. Going forward, you will continue using SELECT, FROM, and WHERE statements in your queries to interact with databases using SQL. This will help you build more complicated SQL queries when you are analyzing data in the future.

Create a custom table in BigQuery

1. After running the query on your new table, what was the third most popular baby name for boys in 2014?

  • Jacob
  • William
  • Mason (Correct)
  • Noah

Correct: To find that Mason was the third most popular baby name for boys in 2014, you queried your custom table and checked the results. Going forward, you’ll be able to upload your own data sources into BigQuery for future analysis projects. This will allow you to practice writing SQL queries for more data sources, which will be a key skill as a data analyst.

Test your knowledge on using SQL with large datasets

1. In MySQL, what is acceptable syntax for the SELECT keyword? Select all that apply.

  • select (Correct)
  • “SELECT”
  • SELECT (Correct)
  • ‘SELECT’

Correct: In MySQL, SELECT or select is acceptable syntax.

2. A database table is named blueFlowers. What type of case is this?

  • Lowercase
  • Snake case
  • Camel case (Correct)
  • Sentence case

Correct: blueFlowers is in camel case.

3. In BigQuery, what optional syntax can be removed from the following FROM clause without stopping the query from running?

FROM `bigquery-public-data.sunroof_solar.solar_potential_by_postal_code`

  • Dashes
  • Underscores
  • Backticks (Correct)
  • Dots

Correct: The name of the dataset is shown between two backticks to help people read the query more easily. If you remove the backticks, the query will still run.

4. In the following FROM clause, what is the table name in the SQL query?

FROM

bigquery-public-data.sunroof_solar.solar_potential_by_postal_code

  • public-data.sunroof
  • solar.solar
  • sunroof_solar
  • solar_potential_by_postal_code (Correct)

Correct: The table name in the SQL query is solar_potential_by_postal_code. This table is in the sunroof_solar dataset, a public dataset in BigQuery.

GOOGLE DATA ANALYTICS COURSERA ANSWERS AND STUDY GUIDE

Liking our content? Then don’t forget to add us to your bookmarks so you can find us easily!

Weekly Breakdown | Google Study Guides | Back to Top

Prepare Data for Exploration Weekly Challenge 3

1. Primary and foreign keys are two connected identifiers within separate tables. These tables exist in what kind of database?

  • Normalized
  • Metadata
  • Relational (Correct)
  • Primary

Correct: Primary and foreign keys are two connected identifiers within separate tables in a relational database.

2. When working with data from an external source, what can metadata help data analysts do? Select all that apply.

  • Ensure data is clean and reliable (Correct)
  • Choose which analyses to run
  • Understand the contents of a database (Correct)
  • Combine data from more than one source (Correct)

Correct: Metadata helps data analysts understand the contents of a database, ensure data is clean and reliable, and understand how to combine data from more than one source.

3. Think about data as a student at a high school. In this metaphor, which of the following are examples of metadata? Select all that apply.

  • Classes the student is enrolled in (Correct)
  • Student’s ID number (Correct)
  • Student’s enrollment date (Correct)
  • Grades the student earns

Correct: The student ID number, enrollment date, and classes the student is enrolled in represent structural metadata.

4. Fill in the blank: Data governance is the process of ensuring that a company’s _____ are managed in a formal manner.

  • business tasks
  • data assets (Correct)
  • data engineers
  • business strategies

Correct: Data governance is the process of ensuring that a company’s data assets are managed in a formal manner.

5. What are some key benefits of using external data? Select all that apply.

  • External data has broad reach. (Correct)
  • External data can provide industry-level perspectives. (Correct)
  • External data is always reliable.
  • External data is free to use.

Correct: Some key benefits of using external data are that it has a broad reach and it provides industry-level perspectives.

6. A data analyst reviews a database of Wisconsin car sales to find the last car models sold in Milwaukee in 2019. How can they sort and filter the data to return the last five cars sold at the top of their list? Select all that apply.

  • Sort by sale date in ascending order
  • Filter out sales not in 2019 (Correct)
  • Sort by sale date in descending order (Correct)
  • Filter out sales outside of Milwaukee (Correct)

Correct: The analyst can filter out sales outside of Milwaukee in 2019 and sort by date in descending order.

7. When writing a query, the name of the dataset can either be inside two backticks, or not, and the query will still run properly.

  • True (Correct)
  • False

Correct: When writing a query, the name of the dataset can either be inside two backticks, or not, and the query will still run properly.

8. You are working with a database table that contains customer data. The first_name column lists the first name of each customer. You are only interested in customers with the first name Mark.

You write the SQL query below. Add a WHERE clause that will return only customers named Mark.

Course_3_Challenge_Week_3

9. How many customers are named Mark?

  • 5
  • 1
  • 2 (Correct)
  • 3

Correct: The clause WHERE first_name = ‘Mark’ will return only customers named Mark. The complete query is SELECT * FROM customer WHERE first_name = ‘Mark’. The WHERE clause filters results that meet certain conditions. The WHERE clause includes the name of the column, an equals sign, and the value(s) in the column to include. Place quotes around text values. There are two customers named Mark.

10. Relational databases contain a series of tables connected to form relationships. Which two types of fields exist in two connected tables?

  • Primary and foreign keys (CORRECT)
  • Internal and external data
  • Descriptive and structural metadata
  • Star and snowflake schemas

Correct: Primary and foreign keys exist in two connected tables in a relational database.

11. Data analysts use metadata for what tasks? Select all that apply.

  • To perform data analyses
  • To evaluate the quality of data (CORRECT)
  • To interpret the contents of a database (CORRECT)
  • To combine data from more than one source (CORRECT)

Correct: Data analysts use metadata to combine data, evaluate data, and interpret a database.
Correct: Data analysts use metadata to combine data, evaluate data, and interpret a database.
Correct: Data analysts use metadata to combine data, evaluate data, and interpret a database.

12. Structural metadata indicates how a piece of data is organized and whether it’s part of one or more than one data collection.

  • True (CORRECT)
  • False

Correct: Structural metadata indicates how a piece of data is organized and whether it’s part of one, or more than one, data collection.

12. What is the process that data analysts use to ensure the formal management of their company’s data assets?

  • Data mapping
  • Data governance (CORRECT)
  • Data aggregation
  • Data integrity

Correct: Data governance is the process of ensuring the formal management of a company’s data assets.

13. A data analyst chooses not to use external data because it represents diverse perspectives. This is an appropriate decision when working with external data.

  • True
  • False (CORRECT)

Correct: External data representing diverse perspectives is not an appropriate reason to choose not to use a dataset. A data analyst might choose not to use external data if the it cannot be confirmed to be reliable.

14. A data analyst reviews a database of Wisconsin car sales to find the last car models sold in Milwaukee in 2019. How can they sort and filter the data to return the last five cars sold at the top of their list? Select all that apply.

  • Sort by sale date in ascending order
  • Sort by sale date in descending order (CORRECT)
  • Filter out sales outside of Milwaukee (CORRECT)
  • Filter out sales not in 2019 (CORRECT)

Correct: The analyst can filter out sales outside of Milwaukee in 2019 and sort by date in descending order.
Correct: The analyst can filter out sales outside of Milwaukee in 2019 and sort by date in descending order.
Correct: The analyst can filter out sales outside of Milwaukee in 2019 and sort by date in descending order.

15. Think about data as driving a taxi cab. In this metaphor, which of the following are examples of metadata? Select all that apply.

  • Passengers the taxi picks up
  • Make and model of the taxi cab (CORRECT)
  • License plate number (CORRECT)
  • Company that owns the taxi (CORRECT)

Correct: The license plate number, make and model of the cab, and the company that owns the taxi represent structural metadata.
Correct: The license plate number, make and model of the cab, and the company that owns the taxi represent structural metadata.
Correct: The license plate number, make and model of the cab, and the company that owns the taxi represent structural metadata.

16. What are some key benefits of using external data? Select all that apply.

  • External data is free to use.
  • External data is always reliable.
  • External data can provide industry-level perspectives. (CORRECT)
  • External data has broad reach. (CORRECT)

Correct: Some key benefits of using external data are that it has a broad reach and it provides industry-level perspectives.
Correct: Some key benefits of using external data are that it has a broad reach and it provides industry-level perspectives.

17. You are working with a database table that contains customer data. The city column lists the city where each customer is located. You want to find out which customers are located in Berlin.

You write the SQL query below. Add a WHERE clause that will return only customers located in Berlin.
SELECT
*
FROM
customer

How many customers are located in Berlin?

  • 9
  • 12
  • 2 (CORRECT)
  • 7

18. A data analyst reviews a national database of movie theater showings. They want to find the first movies shown in San Francisco in 2001. How can they organize the data to return the first 10 movies shown at the top of their list? Select all that apply.

  • Sort by date in descending order
  • Sort by date in ascending order (CORRECT)
  • Filter out showings outside of San Francisco (CORRECT)
  • Filter out showings not in 2001 (CORRECT)

Correct: The analyst can filter out showings outside of San Francisco in 2001 and sort by date in ascending order.

19. A nonprofit maintains a list of how many laptops they provide to each school in the county. In the table, there is a column called number_of_laptops. A data analyst wants to determine which schools were given the fewest laptops. How should they sort the data to return these schools first?

  • Sort numerically in descending order
  • Sort alphabetically in ascending order
  • Sort numerically in ascending order (CORRECT)
  • Sort alphabetically in descending order

Correct: The data analyst should sort the number_of_laptops column numerically in ascending order.

20. When writing a query, you must remove the two backticks around the name of the dataset in order for the query to run properly.

  • True
  • False (CORRECT)

Correct: When writing a query, the name of the dataset can either be inside two backticks, or not, and the query will still run properly.

21. Fill in the blank: Data _ is the process of ensuring the formal management of a company’s data assets.

  • aggregation
  • governance (CORRECT)
  • mapping
  • integrity

Correct: Data governance is the process of ensuring the formal management of a company’s data assets.

22. In what circumstance might a data analyst choose not to use external data in their analysis?

  • The data cannot be confirmed to be reliable (CORRECT)
  • The data is free for anyone to access
  • The data represents diverse perspectives
  • The data is too thorough

Correct: A data analyst might choose not to use external data in their analysis if the data cannot be confirmed to be reliable.

databases: where data lives – conclusion

Many people view databases as daunting and overwhelming, but they can be a powerful asset with the proper tools and training. The Google Analytics Certification from Coursera will give you the skills you need to work with databases so you can extract data that is important for your needs. This certification will also teach you how to filter and sort this data so you can make trends and other insights. With this valuable skill set, you’ll be able to access the information available in databases and use it to your advantage. Join Coursera today and get started on your journey to becoming a certified data analyst.