COURSE 4: PERFORM DATA SCIENCE WITH AZURE DATABRICKS
Module 1: Introduction To Azure Databricks
MICROSOFT AZURE DATA SCIENTIST ASSOCIATE (DP-100) PROFESSIONAL CERTIFICATE
Complete Coursera Study Guide
INTRODUCTION – Introduction To Azure Databricks
In this module, you will explore the powerful features of Azure Databricks and the Apache Spark notebook for handling large datasets. You will gain an understanding of the Azure Databricks platform and learn to recognize the types of tasks that Apache Spark excels at. Additionally, the module will introduce you to the architecture of an Azure Databricks Spark Cluster and the workings of Spark Jobs, providing a comprehensive overview of how these technologies can be leveraged for efficient data processing.
Learning Objectives
- Describe the capabilities of Azure Databricks and the Apache Spark notebook for processing huge files
- Describe the Azure Databricks platform and identify the types of tasks well-suited for Apache Spark
- Describe the architecture of an Azure Databricks Spark Cluster and Spark Jobs
PRACTICE QUIZ: KNOWLEDGE CHECK 1
1. Apache Spark is a unified processing engine that can analyze big data with which of the following features?
Select all that apply.
- Support for multiple Drivers running in parallel on a cluster
- Graph Processing (CORRECT)
- Real-time stream analysis (CORRECT)
- SQL (CORRECT)
- Machine Learning (CORRECT)
Correct: Spark is a unified processing engine that can analyze big data using graph processing.
Correct: Spark is a unified processing engine that can analyze big data using real-time stream analysis.
Correct: Spark is a unified processing engine that can analyze big data using SQL.
Correct: Spark is a unified processing engine that can analyze big data using machine learning.
2. Which of the following Databricks features are not part of open-source Spark?
Select all that apply.
- MLflow
- Databricks Runtime (CORRECT)
- Databricks Workflows (CORRECT)
- Databricks Workspace (CORRECT)
Correct: Databricks Runtime is not open-source Spark.
Correct: Databricks Workflows is not open-source Spark.
Correct: Databricks Workspace is not open-source Spark.
3. Apache Spark notebooks allow which of the following?
Select all that apply.
- Create new Workspace
- Display graphical visualizations (CORRECT)
- Rendering of formatted text (CORRECT)
- Execution of code (CORRECT)
Correct: A notebook is a collection of cells. These cells can display graphical visualizations.
Correct: A notebook is a collection of cells. These cells can be run to render formatted text.
Correct: A notebook is a collection of cells. These cells are run to execute code.
4. In Azure Databricks, when creating a new notebook, which of the following are the default languages available to select?
Select all that apply.
- Java
- R (CORRECT)
- Scala (CORRECT)
- Python (CORRECT)
- SQL (CORRECT)
Correct: In Azure Databricks when creating a new Notebook, one of the default languages available to select from is R.
Correct: In Azure Databricks when creating a new Notebook, one of the default languages available to select from is Scala.
Correct: In Azure Databricks when creating a new Notebook, one of the default languages available to select from is Python.
Correct: In Azure Databricks when creating a new Notebook, one of the default languages available to select from is SQL.
5. If your notebook is attached to a cluster, you can carry out which of the following from within the notebook?
Select all that apply.
- Delete the cluster
- Attach to another cluster (CORRECT)
- Restart the cluster (CORRECT)
- Detach your notebook from the cluster (CORRECT)
Correct: If your notebook is attached to a cluster, you can attach to another cluster.
Correct: If your notebook is attached to a cluster, you can restart the cluster.
Correct: If your notebook is attached to a cluster, you can detach your notebook from the cluster.
PRACTICE QUIZ: KNOWLEDGE CHECK 2
1. You work with Big Data as a data engineer or a data scientist, and you must process data that has the characteristics often referred to as the “3 Vs of Big Data”. What do the 3 Vs of Big Data stand for?
Select all that apply.
- Variable
- Variety (CORRECT)
- Velocity (CORRECT)
- Volume (CORRECT)
Correct: Variety – Your data types are varied, from structured relational data sets and financial transactions to unstructured data such as chat and SMS messages, IoT devices, images, logs, MRIs, etc.
Correct: High velocity – You require streaming and real-time processing capabilities.
Correct: High volume – You must process an extremely large volume of data and need to scale out your compute accordingly.
2. Spark’s performance is based on parallelism. Which of the following scaling methods is limited by a finite amount of RAM, threads, and CPU speed?
- Horizontal Scaling
- Diagonal Scaling
- Vertical Scaling (CORRECT)
Correct: Scaling vertically is limited by a finite amount of RAM, threads, and CPU speed.
3. In an Apache Spark cluster, jobs are divided into which of the following?
- Executors
- Slots
- Drivers
- Tasks (CORRECT)
Correct: Jobs are subdivided into tasks. The input to a job is partitioned into one or more partitions. These partitions are the unit of work for each slot.
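The feedback above can be sketched with plain arithmetic: each partition becomes one task, each executor core provides one slot, and tasks run in waves until every partition has been processed. This is a minimal sketch; the function name and numbers are illustrative, not Databricks defaults.

```python
import math

def task_waves(num_partitions, num_executors, cores_per_executor):
    # Each partition becomes one task; each executor core provides one slot.
    slots = num_executors * cores_per_executor
    # Slots process tasks in parallel "waves" until all partitions are done.
    return math.ceil(num_partitions / slots)

# Example: 200 partitions on 4 executors with 8 cores each -> 32 slots, 7 waves.
print(task_waves(200, 4, 8))  # 7
```

With more partitions than slots, some slots process several tasks in sequence; with fewer, part of the cluster sits idle.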
4. When creating a new cluster in the Azure Databricks workspace, which of the following is a sequence of steps that happens in the background?
- When an Azure Databricks workspace is deployed, you are allocated a pool of VMs. Creating a cluster draws from this pool.
- Azure Databricks creates a cluster of driver and worker nodes, based on your VM type and size selections. (CORRECT)
- Azure Databricks provisions a dedicated VM (Virtual Machine) that processes all jobs, based on your VM type and size selection.
Correct: At the time of cluster creation, you specify the types and sizes of the virtual machines (VMs) to use for both the Driver and Worker nodes, but Azure Databricks manages all other aspects of the cluster.
5. To parallelize work, the unit of distribution is a Spark Cluster. Every Cluster has a Driver and one or more executors. Work submitted to the Cluster is split into what type of object?
- Arrays
- Stages
- Jobs (CORRECT)
Correct: Each parallelized action is referred to as a Job. The results of each Job are returned to the Driver. Depending on the work required, multiple Jobs may be needed. Each Job is broken down into Stages.
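As a rough illustration of that hierarchy, the decomposition of a single Job can be modeled as a small data structure: one Job containing Stages, each Stage containing Tasks (one per partition). The action name and counts here are hypothetical.

```python
# Hypothetical decomposition of one parallelized action (a "count rows" Job):
# the Job is broken into Stages, and each Stage into Tasks, one per partition.
job = {
    "job": "count rows",
    "stages": [
        {"stage": 0, "tasks": [f"count partition {i}" for i in range(4)]},
        {"stage": 1, "tasks": ["combine partial counts"]},
    ],
}

total_tasks = sum(len(stage["tasks"]) for stage in job["stages"])
print(total_tasks)  # 5 tasks across 2 stages
```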
6. A Spark cluster uses two levels of parallelization. Which of the following are levels of parallelization?
- Job
- Partition
- Slot (CORRECT)
- Executor (CORRECT)
Correct: The second level of parallelization is the Slot – the number of which is determined by the number of cores and CPUs of each node.
Correct: The first level of parallelization is the Executor – a Java virtual machine running on a node, typically one instance per node.
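These two levels can be sketched in plain Python: level one is the set of Executors (one JVM per worker node), and level two is the Slots inside each Executor (one per core). The class names and numbers are illustrative only, not part of any Spark API.

```python
class Executor:
    """Level one: a Java virtual machine running on a worker node."""
    def __init__(self, cores):
        self.cores = cores

    @property
    def slots(self):
        # Level two: one slot per core; each slot runs one task at a time.
        return self.cores

class Cluster:
    """One driver plus one or more executors (typically one per node)."""
    def __init__(self, executors):
        self.executors = executors

    @property
    def total_slots(self):
        return sum(executor.slots for executor in self.executors)

# Four worker nodes, one executor each, 8 cores per executor -> 32 slots.
cluster = Cluster([Executor(cores=8) for _ in range(4)])
print(cluster.total_slots)  # 32
```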
QUIZ: TEST PREP
1. Azure Databricks Runtime adds several key capabilities to Apache Spark workloads that can increase performance and reduce costs. Which of the following are features of Azure Databricks?
Select all that apply.
- Parallel Cluster Drivers
- High-speed connectors to Azure storage services (CORRECT)
- Caching (CORRECT)
- Auto-scaling and auto-termination (CORRECT)
- Indexing (CORRECT)
Correct: Azure Databricks Runtime adds several key capabilities to Apache Spark workloads that can increase performance and reduce costs including High-speed connectors to Azure storage services.
Correct: Azure Databricks Runtime adds several key capabilities to Apache Spark workloads that can increase performance and reduce costs including Caching.
Correct: Azure Databricks Runtime adds several key capabilities to Apache Spark workloads that can increase performance and reduce costs including auto-scaling and auto-termination.
Correct: Azure Databricks Runtime adds several key capabilities to Apache Spark workloads that can increase performance and reduce costs including Indexing.
2. Apache Spark supports which of the following languages?
Select all that apply.
- ORC
- Python (CORRECT)
- Java (CORRECT)
- Scala (CORRECT)
Correct: Apache Spark supports Python.
Correct: Apache Spark supports Java.
Correct: Apache Spark supports Scala.
3. Which of the following statements are true?
Select all that apply.
- Once created, a notebook can only be connected to the original cluster.
- To use your Azure Databricks notebook to run code, you do not require a cluster.
- You can detach a notebook from a cluster and attach it to another cluster. (CORRECT)
- To use your Azure Databricks notebook to run code, you must attach it to a cluster (CORRECT)
Correct: You can detach your notebook from a cluster and attach it to another, depending on your organization’s requirements.
Correct: To use your notebook to run code, you must attach it to a cluster.
4. Which of the following Databricks features are not part of open-source Spark?
- MLflow
- Databricks Runtime (CORRECT)
- Databricks Workflows (CORRECT)
- Databricks Workspace (CORRECT)
Correct: Databricks Runtime is not open-source Spark.
Correct: Databricks Workflows is not open-source Spark.
Correct: Databricks Workspace is not open-source Spark.
5. How many drivers does a Cluster have?
- Configurable between one and eight
- Two, running in parallel
- Only one (CORRECT)
Correct: A cluster has one and only one driver.
6. What type of process are the driver and the executors?
- Python processes
- Java processes (CORRECT)
- C++ processes
Correct: The driver and the executors are Java processes.
7. You work with Big Data as a data engineer, and you must process real-time data. This is referred to as having which of the following characteristics?
- High volume
- High velocity (CORRECT)
- Variety
Correct: This characteristic relates to the requirement for streaming and real-time processing capabilities.
8. Spark’s performance is based on parallelism. Which of the following scaling methods is limited by a finite amount of RAM, threads, and CPU speed?
- Diagonal Scaling
- Vertical Scaling (CORRECT)
- Horizontal Scaling
Correct: Scaling vertically is limited by a finite amount of RAM, threads, and CPU speed.
9. A Spark cluster uses two levels of parallelization. Which of the following are levels of parallelization?
- Partition
- Job
- Slot (CORRECT)
- Executor (CORRECT)
Correct: The second level of parallelization is the Slot – the number of which is determined by the number of cores and CPUs of each node.
Correct: The first level of parallelization is the Executor – a Java virtual machine running on a node, typically, one instance per node.
10. In an Apache Spark cluster, jobs are divided into which of the following?
- Tasks (CORRECT)
- Drivers
- Executors
- Slots
Correct: Jobs are subdivided into tasks. The input to a job is partitioned into one or more partitions. These partitions are the unit of work for each slot.
11. You are introducing the Databricks platform to your team. Which of the following features can you demonstrate to them? Check all that apply.
- Advanced query optimization (CORRECT)
- Caching and indexing (CORRECT)
- High-speed Azure connection (CORRECT)
- Automation of Spark clusters (CORRECT)
Correct: The focus of Azure Databricks is optimization. The platform was optimized from the ground up.
Correct: The platform increases performance and reduces costs by automating management.
Correct: Databricks provides high-speed connectors to Azure storage services, such as Azure Blob Store and Azure Data Lake.
Correct: Databricks offers auto-scaling and auto-termination of Spark clusters to minimize costs.
12. Identify the components of an Azure Databricks Spark cluster. Check all that apply.
- Slot (CORRECT)
- Driver (CORRECT)
- Task (CORRECT)
- Worker (CORRECT)
Correct: Work sent from the driver nodes to the worker nodes is assigned to slots, instructing them to pull data from a specified data source.
Correct: In an Apache Spark cluster, the driver is the notebook interface. It contains the main loop for the program and creates distributed datasets on the cluster.
Correct: A task is sent from the driver node to the worker nodes.
Correct: The nodes that host the executors and carry out the distributed work are known as workers.
13. You have deployed a series of jobs within a Spark cluster. How will the Spark cluster assign jobs to the nodes?
- Vertically
- Horizontally (CORRECT)
Correct: Spark scales jobs horizontally through a process known as parallelism.
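As a loose analogy in plain Python (not Spark code), a process pool can stand in for worker nodes: the input is split into partitions, each worker processes one partition in parallel, and the driver combines the partial results. The partition counts and the per-partition work are illustrative.

```python
from concurrent.futures import ProcessPoolExecutor

def process_partition(partition):
    # Hypothetical per-partition work: sum the values in the partition.
    return sum(partition)

if __name__ == "__main__":
    data = list(range(100))
    # Split the input into 4 partitions, one per simulated worker.
    partitions = [data[i::4] for i in range(4)]
    with ProcessPoolExecutor(max_workers=4) as pool:
        partial_results = list(pool.map(process_partition, partitions))
    # The "driver" combines the partial results from each worker.
    print(sum(partial_results))  # 4950, the same as summing on one machine
```

Adding more workers (horizontal scaling) adds capacity without being bound by the RAM or CPU of any single machine.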
CONCLUSION – Introduction To Azure Databricks
In conclusion, this module provides a thorough exploration of Azure Databricks and the Apache Spark notebook, highlighting their capabilities in processing large files. By understanding the Azure Databricks platform, identifying tasks suited for Apache Spark, and learning about the architecture of a Spark Cluster and Spark Jobs, you will be well-equipped to leverage these technologies for efficient and effective data processing.
Quiztudy Top Courses
Popular in Coursera
- Google Advanced Data Analytics
- Google Cybersecurity Professional Certificate
- Meta Marketing Analytics Professional Certificate
- Google Digital Marketing & E-commerce Professional Certificate
- Google UX Design Professional Certificate
- Meta Social Media Marketing Professional Certificate
- Google Project Management Professional Certificate
- Meta Front-End Developer Professional Certificate
Liking our content? Then, don’t forget to add us to your BOOKMARKS so you can find us easily!

