Databricks Associate Data Engineer Questionnaire

Professional Development

45 Qs

Assessment

Quiz

Other

Professional Development

Easy

Created by

Subhoshree Deo

45 questions

1.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

  • A data organization leader is upset about the data analysis team’s reports being different from the data engineering team’s reports. The leader believes the siloed nature of their organization’s data engineering and data analysis architectures is to blame. Which of the following describes how a data lakehouse could alleviate this issue?

A. Both teams would autoscale their work as data size evolves

B. Both teams would use the same source of truth for their work

C. Both teams would reorganize to report to the same department

D. Both teams would be able to collaborate on projects in real-time

E. Both teams would respond more quickly to ad-hoc requests

Answer explanation

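Correct answer: B. A lakehouse stores the data once and serves both data engineering and data analysis workloads from it, so both teams work from a single source of truth. Keeping a separate copy of the data for each team is what creates silos and diverging reports.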

2.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

  • Which of the following describes a scenario in which a data team will want to utilize cluster pools?

A. An automated report needs to be refreshed as quickly as possible

B. An automated report needs to be made reproducible

C. An automated report needs to be tested to identify errors

D. An automated report needs to be version-controlled across multiple collaborators

E. An automated report needs to be runnable by all stakeholders

Answer explanation

Correct answer: A. Cluster pools keep a set of idle, ready-to-use instances, which reduces cluster startup time. The report can therefore be refreshed quickly instead of waiting for new instances to be provisioned.

3.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

  • Which of the following is hosted completely in the control plane of the classic Databricks architecture?

A. Worker node

B. JDBC data source

C. Databricks web application

D. Databricks Filesystem

E. Driver node

Answer explanation

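Correct answer: C. The Databricks web application is hosted in the control plane. Driver and worker nodes, by contrast, run in the data plane inside the customer's cloud account.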

4.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

  • Which of the following benefits of using the Databricks Lakehouse Platform is provided by Delta Lake?

A. The ability to manipulate the same data using a variety of languages

B. The ability to collaborate in real time on a single notebook

C. The ability to set up alerts for query failures

D. The ability to support batch and streaming workloads

E. The ability to distribute complex data operations

Answer explanation

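Correct answer: D. A Delta Lake table can be read and written by batch jobs and can also act as a streaming source or sink, so the same table supports both batch and streaming workloads.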

5.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

  • Which of the following describes the storage organization of a Delta table?

A. Delta tables are stored in a single file that contains data, history, metadata, and other attributes

B. Delta tables store their data in a single file and all metadata in a collection of files in a separate location

C. Delta tables are stored in a collection of files that contain data, history, metadata, and other attributes

D. Delta tables are stored in a collection of files that contain only the data stored within the table

E. Delta tables are stored in a single file that contains only the data stored within the table

Answer explanation

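Correct answer: C. A Delta table is stored as a directory of files in cloud object storage: Parquet data files plus a transaction log (the _delta_log directory) that records the table's history and metadata. The table is also registered in the metastore under a catalog and schema.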
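To see this layout on a real table, the following Databricks SQL sketch may help. The table name my_table is a placeholder for illustration, and the exact columns returned can vary by runtime version.

```sql
-- Placeholder table name: replace my_table with an existing Delta table.

-- Shows table-level details such as the storage location, the format,
-- and the number of data files that make up the current version.
DESCRIBE DETAIL my_table;

-- Shows the schema plus metastore information (catalog, schema, owner, location).
DESCRIBE TABLE EXTENDED my_table;
```

The location reported by DESCRIBE DETAIL points to a directory, not a single file, which is what distinguishes option C from options A, B, and E.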

6.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

  • Which of the following code blocks will remove the rows where the value in column age is greater than 25 from the existing Delta table my_table and save the updated table?

A. SELECT * FROM my_table WHERE age > 25;

B. UPDATE my_table WHERE age > 25;

C. DELETE FROM my_table WHERE age > 25;

D. UPDATE my_table WHERE age <= 25;

E. DELETE FROM my_table WHERE age <= 25;

Answer explanation

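Correct answer: C. DELETE FROM removes the rows that match the WHERE condition. UPDATE modifies existing rows and requires a SET clause, so options B and D are not valid statements. SELECT only returns a result set; it does not alter the table. Option E would delete the rows that should be kept.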
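A minimal sketch of the difference between reading and deleting, using the table and column from the question. The preview and check queries are additions for illustration and are not part of the original answer options.

```sql
-- Read-only preview of the rows that would be removed; the table is unchanged.
SELECT * FROM my_table WHERE age > 25;

-- Remove the matching rows. On a Delta table this is committed
-- as a new table version, so the change is saved without a further step.
DELETE FROM my_table WHERE age > 25;

-- Optional check: count any remaining rows with age > 25.
SELECT COUNT(*) AS remaining FROM my_table WHERE age > 25;
```

Running the SELECT first is a cheap way to confirm that the WHERE condition targets the intended rows before they are deleted.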

7.

MULTIPLE CHOICE QUESTION

45 sec • 1 pt

A data engineer has realized that they made a mistake when making a daily update to a table. They need to use Delta time travel to restore the table to a version that is 3 days old. However, when the data engineer attempts to time travel to the older version, they are unable to restore the data because the data files have been deleted.

Which of the following explains why the data files are no longer present?

A. The VACUUM command was run on the table

B. The TIME TRAVEL command was run on the table

C. The DELETE HISTORY command was run on the table

D. The OPTIMIZE command was run on the table

E. The HISTORY command was run on the table

Answer explanation

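Correct answer: A. VACUUM permanently deletes data files that are no longer referenced by the current table version and are older than the retention threshold (7 days by default). Once those files are gone, time travel to versions that depend on them fails. Because the version here is only 3 days old, VACUUM must have been run with a retention period shorter than the default. OPTIMIZE only compacts small files into larger ones, and DESCRIBE HISTORY only reads the transaction log. TIME TRAVEL and DELETE HISTORY are not Databricks commands.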
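The scenario can be sketched in Databricks SQL as follows. The table name my_table and the version number 5 are placeholders for illustration; the real version to restore would be taken from the history output.

```sql
-- List the table's versions with their timestamps and operations.
DESCRIBE HISTORY my_table;

-- Query the table as it was 3 days ago. This fails if VACUUM has
-- already deleted the data files that version depends on.
SELECT * FROM my_table TIMESTAMP AS OF date_sub(current_date(), 3);

-- Restore the table to an earlier version (5 is a hypothetical version number).
RESTORE TABLE my_table TO VERSION AS OF 5;

-- Preview which files VACUUM would delete, without deleting anything.
VACUUM my_table DRY RUN;

-- Delete unreferenced files older than 168 hours (the 7-day default).
VACUUM my_table RETAIN 168 HOURS;
```

Running VACUUM with DRY RUN first, and keeping the retention period at least as long as the time travel window the team relies on, avoids the situation described in the question.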
