Databricks Certified Data Engineer Associate PDF
Databricks Certified Data Engineer Associate PDF
Question 1
A data organization leader is upset about the data analysis team’s reports being different from the
data engineering team’s reports. The leader believes the siloed nature of their organization’s data
engineering and data analysis architectures is to blame.
Which of the following describes how a data lakehouse could alleviate this issue?
Options:
B. Both teams would use the same source of truth for their work
Answer: B
Question 2
Which of the following describes a scenario in which a data team will want to utilize cluster pools?
Options:
https://www.certification-questions.com
Databricks Databricks-Certified-Data-Engineer-Associate
Answer: E
Question 3
Which of the following is hosted completely in the control plane of the classic Databricks
architecture?
Options:
A. Worker node
D. Databricks Filesystem
E. Driver node
Answer: E
Question 4
Which of the following benefits of using the Databricks Lakehouse Platform is provided by Delta
Lake?
Options:
Answer: D
Question 5
Which of the following describes the storage organization of a Delta table?
Options:
A. Delta tables are stored in a single file that contains data, history, metadata, and other attributes.
https://www.certification-questions.com
Databricks Databricks-Certified-Data-Engineer-Associate
B. Delta tables store their data in a single file and all metadata in a collection of files in a separate
location.
C. Delta tables are stored in a collection of files that contain data, history, metadata, and other
attributes.
D. Delta tables are stored in a collection of files that contain only the data stored within the table.
E. Delta tables are stored in a single file that contains only the data stored within the table.
Answer: C
Question 6
Which of the following code blocks will remove the rows where the value in column age is greater
than 25 from the existing Delta table my_table and save the updated table?
Options:
Answer: C
Question 7
A data engineer has realized that they made a mistake when making a daily update to a table. They
need to use Delta time travel to restore the table to a version that is 3 days old. However, when the
data engineer attempts to time travel to the older version, they are unable to restore the data
because the data files have been deleted.
Which of the following explains why the data files are no longer present?
Options:
https://www.certification-questions.com
Databricks Databricks-Certified-Data-Engineer-Associate
Answer: C
Question 8
Which of the following Git operations must be performed outside of Databricks Repos?
Options:
A. Commit
B. Pull
C. Push
D. Clone
E. Merge
Answer: D
Explanation:
Reference: https://docs.databricks.com/repos/repos-setup.html
Question 9
Which of the following data lakehouse features results in improved data quality over a traditional
data lake?
Options:
A. A data lakehouse provides storage solutions for structured and unstructured data.
Answer: C
Question 10
A data engineer needs to determine whether to use the built-in Databricks Notebooks versioning or
https://www.certification-questions.com
Databricks Databricks-Certified-Data-Engineer-Associate
Options:
Answer: B
https://www.certification-questions.com