Databricks Certified Data Engineer Associate Online Practice Questions

Home / Databricks / Databricks Certified Data Engineer Associate

Latest Databricks Certified Data Engineer Associate Exam Practice Questions

The practice questions for Databricks Certified Data Engineer Associate exam was last updated on 2025-09-15 .

Viewing page 1 out of 6 pages.

Viewing questions 1 out of 31 questions.

Question#1

A data engineer has a Job that has a complex run schedule, and they want to transfer that schedule to other Jobs.
Rather than manually selecting each value in the scheduling form in Databricks, which of the following tools can the data engineer use to represent and submit the schedule programmatically?

A. pyspark.sql.types.DateType
B. datetime
C. pyspark.sql.types.TimestampType
D. Cron syntax
E. There is no way to represent and submit this information programmatically

Question#2

Which of the following data workloads will utilize a Gold table as its source?

A. A job that enriches data by parsing its timestamps into a human-readable format
B. A job that aggregates uncleaned data to create standard summary statistics
C. A job that cleans data by removing malformatted records
D. A job that queries aggregated data designed to feed into a dashboard
E. A job that ingests raw data from a streaming source into the Lakehouse

Question#3

A data engineer is working with two tables. Each of these tables is displayed below in its entirety.



The data engineer runs the following query to join these tables together:



Which of the following will be returned by the above query?


A. Option A
B. Option B
C. Option C
D. Option D
E. Option E

Question#4

A data engineer wants to create a data entity from a couple of tables. The data entity must be used by other data engineers in other sessions. It also must be saved to a physical location.
Which of the following data entities should the data engineer create?

A. Database
B. Function
C. View
D. Temporary view
E. Table

Explanation:
In the context described, creating a "Table" is the most suitable choice. Tables in SQL are data entities that exist independently of any session and are saved in a physical location. They can be accessed and manipulated by other data engineers in different sessions, which aligns with the requirements stated. A "Database" is a collection of tables, views, and other database objects. A "Function" is a stored procedure that performs an operation. A "View" is a virtual table based on the result-set of an SQL statement, but it is not stored physically. A "Temporary view" is a feature that allows you to store the result of a query as a view that disappears once your session with the database is closed.

Question#5

A data engineer needs to create a table in Databricks using data from their organization’s existing SQLite database.
They run the following command:



Which of the following lines of code fills in the above blank to successfully complete the task?

A. org.apache.spark.sql.jdbc
B. autoloader
C. DELTA
D. sqlite
E. org.apache.spark.sql.sqlite

Explanation:
https://docs.yugabyte.com/preview/integrations/apache-spark/spark-sql/ CREATE TABLE new_employees_table USING JDBC
OPTIONS (
url "<jdbc_url>",
dbtable "<table_name>",
user '<username>',
password '<password>'
) AS
SELECT * FROM employees_table_vw
https://docs.databricks.com/external-data/jdbc.html#language-sql

Exam Code: Databricks Certified Data Engineer AssociateQ & A: 117 Q&AsUpdated:  2025-09-15

 Get All Databricks Certified Data Engineer Associate Q&As