Professional Data Engineer Online Practice Questions

Home / Google / Professional Data Engineer

Latest Professional Data Engineer Exam Practice Questions

The practice questions for Professional Data Engineer exam was last updated on 2025-06-03 .

Viewing page 1 out of 26 pages.

Viewing questions 1 out of 130 questions.

Question#1

You have a data processing application that runs on Google Kubernetes Engine (GKE). Containers need to be launched with their latest available configurations from a container registry. Your GKE nodes need to have GPUs. local SSDs, and 8 Gbps bandwidth. You want to efficiently provision the data processing infrastructure and manage the deployment process.
What should you do?

A. Use Compute Engi.no startup scriots to pull container Images, and use gloud commands to provision the infrastructure.
B. Use GKE to autoscale containers, and use gloud commands to provision the infrastructure.
C. Use Cloud Build to schedule a job using Terraform build to provision the infrastructure and launch with the most current container images.
D. Use Dataflow to provision the data pipeline, and use Cloud Scheduler to run the job.

Explanation:
https://cloud.google.com/architecture/managing-infrastructure-as-code

Question#2

To give a user read permission for only the first three columns of a table, which access control method would you use?

A. Primitive role
B. Predefined role
C. Authorized view
D. It's not possible to give access to only the first three columns of a table.

Explanation:
An authorized view allows you to share query results with particular users and groups without giving them read access to the underlying tables. Authorized views can only be created in a dataset that does not contain the tables queried by the view.
When you create an authorized view, you use the view's SQL query to restrict access to only the rows and columns you want the users to see.
Reference: https://cloud.google.com/bigquery/docs/views#authorized-views

Question#3

You have Cloud Functions written in Node.js that pull messages from Cloud Pub/Sub and send the data to BigQuery. You observe that the message processing rate on the Pub/Sub topic is orders of magnitude higher than anticipated, but there is no error logged in Stackdriver Log Viewer.
What are the two most likely causes of this problem? Choose 2 answers.

A. Publisher throughput quota is too small.
B. Total outstanding messages exceed the 10-MB maximum.
C. Error handling in the subscriber code is not handling run-time errors properly.
D. The subscriber code cannot keep up with the messages.
E. The subscriber code does not acknowledge the messages that it pulls.

Question#4

You created an analytics environment on Google Cloud so that your data scientist team can explore data without impacting the on-premises Apache Hadoop solution. The data in the on-premises Hadoop Distributed File System (HDFS) cluster is in Optimized Row Columnar (ORC) formatted files with multiple columns of Hive partitioning. The data scientist team needs to be able to explore the data in a similar way as they used the on-premises HDFS cluster with SQL on the Hive query engine. You need to choose the most cost-effective storage and processing solution.
What should you do?

A. Import the ORC files lo Bigtable tables for the data scientist team.
B. Import the ORC files to BigOuery tables for the data scientist team.
C. Copy the ORC files on Cloud Storage, then deploy a Dataproc cluster for the data scientist team.
D. Copy the ORC files on Cloud Storage, then create external BigQuery tables for the data scientist team.

Question#5

By default, which of the following windowing behavior does Dataflow apply to unbounded data sets?

A. Windows at every 100 MB of data
B. Single, Global Window
C. Windows at every 1 minute
D. Windows at every 10 minutes

Explanation:
Dataflow's default windowing behavior is to assign all elements of a PCollection to a single, global window, even for unbounded PCollections
Reference: https://cloud.google.com/dataflow/model/pcollection

Exam Code: Professional Data EngineerQ & A: 371 Q&AsUpdated:  2025-06-03

 Get All Professional Data Engineer Q&As