Master the data engineering stack on Google Cloud: BigQuery analytics, Dataflow streaming pipelines, Pub/Sub event buses, Bigtable time-series storage, Vertex AI MLOps, and Cloud Composer orchestration. 60 scenario-based practice questions.
| Field | Details |
|---|---|
| Exam Code | Professional Data Engineer (updated 2023) |
| Questions | 50–60 multiple-choice and multiple-select |
| Duration | 2 hours |
| Passing Score | ~70% (Google does not publish exact score) |
| Price | $200 USD |
| Recertification | Every 2 years |
| Recommended Experience | 3+ years industry experience, 1+ year on GCP |
| Prerequisites | None official (GCP ACE or PCA recommended) |
The exam loves asking which storage system to use. The deciding factor: Bigtable for high-throughput, low-latency reads/writes of time-series or IoT data at millions of QPS. BigQuery for analytical queries over terabytes with SQL. If the scenario says "real-time sensor data at <10ms latency," it's Bigtable. If it says "analyze 3 years of sales data," it's BigQuery.
Three window types tested heavily: Fixed (tumbling) — non-overlapping equal-size windows, e.g., aggregate sales per hour. Sliding — overlapping windows for moving averages, e.g., 1-hour window every 5 minutes. Session — gap-based, variable duration, ideal for user activity sessions. Late data is handled with allowedLateness and trigger strategies.
The PDE exam tests CREATE MODEL type selection: LINEAR_REG for numeric predictions (price forecasting). LOGISTIC_REG for binary/multi-class classification (churn prediction). KMEANS for unsupervised clustering (customer segmentation). BOOSTED_TREE_CLASSIFIER for high-accuracy tabular classification. ARIMA_PLUS for time-series forecasting with trend/seasonality decomposition.
CREATE MODEL and evaluate it with ML.EVALUATE. Explore Vertex AI Feature Store and understand the difference between online and offline serving. Study MLOps pipeline patterns.
WHERE clause. If you wrap the partition column in a function (DATE(timestamp_col)), pruning is disabled. The exam tests this with cost optimization scenarios.
60 scenario-based questions covering all 5 PDE exam domains. Immediate feedback with detailed explanations. No signup, no paywall.
GCP PDE exam tips on the CertQuests podcast →