Course Path Icon Course

Build Batch Data Pipelines on Google Cloud

9 год 15 годин Середній
Course Path Shape

In this intermediate course, you will learn to design, build, and optimize robust batch data pipelines on Google Cloud. Moving beyond fundamental data handling, you will explore large-scale data transformations and efficient workflow orchestration, essential for timely business intelligence and critical reporting.

Get hands-on practice using Dataflow for Apache Beam and Serverless for Apache Spark (Dataproc Serverless) for implementation, and tackle crucial considerations for data quality, monitoring, and alerting to ensure pipeline reliability and operational excellence. A basic knowledge of data warehousing, ETL/ELT, SQL, Python, and Google Cloud concepts is recommended.

Earn a badge today!

The Power of Challenge Labs

Now you can fast track your way to a skill badge without having to take the entire course. If you're confident with your skills, jump straight to the challenge lab.

Попередній перегляд