Francisco Andres Tapia Ibañez
成为会员时间:2023
钻石联赛
8298 积分
成为会员时间:2023
In this course you will get hands-on in order to work through real-world challenges faced when building streaming data pipelines. The primary focus is on managing continuous, unbounded data with Google Cloud products.
In this intermediate course, you will learn to design, build, and optimize robust batch data pipelines on Google Cloud. Moving beyond fundamental data handling, you will explore large-scale data transformations and efficient workflow orchestration, essential for timely business intelligence and critical reporting. Get hands-on practice using Dataflow for Apache Beam and Serverless for Apache Spark (Dataproc Serverless) for implementation, and tackle crucial considerations for data quality, monitoring, and alerting to ensure pipeline reliability and operational excellence. A basic knowledge of data warehousing, ETL/ELT, SQL, Python, and Google Cloud concepts is recommended.
While the traditional approaches of using data lakes and data warehouses can be effective, they have shortcomings, particularly in large enterprise environments. This course introduces the concept of a data lakehouse and the Google Cloud products used to create one. A lakehouse architecture uses open-standard data sources and combines the best features of data lakes and data warehouses, which addresses many of their shortcomings.
在本课程中,您将了解 Google Cloud 数据工程、数据工程师的角色和职责,以及相关的 Google Cloud 产品和服务。您还将了解如何应对数据工程挑战。
This course helps learners create a study plan for the PDE (Professional Data Engineer) certification exam. Learners explore the breadth and scope of the domains covered in the exam. Learners assess their exam readiness and create their individual study plan.
在本入门级课程中,您将了解 Google Cloud 的基础工具和服务。此课程提供了可选视频, 旨在帮助您深入了解和回顾实验中涉及的概念。Google Cloud 基础知识是推荐给 Google Cloud 学员的第一门课程 - 即使您几乎没有云相关知识,也能从中获得实践 经验,并将其直接运用于您的首个 Google Cloud 项目。从编写 Cloud Shell 命令和部署您的第一个虚拟机,到在 Kubernetes Engine 上运行应用 或者使用负载均衡,“Google Cloud 基础知识”都是您了解该平台 基本功能的首选入门级课程。