Ryan Boots
Member since: 2024
Diamond League
9653 points
This course is designed for data analysts who want to learn how to use BigQuery for their data analysis needs. Through a combination of videos, labs, and demos, it covers how to ingest, transform, and query your data in BigQuery to derive insights that support business decision making.
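In this beginner-level course, you will learn about the Google Cloud data analytics workflow and the tools you can use to explore, analyze, and visualize data and share your findings with stakeholders. Combining a case study, hands-on labs, lectures, and quizzes/demos, the course demonstrates how to turn raw datasets into clean data and then into useful visualizations and dashboards. Whether you already work with data and want to learn how to succeed on Google Cloud, or you are looking to advance your career, this course can help you take the first step. Almost anyone who performs or uses data analysis in their work can benefit from this course.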
In this intermediate course, you will learn to design, build, and optimize robust batch data pipelines on Google Cloud. Moving beyond fundamental data handling, you will explore large-scale data transformations and efficient workflow orchestration, essential for timely business intelligence and critical reporting. Get hands-on practice using Dataflow for Apache Beam and Serverless for Apache Spark (Dataproc Serverless) for implementation, and tackle crucial considerations for data quality, monitoring, and alerting to ensure pipeline reliability and operational excellence. A basic knowledge of data warehousing, ETL/ELT, SQL, Python, and Google Cloud concepts is recommended.
While the traditional approaches of using data lakes and data warehouses can be effective, they have shortcomings, particularly in large enterprise environments. This course introduces the concept of a data lakehouse and the Google Cloud products used to create one. A lakehouse architecture uses open-standard data sources and combines the best features of data lakes and data warehouses, which addresses many of their shortcomings.
This course helps learners create a study plan for the PDE (Professional Data Engineer) certification exam. Learners explore the breadth and scope of the domains covered in the exam. Learners assess their exam readiness and create their individual study plan.