Ryan Boots
成为会员时间:2024
钻石联赛
9653 积分
成为会员时间:2024
This course is designed for data analysts who want to learn about using BigQuery for their data analysis needs. Through a combination of videos, labs, and demos, we cover various topics that discuss how to ingest, transform, and query your data in BigQuery to derive insights that can help in business decision making.
這堂初級課程將介紹 Google Cloud 的資料分析工作流程,以及用於探索、分析資料並以圖表呈現的工具。您也能學會如何與相關人員分享自己的發現結果。本課程包含個案研究、實作實驗室、講座、測驗和示範,實際展示如何將原始資料集轉化為清晰的資料,進而呈現出能發揮成效的圖表和資訊主頁。無論您是資料領域從業人員、想瞭解如何透過 Google Cloud 取得成功,或有意在職涯中更上一層樓,本課程都能協助您踏出第一步。絕大多數在工作上執行或運用資料分析的學員,都能從本課程受益。
In this intermediate course, you will learn to design, build, and optimize robust batch data pipelines on Google Cloud. Moving beyond fundamental data handling, you will explore large-scale data transformations and efficient workflow orchestration, essential for timely business intelligence and critical reporting. Get hands-on practice using Dataflow for Apache Beam and Serverless for Apache Spark (Dataproc Serverless) for implementation, and tackle crucial considerations for data quality, monitoring, and alerting to ensure pipeline reliability and operational excellence. A basic knowledge of data warehousing, ETL/ELT, SQL, Python, and Google Cloud concepts is recommended.
While the traditional approaches of using data lakes and data warehouses can be effective, they have shortcomings, particularly in large enterprise environments. This course introduces the concept of a data lakehouse and the Google Cloud products used to create one. A lakehouse architecture uses open-standard data sources and combines the best features of data lakes and data warehouses, which addresses many of their shortcomings.
This course helps learners create a study plan for the PDE (Professional Data Engineer) certification exam. Learners explore the breadth and scope of the domains covered in the exam. Learners assess their exam readiness and create their individual study plan.