正在加载…
未找到任何结果。
在 LinkedIn 动态中分享 Twitter Facebook

Google Cloud Skills Boost

在 Google Cloud 控制台中运用您的技能

05

Building Batch Data Pipelines on Google Cloud

05

Building Batch Data Pipelines on Google Cloud

magic_button Dataproc Data Pipeline Data Pipelines
These skills were generated by AI. Do you agree this course teaches these skills?
13 个小时 中级

Data pipelines typically fall under one of the Extract and Load (EL), Extract, Load and Transform (ELT) or Extract, Transform and Load (ETL) paradigms. This course describes which paradigm should be used and when for batch data. Furthermore, this course covers several technologies on Google Cloud for data transformation including BigQuery, executing Spark on Dataproc, pipeline graphs in Cloud Data Fusion and serverless data processing with Dataflow. Learners get hands-on experience building data pipeline components on Google Cloud using Qwiklabs.

Skill badges validate your practical knowledge on specific products through hands-on labs and challenge assessments. Earn a badge by completing a course or jump straight into the challenge lab to get your badge today. Badges prove your proficiency, enhance your professional profile, and ultimately lead to increased career opportunities. Visit your profile to track badges you’ve earned.

info
课程信息
目标
  • Review different methods of data loading: EL, ELT and ETL and when to use what
  • Run Hadoop on Dataproc, leverage Cloud Storage, and optimize Dataproc jobs
  • Build your data processing pipelines using Dataflow
  • Manage data pipelines with Data Fusion and Cloud Composer
前提条件

Experience with data modeling and ETL (extract, transform, load) activities.

Experience with developing applications by using a common programming language such as Python or Java.

受众
Developers responsible for designing pipelines and architectures for data processing.
支持的语言
English, español (Latinoamérica), 日本語, français, português (Brasil), italiano, and 한국어

实验室挑战赛的强大作用

现在,您可以快速获得技能徽章,而无需完成整门课程。如果您对自己的技能有信心,请直接跳转到实验室挑战赛。

预览