
Thierno Diallo

Member since 2025

Gold League

13126 points
Work with Gemini Models in BigQuery Earned Apr 9, 2026 EDT
Boost Productivity with Gemini in BigQuery Earned Feb 13, 2026 EST
Build a Data Mesh with Dataplex Earned Feb 3, 2026 EST
Build a Data Warehouse with BigQuery Earned Jan 29, 2026 EST
Serverless Data Processing with Dataflow: Operations Earned Jan 27, 2026 EST
Serverless Data Processing with Dataflow: Develop Pipelines Earned Jan 8, 2026 EST
Serverless Data Processing with Dataflow: Foundations Earned Jan 7, 2026 EST
Build Streaming Data Pipelines on Google Cloud Earned Jan 7, 2026 EST
Build Batch Data Pipelines on Google Cloud Earned Dec 23, 2025 EST
Build Data Lakes and Data Warehouses on Google Cloud Earned Dec 8, 2025 EST
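Introduction to Data Engineering on Google Cloud Earned Dec 3, 2025 EST
Responsible AI for Developers: Interpretability & Transparency Earned Oct 30, 2025 EDT
Responsible AI for Developers: Fairness & Bias Earned Oct 28, 2025 EDT
Vector Search and Embeddings Earned Oct 24, 2025 EDT
Introduction to Vertex AI Studio Earned Oct 23, 2025 EDT
Create Image Captioning Models Earned Oct 23, 2025 EDT
Transformer Models and BERT Model Earned Oct 21, 2025 EDT
Encoder-Decoder Architecture Earned Oct 21, 2025 EDT
Attention Mechanism Earned Oct 15, 2025 EDT
Introduction to Image Generation Earned Oct 15, 2025 EDT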
Preparing for your Professional Data Engineer Journey Earned Mar 10, 2025 EDT

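This course demonstrates how to use AI/ML models in BigQuery for generative AI tasks. Through a practical use case involving customer relationship management, you learn the workflow for solving a business problem with Gemini models. To aid understanding, the course also provides step-by-step guidance through coding solutions that use SQL queries and Python notebooks.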

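This course explores Gemini in BigQuery, a suite of AI features that supports the data-to-AI workflow. These features include data exploration and preparation, code generation and troubleshooting, and workflow discovery and visualization. Through conceptual explanations, a real-world use case, and hands-on labs, the course helps data practitioners boost their productivity and speed up pipeline development.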

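Complete the introductory Build a Data Mesh with Dataplex skill badge course to demonstrate skills in building a data mesh with Dataplex to support data security, governance, and discovery on Google Cloud. You practice and test your skills in tagging assets, assigning IAM roles, and assessing data quality in Dataplex.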

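Complete the intermediate Build a Data Warehouse with BigQuery skill badge course to demonstrate skills in the following: joining data to create new tables, troubleshooting joins, appending data with unions, creating date-partitioned tables, and working with JSON, arrays, and structs in BigQuery.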

In the last installment of the Dataflow course series, we introduce the components of the Dataflow operational model. We examine tools and techniques for troubleshooting and optimizing pipeline performance. We then review testing, deployment, and reliability best practices for Dataflow pipelines. We conclude with a review of Templates, which make it easy to scale Dataflow pipelines to organizations with hundreds of users. These lessons help ensure that your data platform is stable and resilient to unanticipated circumstances.

In this second installment of the Dataflow course series, we dive deeper into developing pipelines using the Beam SDK. We start with a review of Apache Beam concepts. Next, we discuss processing streaming data using windows, watermarks, and triggers. We then cover options for sources and sinks in your pipelines, schemas to express your structured data, and how to do stateful transformations using the State and Timer APIs. We then move on to best practices that help maximize your pipeline performance. Towards the end of the course, we introduce SQL and DataFrames as ways to represent your business logic in Beam, and show how to iteratively develop pipelines using Beam notebooks.

This course is part 1 of a 3-course series on Serverless Data Processing with Dataflow. In this first course, we start with a refresher on what Apache Beam is and its relationship with Dataflow. Next, we talk about the Apache Beam vision and the benefits of the Beam Portability framework, which realizes the vision that developers can use their favorite programming language with their preferred execution backend. We then show you how Dataflow allows you to separate compute and storage while saving money, and how identity and access management (IAM) tools interact with your Dataflow pipelines. Lastly, we look at how to implement the right security model for your use case on Dataflow.

In this course, you work hands-on through real-world challenges faced when building streaming data pipelines. The primary focus is on managing continuous, unbounded data with Google Cloud products.

In this intermediate course, you will learn to design, build, and optimize robust batch data pipelines on Google Cloud. Moving beyond fundamental data handling, you will explore large-scale data transformations and efficient workflow orchestration, essential for timely business intelligence and critical reporting. Get hands-on practice using Dataflow for Apache Beam and Serverless for Apache Spark (Dataproc Serverless) for implementation, and tackle crucial considerations for data quality, monitoring, and alerting to ensure pipeline reliability and operational excellence. A basic knowledge of data warehousing, ETL/ELT, SQL, Python, and Google Cloud concepts is recommended.


While the traditional approaches of using data lakes and data warehouses can be effective, they have shortcomings, particularly in large enterprise environments. This course introduces the concept of a data lakehouse and the Google Cloud products used to create one. A lakehouse architecture uses open-standard data sources and combines the best features of data lakes and data warehouses, which addresses many of their shortcomings.


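In this course, you learn about data engineering on Google Cloud, the roles and responsibilities of data engineers, and the related Google Cloud products and services. You also learn how to address data engineering challenges.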

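This course introduces the concepts of AI interpretability and transparency and discusses why AI transparency matters for developers and engineers. It also explores practical methods and tools that help achieve interpretability and transparency in both data and AI models.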

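This course introduces the concept of Responsible AI and the AI principles. It covers practical techniques for identifying fairness and bias and for mitigating bias in AI/ML practice, and explores practical methods and tools for implementing Responsible AI best practices using Google Cloud products and open source tools.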

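In this course, explore AI-powered search technologies, tools, and applications. Learn semantic search using vector embeddings, hybrid search that combines semantic and keyword approaches, and retrieval-augmented generation (RAG) for building grounded AI agents that minimize AI hallucinations. Gain hands-on experience with Vertex AI Vector Search to build your own intelligent search engine.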

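This course introduces Vertex AI Studio, a tool for interacting with generative AI models, prototyping business ideas, and launching them into production. Through an immersive use case, engaging lessons, and a hands-on lab, you explore the prompt-to-product lifecycle and learn how to use Vertex AI Studio for multimodal Gemini applications, prompt design, prompt engineering, and model tuning. The aim is to help you unlock the potential of generative AI in your own projects with Vertex AI Studio.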

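This course teaches you how to create an image captioning model by using deep learning. You learn about the different components of an image captioning model, such as the encoder and decoder, and how to train and evaluate the model. By the end of this course, you will be able to create your own image captioning models and use them to generate captions for images.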

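This course introduces you to the Transformer architecture and the Bidirectional Encoder Representations from Transformers (BERT) model. You learn about the main components of the Transformer architecture, such as the self-attention mechanism, and how the architecture is used to build the BERT model. You also learn about the different tasks that BERT can be used for, such as text classification, question answering, and natural language inference. This course is estimated to take approximately 45 minutes to complete.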

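This course gives you a brief overview of the encoder-decoder architecture, a powerful and widely used machine learning architecture for sequence-to-sequence tasks such as machine translation, text summarization, and question answering. You learn about the main components of the encoder-decoder architecture and how to train and deploy these models. In the corresponding lab walkthrough, you code a simple implementation of the encoder-decoder architecture from scratch in TensorFlow for poetry generation.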

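This course introduces you to the attention mechanism, a powerful technique that allows neural networks to focus on specific parts of an input sequence. You learn how attention works and how it can be used to improve the performance of a variety of machine learning tasks, including machine translation, text summarization, and question answering.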

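This course introduces diffusion models, a family of machine learning models that has recently shown great promise in image generation. Diffusion models draw inspiration from physics, specifically thermodynamics. Within the last few years, they have become a popular research topic and have spread across the industry. Many state-of-the-art image generation models and tools on Google Cloud are built on diffusion models. This course introduces you to the theory behind diffusion models and how to train and deploy them on Vertex AI.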

This course helps learners create a study plan for the PDE (Professional Data Engineer) certification exam. Learners explore the breadth and scope of the domains covered in the exam. Learners assess their exam readiness and create their individual study plan.
