Course

Google DeepMind: 02 Represent Your Language Data

45 minutes · Intermediate · Updated 6 months ago

In this Google DeepMind course, you will learn how to prepare text data for language models to process. You will investigate the tools and techniques used to prepare, structure, and represent text data for language models, with a focus on tokenization and embeddings. You will be encouraged to think critically about the decisions behind data preparation and about which biases in the data may be introduced into models. You will analyze trade-offs, learn how to work with vectors and matrices, and explore how meaning is represented in language models. Finally, you will practice designing a dataset ethically using the Data Cards process, ensuring transparency, accountability, and respect for community values in AI development.

Earn a badge today!

The power of challenge labs

You can now earn a skill badge without taking the entire course. If you are confident in your skills, go straight to the challenge lab.

Overview