Course

Google DeepMind: 04 Discover The Transformer Architecture

1 hour | Intermediate | Updated 4 months ago

In this Google DeepMind course, you will discover the mechanisms of the transformer architecture. You will investigate how transformer language models process prompts to make context-sensitive next-token predictions. Through practical activities, you will explore the attention mechanism, visualize attention weights, and encounter advanced concepts such as masked attention and multi-head attention. You will also learn other techniques needed to build neural networks that work well as language models. Finally, through activities on values, stakeholder mapping, and community engagement, you will practice concrete tools for ensuring AI projects are developed with communities, not just for them.

