AI Agents represent a major shift beyond traditional large language models (LLMs): instead of simply generating text-based solutions, they can also act autonomously to execute them. This course introduces the fundamentals of AI Agents, how they differ from LLM APIs, and where they add value in the real world. Based on Google’s agents whitepaper, it provides the theoretical foundation needed before writing your first lines of agent code—ideal for developers, architects, and technical decision-makers who want to understand AI systems through the lens of autonomous, goal-directed behavior (and not just text generation). Join the community forum for questions and discussions.
Complete the introductory Build Real World AI Applications with Gemini and Imagen skill badge to demonstrate skills in the following: image recognition, natural language processing, image generation using Google's powerful Gemini and Imagen models, deploying applications on the Google Cloud Agent Platform.
Complete the introductory Prompt Design in Agent Platform skill badge to demonstrate skills in the following: prompt engineering, image analysis, and multimodal generative techniques, within Agent Platform. Discover how to craft effective prompts, guide generative AI output, and apply Gemini models to real-world marketing scenarios.