Exploring Google Gemini and Generative AI
#1about 1 minute
Generative AI code is simple but prompting is complex
The core challenge in generative AI development isn't writing code, but mastering prompt engineering to get desired results, similar to writing performant SQL.
#2about 3 minutes
Understanding Google Gemini models and capabilities
Google Gemini offers different models like Pro and Flash for varying needs, supporting a large context window for inputs like video, audio, and code.
#3about 3 minutes
Getting your API key and making your first call
Obtain a free-tier API key easily through AI Studio without needing the full Google Cloud Platform, and test it immediately with a provided curl command.
#4about 4 minutes
Prototyping prompts and writing code with Node.js
Use AI Studio as a playground to test prompts and generate starter code, then implement it using the Node.js SDK for simple question-and-answer interactions.
#5about 5 minutes
Processing images and files with multimodal input
Leverage Gemini's multimodal capabilities by uploading images via the Files API to analyze their content and automate tasks like generating descriptive filenames.
#6about 3 minutes
Building conversational context with chat history
Create stateful chat interactions by sending the entire conversation history with each new message, a process the Gemini SDK manages automatically.
#7about 3 minutes
Defining model persona and style with system instructions
Use system instructions to formally define a model's persona, tone, and subject matter constraints, ensuring consistent and tailored responses for specific use cases.
#8about 4 minutes
Enforcing structured output with JSON Schema
Ensure reliable and structured data from the model by specifying the desired output format as JSON and defining its precise structure using a JSON Schema.
#9about 3 minutes
Exploring practical use cases and model limitations
Real-world applications of Gemini include a movie recommendation system and a Dungeons and Dragons tool, but it can fail at tasks requiring strategic reasoning like blackjack.
#10about 3 minutes
Running on-device AI in the browser with Gemini Nano
Gemini Nano brings generative AI directly into the Chrome browser, enabling on-device processing for tasks like summarization and translation without API calls.
#11about 4 minutes
Implementing summarization and translation with web APIs
Use the experimental `window.ai` object in Chrome to implement features like text summarization and translation that run entirely on the user's device.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
01:31 MIN
Introducing generative AI in the browser with Chrome AI
aa
15:26 MIN
Building web apps and live experiences with AI Studio
Google Gemma and Open Source AI Models - Clement Farabet
18:41 MIN
Building agents with Google Gemini and open source tools
Beyond Chatbots: How to build Agentic AI systems
14:51 MIN
The alternative: Built-in AI and the Prompt API
Prompt API & WebNN: The AI Revolution Right in Your Browser
34:02 MIN
Building interactive agents with the Gemini Live API
What’s New with Google Gemini?
45:45 MIN
A call for developer feedback on AI Studio and APIs
Google Gemma and Open Source AI Models - Clement Farabet
22:12 MIN
Exploring diverse real-world use cases for AI agents
Beyond the IDE: A new era of agent collaboration
05:49 MIN
Understanding the roles of Gemini and Gemma models
Google Gemma and Open Source AI Models - Clement Farabet
Featured Partners
Related Videos
Google Gemini: Open Source and Deep Thinking Models - Sam Witteveen
Sam Witteveen
What’s New with Google Gemini?
Logan Kilpatrick
aa
aa
Google Gemma and Open Source AI Models - Clement Farabet
Developer Productivity Using AI Tools and Services - Ryan J Salva
Ryan J Salva
Beyond the IDE: A new era of agent collaboration
Ryan J. Salva
Exploring the Future of Web AI with Google
Thomas Steiner
Lessons Learned Building a GenAI Powered App
Mete Atamel
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.





Front End Engineering Manager ( Generative AI experience )
Accenture
GraphQL
React Native
Continuous Integration

Gemini Adoption Solution Engineer, Higher Education (Multiple Languages)
Google Netherlands B.V.
Senior
HTML
Machine Learning



AI Content Expert, Artificial General Intelligence
Amazon.com, Inc
Shoreham-by-Sea, United Kingdom
HTML
JSON
Data analysis