AGENT0S
HomeLibraryAgentic
FeedbackLearn AI
LIVE
Agent0s · AI Intelligence Library
Share FeedbackUpdated daily · 7am PST
Library/model
modelbeginnerGeneral AI

Google Launches Gemini 3 with 2M Token Context and Multimodal Capabilities

Google has released Gemini 3, a new AI model that can understand text, images, audio, and video simultaneously. It boasts a very large memory (2-million-token context window) and is 30% more efficient and 15% cheaper than competitors, making it a powerful new option for developers.

AI SETUP PROMPT

Paste into Claude Code or Codex CLI — it will scan your project and set everything up

# Evaluate Model: Google Launches Gemini 3 with 2M Token Context and Multimodal Capabilities

## What This Is
Google has released Gemini 3, a new AI model that can understand text, images, audio, and video simultaneously. It boasts a very large memory (2-million-token context window) and is 30% more efficient and 15% cheaper than competitors, making it a powerful new option for developers.

Source: https://aidailyshot.com/blog/google-gemini-3-launch-2026-analysis

## Before You Start

Scan my workspace and analyze:
- The project language, framework, and current AI integrations
- Existing AI provider config (check .env, .env.local, config files for API keys — OpenRouter, OpenAI, Anthropic, Google AI, etc.)
- Which AI models I currently use and for what purposes

Then ask me before proceeding:
1. Am I interested in evaluating this model for my project, or just want a summary of what it offers?
2. If I want to try it — which part of my current AI stack should it replace or complement?

## Source Access Note

The source URL (https://aidailyshot.com/blog/google-gemini-3-launch-2026-analysis) may not be directly accessible from the terminal. Use the Reference Implementation and Additional Context sections below instead. If you need more details, ask me to paste relevant content from the source.

## What to Implement

This is a **New AI Model** — a model release, update, or capability announcement.

- Analyze the best use cases for this model within my project and current AI stack
- Compare its strengths, pricing, and context window against whatever I currently use
- Give me a clear, convincing argument for why this model would (or would not) be a good fit for my project
- If I want to try it: update my API configuration (provider, model ID, any new parameters) to point to this model
- If it requires a new API key or provider signup, tell me exactly what to do

## Additional Context

- Analyze the user's current project to identify opportunities where Gemini 3's 2-million-token context window and multimodal features would provide a significant advantage over their existing model.
- Benchmark the performance and cost of Gemini 3 against the user's currently configured AI provider for a representative task. Present a cost-benefit analysis based on the stated 15% lower pricing.
- If the user approves switching, update the project's AI API client configuration to use the new Google Gemini 3 model endpoint. Scan environment variables for an existing `GOOGLE_API_KEY` before prompting the user to add one.

## Guidelines

- Adapt everything to my existing project — do not assume a specific stack or directory layout
- Use whichever AI provider I already have configured; if I need a new one, tell me what to sign up for and I'll give you the key
- Check my .env files for existing API keys (OpenRouter, OpenAI, Anthropic, Google AI) before asking me to add one
- Review any fetched code for safety before installing or executing it
- After setup, run a quick verification and show me a summary of exactly what was installed, where, and how to use it
3,081 charactersCompatible with Claude Code & Codex CLI
MANUAL SETUP STEPS
  1. 01Analyze the user's current project to identify opportunities where Gemini 3's 2-million-token context window and multimodal features would provide a significant advantage over their existing model.
  2. 02Benchmark the performance and cost of Gemini 3 against the user's currently configured AI provider for a representative task. Present a cost-benefit analysis based on the stated 15% lower pricing.
  3. 03If the user approves switching, update the project's AI API client configuration to use the new Google Gemini 3 model endpoint. Scan environment variables for an existing `GOOGLE_API_KEY` before prompting the user to add one.

FIELD OPERATIONS

Long-Form Video Content Scribe and Analyzer

An application that ingests multi-hour video lectures or interviews, transcribes the audio, and uses the full 2M token context window to generate a comprehensive, chapterized summary, identify key themes, and answer detailed questions about the entire content without truncation.

Interactive Product Manual Generator

A tool that takes a user's video recording of a physical product, images of its parts, and a text description of its function. Gemini 3 processes all inputs to generate a searchable, interactive, step-by-step user manual with diagrams and troubleshooting guides.

STRATEGIC APPLICATIONS

  • →Automated Legal Document Review: Use the 2-million-token context window to analyze entire case files, including scanned documents (images), deposition recordings (audio/video), and legal briefs (text), to instantly find precedents, identify contradictions, and summarize key arguments.
  • →Hyper-Personalized Ad Campaign Generation: Feed Gemini 3 a target customer's entire social media history (images, videos, text posts) to generate highly personalized ad copy, visuals, and video scripts that resonate with their specific interests and communication style.

TAGS

#gemini#google#multimodal#large context window#api#tpu#veo
Source: WEB · Quality score: 8/10
VIEW SOURCE