Gemini
Bridge Claude with Google's Gemini via the official Gemini CLI. Leverages Gemini's massive 1M+ token context window for large file and codebase analysis while Claude handles interaction and code editing.
Discontinued on consumer tiers — migrate to Antigravity
From 2026-06-18, Google restricts Gemini CLI access to Gemini Code Assist Standard/Enterprise seats; free, Google AI Pro, and Ultra accounts lose access. ask-gemini-mcp still installs and launches — the failure is a runtime auth/quota error from Google's backend, not a missing binary.
On a subscription tier? Migrate to Antigravity (agy) — Google's successor CLI, covered by the same AI Pro/Ultra subscription with no per-token billing. Install ask-antigravity-mcp, or switch to ask-codex / ask-ollama. The 2026-06-18 tier change section below covers what happens at runtime.
2026-06-18 tier change
From 2026-06-18, Google serves Gemini CLI requests only for Gemini Code Assist Standard/Enterprise accounts; free, Google AI Pro, and Ultra accounts lose access. The surviving tiers keep working with the same gemini binary and backend.
What this means for ask-gemini-mcp:
- The npm package still installs and launches — the binary is unchanged. The failure for non-enterprise accounts is a runtime auth/quota error, not a missing binary, so reinstalling will not help.
- On or after the cutoff, an auth/quota-class failure now surfaces an actionable notice (cutoff date + options) instead of a raw error. The note is advisory ("likely caused by the tier change") — a genuine auth, billing, or quota error can also trigger it.
- Google's successor is the Antigravity CLI (
agy) — a separate, closed-source binary thatask-gemini-mcpdoes not wrap. Use the dedicatedask-antigravity-mcppackage (covered by the same Google AI Pro/Ultra subscription, no per-token billing), runagydirectly, or switch toask-codex/ask-ollama. - Testing the guidance: set
ASK_GEMINI_TIER_CUTOFFto a past UTC instant (e.g.2020-01-01T00:00:00Z) to force the post-cutoff gate on; an auth/quota failure will then prepend the notice. The default cutoff is2026-06-18T00:00:00Z.
Best for: whole-codebase reads using the 1M+ token context window, if you have an eligible Gemini Code Assist Standard/Enterprise seat. Not for: most users from 2026-06-18 (see the notice above). For large-context reads without an enterprise seat, use Antigravity.
Installation
Run in your terminal:
# Project scope (current project only)
claude mcp add gemini-cli -- npx -y ask-gemini-mcp
# User scope (all projects)
claude mcp add --scope user gemini-cli -- npx -y ask-gemini-mcp Or install as a plugin (adds slash commands like /multi-review, /brainstorm, /compare, plus reviewer subagents and the opt-in continuous codex-pair review hook):
/plugin marketplace add Lykhoyda/ask-llm
/plugin install ask-llm@ask-llm-pluginsOr install globally:
npm install -g ask-gemini-mcpPrerequisites
- Node.js v20.0.0 or higher
- Gemini CLI installed and authenticated:
npm install -g @google/gemini-cli
gemini loginTools
| Tool | Purpose |
|---|---|
ask-gemini | Send prompts to Gemini CLI with @ file syntax. Optional sessionId for multi-turn; live progressive output via --output-format stream-json (ADR-057) |
ask-gemini-edit | Structured code edits via Gemini changeMode. Returns OLD/NEW edit blocks. Supports includeDirs for monorepo context |
fetch-chunk | Retrieve subsequent chunks from cached large responses |
get-usage-stats | Per-session token totals + breakdowns by provider/model. In-memory (ADR-054) |
ping | Fast connection test to verify MCP setup |
ask-gemini returns both human-readable text and a structured AskResponse (provider, response, model, sessionId, usage) via MCP outputSchema — programmatic clients can extract the sessionId and usage fields directly without parsing the response footer (ADR-065).
Models
- Default:
gemini-3.1-pro-preview(latest, highest capability) - Fallback:
gemini-3.5-flash(automatic on quota errors per ADR-044)
Key Features
- 1M+ token context for analyzing entire codebases
- Multi-turn sessions via
sessionId— native--resume <id>(zero replay cost) - Include directories for monorepo context (
includeDirsparameter onask-gemini-edit) - Live progressive output — assistant message deltas stream to MCP progress notifications, no frozen waits on long calls
- Structured AskResponse via outputSchema for programmatic clients
- Automatic quota fallback from Pro to Flash on
RESOURCE_EXHAUSTED
npm
- Package: ask-gemini-mcp
- Binary:
ask-gemini-mcp