7 Minutes to Master Gemini Spark

Last updated: May 20, 2026 — first-day, off the I/O keynote. Read time: 7 minutes. What you’ll learn: what Spark actually is (not the marketing version), the exact access rules today, your first Spark task, how it compares to Manus / Operator / Claude, and whether it’s worth waiting for or paying for now.

The model everyone talked about from I/O 2026 was Gemini 3.5 Flash. The product everyone will talk about a month from now is Gemini Spark — Google’s first serious attempt at a 24/7 personal AI agent. It runs on its own cloud VM, reads your Gmail / Docs / Sheets, drafts replies on your behalf, and keeps working while your laptop is closed.

Spark is the first place Gemini 3.5 Flash gets to do what Google actually built it for: hours-long agentic execution with minimal supervision.

This is the 7-minute guide. Skip the keynote replay.

What is Gemini Spark (60 seconds)

Three sentences for the impatient:

  • Spark is a personal AI agent inside the Gemini app that “helps you navigate your digital life, taking action on your behalf and under your direction” (Google’s official description).
  • It runs on dedicated virtual machines on Google Cloudnot on your phone or laptop — so it’s truly 24/7. You can close everything and Spark keeps working.
  • Powered by Gemini 3.5 + Google Antigravity (the agentic harness Google launched alongside 3.5 Flash). Our Gemini 3.5 Flash guide covers the model side; Spark is the consumer product that puts it to work.

The thing that’s actually new — and that separates Spark from “Gemini app with a fancier mode” — is the decoupling from your device. Manus, Cursor agents, even Claude with Computer Use all need something of yours running. Spark doesn’t.

How to access it today (1 minute)

This is where most launch coverage gets wrong, so read this section carefully:

TierWhoWhen
Trusted testersLimited invite from GoogleAvailable now (started May 19, 2026)
Beta — Google AI Ultra subscribers, USAnyone on the new $100/month or $200/month Ultra tiers (see Pricing below)The week of May 26, 2026
Wider rollout (other AI Ultra markets)Ultra users outside USNot announced
MCP third-party integrationsAll Spark users”Coming weeks”
Android Halo (live agent UI on phone)Android users”Later this year”
Chrome-native (Spark inside the browser)Chrome users”Later this summer”
Free tierNot announced. Treat as paid-only for the foreseeable future.

Heads-up — pricing changed on launch day. Google restructured its AI plans at I/O 2026: the old $250/month Ultra plan is gone, replaced by a new $100/month Ultra tier and a $200/month top tier (same capabilities as the former $250, just cheaper). Spark Beta is included on both. See the Pricing section below.

Honest summary: unless you’re a US Ultra subscriber, you’re on the waitlist. Sign in to https://gemini.google.com with the Google account you’d want to grant Spark access to — that’s where the agent will surface when your turn comes up.

If you want to try the underlying model and harness today, you can use Gemini 3.5 Flash directly in Google AI Studio — same model, same agentic capability, just no Gmail/Docs auto-integration. See our Gemini 3.5 Flash tutorial for the API path.

Your first Spark task (2 minutes)

The wrong way to use Spark is to chat with it. Chat is what the regular Gemini app is for. The right way is to give it a goal with constraints and let it run.

Once you have access, here’s a clean template that works for most early use cases. Open Spark, paste this, replace the bracketed bits:

Goal: [one sentence — what success looks like]

Context Spark needs (pull from my Google account):
- [Gmail thread / sender / label]
- [Doc / Sheet / Slide title]

Constraints:
- Don't send anything; draft only — leave it in Gmail drafts.
- If you need a decision I haven't given you, pause and ask.
- Spend no more than [N] minutes / [N] iterations.

When you're done:
- Summarize what you did in 5 bullets.
- List every file or message you touched.
- Flag anything you're unsure about.

A concrete example you can adapt:

Goal: Draft a follow-up email to every prospect who replied to my
"Pricing" email last week but hasn't responded since I sent the deck.

Context:
- Gmail label: "Pipeline/Pricing-Sent"
- Doc: "Q3 Pricing Deck - Final" in my Drive

Constraints:
- Don't send anything; draft only.
- Use the tone of my last three replies to this label as a style guide.
- Different draft per prospect, referencing what they said.

When you're done:
- List every prospect, what they said last, and the draft subject line.
- Flag anyone where the right move is a call, not an email.

The most important constraint above is “draft only — don’t send.” Agents are very good at “looking ready” to act; you want a human-in-the-loop checkpoint until you trust the output for that task.

First-time tip: start Spark on something where you’d be fine if the output was mediocre — drafting cold outreach is a good first test, because the worst case is “I delete the drafts and write them myself.” Don’t start with reply-to-investor emails.

Top 5 things Spark is actually built for

Per Google’s demos and the agent harness Antigravity is tuned for:

1. Inbox triage and reply drafting (the headline demo)

Spark reads incoming Gmail, classifies, drafts, leaves in drafts folder. Small-business positioning: “monitoring inboxes for customer inquiries.” If you spend >30 min/day on inbox, this is the breakeven case.

2. Cross-document synthesis

Spark can pull from emails + Docs + Sheets + Slides in a single task — the canonical demo was “draft a board update by gathering the relevant data from these 7 sources.” This is the kind of work where Gemini 3.5’s 1M context window pays for itself.

3. Long-horizon research

It “runs autonomously for multiple hours” between human checkpoints (TechCrunch quoting Google’s CTO Koray Kavukcuoglu). Practical use: “monitor these 5 competitors and Slack me when one ships.”

4. Workspace busywork (formatting, summaries, slide updates)

Things where the human value-add is taste, not labor. Spark does the labor; you do the taste pass.

5. Future-state: bring-your-own-tools via MCP

The big upgrade is “in the coming weeks, third-party tools through MCP” — so you’ll be able to give Spark your CRM, your project tracker, your custom data sources, the same way you’d connect tools to Claude. The day this lands is the day Spark becomes more interesting than the Google-only demo.

Two underrated features you’ll only spot in the live demo

The marketing pages skip these; the keynote demo made them obvious.

Custom Skills — teach Spark your voice and workflow

Spark supports user-defined Skills: small, persistent capability modules that teach Spark how you specifically want a class of task done. On stage, Google’s Josh Woodward demoed a “Ghostwriter” Skill — Spark drafted an email pulling from Gmail / Docs / chat history, then ran it through the Ghostwriter Skill to match his personal writing tone.

Think of Skills as the difference between “Gemini knows what a follow-up email looks like” and “Gemini knows what your follow-up email looks like.” Reasonable bet: this is what makes Spark stickier than a generic agent — your Skills travel with you, custom-tuned models for your workflow.

There’s no public Skills marketplace yet, but expect one. Until then, you build your own.

Voice → multi-task decomposition

In the demo, Woodward held up a phone and said in one continuous voice prompt:

“Find all meetings with Sundar and tag them hot pink. Write an invitation to our new neighbor John for the block party. Create a doc listing everything to do for the kids before end of school year, sorted by deadline.”

Spark didn’t treat that as one task. It split the speech into three independent task threads and dispatched them in parallel. The user gets a single voice prompt, the agent gets three concurrent work streams.

This is the closest thing yet to “talk to your assistant like a human.” Earlier agent products would have either taken one task and ignored the other two, or pretended to do all three in one tangled run. Spark planned, decomposed, and parallelized.

Gemini Spark vs Manus / Operator / Cursor / Claude

There are now four real “general personal agent” products in market. They’re not the same thing.

Gemini SparkManusOpenAI OperatorClaude (Computer Use / Projects)
Runs whereGoogle Cloud VM (24/7, off-device)Cloud (off-device)Cloud + your browserYour computer (Computer Use)
Underlying modelGemini 3.5 + AntigravityMulti-model under the hoodGPT-5 familyClaude Sonnet 4.6
Workspace integrationNative: Gmail, Docs, Sheets, SlidesPlug-in styleWeb browser basedComputer Use = controls your screen
Third-party toolsMCP, “coming weeks”YesLimitedMCP, mature
Free tierNoLimited freeNo (paid only)Yes (Claude.ai free)
Where to trygemini.google.com (Ultra US, beta)manus.imchatgpt.com (Pro)claude.ai
Best forGmail/Workspace-heavy usersLong research tasksIn-browser web tasksAnything requiring computer control

The reality: if you live in Gmail / Docs all day, Spark probably wins by 2026 Q3 once MCP integrations are in. If you don’t, Claude or Operator is the more flexible pick today.

Pricing (30 seconds)

Google restructured its AI plans on I/O day. Read carefully:

TierPrice (US, /month)What Spark-relevant thing you get
Google AI Pro$20No Spark access
Google AI Ultra (new entry tier)$100Spark Beta + 5× the Gemini app usage limit of AI Pro + 20TB storage + YouTube Premium
Google AI Ultra (top tier)$200Same Spark access + same features as the prior $250 plan (which is gone)
Free tier$0No Spark

The big change: the prior $249.99/$250 Ultra is gone, replaced by $100 entry + $200 top. Both Ultra tiers get Spark Beta — the $100 tier is the cheapest entry point. (Engadget coverage · Android Central)

For context: Anthropic’s Claude Pro is $20/mo, OpenAI’s ChatGPT Pro is $200/mo. Google’s $100 Ultra is now sitting squarely between them.

If you only want to run agentic tasks and don’t care about the Spark wrapper, the Gemini 3.5 Flash API at $1.50/$9 per million tokens is dramatically cheaper. The $100 Ultra buys you the integration — Gmail/Docs auto-connect, 24/7 cloud VM, agent state persistence, custom Skills, voice multi-task — not raw model access. Whether that’s worth it depends on whether your workflow lives in Workspace.

Common errors + FAQ

Q: I’m not in the US, can I get on the waitlist? A: Yes — sign into https://gemini.google.com with your target Google account. Google hasn’t announced international Ultra rollout for Spark, but the queue exists.

Q: Will Spark work with my non-Google email? A: Through MCP integrations, eventually. At launch, Gmail is the only native email connection.

Q: Can it send messages on my behalf without confirmation? A: Default is “draft and ask.” You can grant permissions per-task, but it’s not a recommended default — start with --draft only.

Q: How is Spark different from “Agent Mode” in the regular Gemini app? A: Agent Mode runs while you’re in the app, on a single task you start. Spark runs continuously, on its own cloud VM, and persists state across tasks. Think Agent Mode = “do this now”; Spark = “be my agent.”

Q: What’s Google Antigravity? A: The developer-facing agentic platform Google released at the same I/O. Spark is one app built on Antigravity. If you’re a developer, you can build similar agents yourself using Antigravity directly — that’s a separate tutorial coming this week.

Q: When does the Chrome-native version land? A: Google said “later this summer” (so July-September 2026). At that point Spark runs inside Chrome itself, not just in the Gemini app — significantly more powerful.

Q: Wait — I read somewhere that Google AI Ultra is $250? A: That was the price before I/O 2026. Google restructured plans on May 19, 2026: new $100 entry-tier Ultra and a $200 top-tier Ultra; the old $250 plan was retired. Both Ultra tiers include Spark Beta. See Pricing section above.

Q: Can I make my own Skills? A: Yes. Skills are user-defined modules that teach Spark how to do a recurring task in your specific way (tone, formatting, workflow). The Ghostwriter demo is the canonical example — it taught Spark to write email in Josh Woodward’s personal voice. There’s no Skills marketplace yet; expect one.

What to use it with

  • For the model: Gemini 3.5 Flash — same brain, accessible via API today
  • For developer agent building: Google Antigravity (separate tutorial coming)
  • For non-Google email: wait for MCP rollout in “coming weeks”
  • Already live: 7 Minutes to Master Gemini 3.5 Flash — the model powering Spark
  • Coming this week: Google Antigravity: Build Your Own Spark in 1 Hour
  • Coming this week: Spark vs Operator: Head-to-Head Email Triage Test
  • Coming this week: 50 Spark Task Templates for Workspace Power Users

Sources