← Tap · Blog

40 Commits to Ship a Chrome MV3 Extension: A Git Log Retrospective

April 11, 2026 · Leon Ting · 10 min read · Five categories of MV3 pain, with commit links

Tap is a browser automation tool that turns interface operations into deterministic programs. The Chrome extension is one of its three runtimes — alongside Playwright and macOS — and it's the one most people actually use, because it runs in the user's real logged-in Chrome instead of a sandboxed headless browser.

When I started building the extension, I assumed Manifest V3 would be a minor adjustment from V2. Drop background.html for background.js, swap chrome.extension for chrome.runtime, done. I had shipped MV2 extensions before. How different could it be?

Three months and roughly forty commits later, the answer is: fundamentally different. This is a category-by-category tour of what actually broke, drawn from git log on Tap's extension repo, so that the next person building an MV3 extension doesn't have to rediscover each of these the hard way.

Category 1: the service worker dies every 30 seconds

The single biggest architectural difference between MV2 and MV3 is that MV3 service workers are not long-lived background pages. They're event handlers. Chrome terminates them after roughly thirty seconds of idle, and any in-flight computation, module-level state, timer, or open connection dies with them. This one fact drove at least a dozen of my commits.

setInterval no longer works

The first version of Tap's extension used setInterval to poll the local daemon for commands. It worked on my machine for about five seconds at a time, then silently stopped. The reason was obvious once I thought about it: when the service worker dies, the timer dies with it, and nothing wakes the SW back up on a timer alone.

commit 061dce6
perf: switch from setInterval polling to long-poll loop

Removes adaptive setInterval (50ms/500ms) in favor of a single
persistent long-poll connection to the daemon. Browser makes 0
requests at idle instead of 2/sec, and commands dispatch with
near-zero latency.

The replacement was a long-poll: the extension opens one HTTP request to the daemon, the daemon holds it open until a command arrives, then responds. The loop re-issues immediately. At idle, browser traffic is zero. When a command arrives, dispatch latency is sub-millisecond. And crucially, an outstanding fetch does keep the service worker alive — while it's awaiting a response, Chrome doesn't kill it.

chrome.alarms has a silent 1-minute floor

Long-polling alone wasn't enough. What if the fetch disconnects (daemon restart, network flake, timeout), the SW goes idle waiting to reconnect, and Chrome kills it? The standard MV3 keepalive pattern is chrome.alarms: schedule an alarm to fire every N minutes, and Chrome wakes the SW to handle it. I wrote:

chrome.alarms.create('keepalive', { periodInMinutes: 25 / 60 })  // fire every 25 seconds

This does not work. It compiles. It runs. It never fires. I wasted an hour before reading the docs more carefully:

"The minimum period is 1 minute. If you specify a value less than 1 minute, Chrome will silently ignore it and use 1 minute instead."

commit 528e894
fix: pollLoop never exits — retry on all errors, prevent silent death

- Fix chrome.alarms minimum: 1 min (Chrome silently ignores <1)
- Poll fetch failure: badge !, 3s backoff, continue loop (never exit)
- Result post failure: silently ignored (daemon may restart)

If you want a sub-minute keepalive in MV3, you need to get creative — an outstanding fetch that returns a dummy packet every 20 seconds, for example. But for Tap's purposes, 1 minute was fine. The real fix was in the same commit: make the poll loop unkillable. Any fetch error (daemon down, 502, abort, timeout) triggers a 3-second backoff and continues. The loop exits only when the service worker itself is terminated — and then chrome.alarms wakes it back up one minute later.

Every wait must be capped below the 30-second kill threshold

The next subtle bug: Tap's CDP Runtime.evaluate had a 30-second default timeout. A user ran a script that hung waiting for a selector. The timeout fired at exactly 30 seconds — which is exactly when Chrome also killed the service worker for being idle-plus-a-bit. The fetch never resolved, the poll loop died, and the extension looked frozen.

commit 8bb6d7d
fix: MV3 service worker safety — session ID + 25s caps on all waits

- Poll sends sessionId; daemon evicts ghost waiters from old sessions
- CDP eval timeout: 30s → 25s (below Chrome SW kill threshold)
- wait/waitFor/waitForNetwork: capped at 25s max
- poll fetch: explicit 25s AbortSignal timeout

Every internal deadline got shaved to 25 seconds — a safety margin of five seconds below the SW kill threshold. The commit also introduced session IDs: each time the extension wakes up, it generates a new session ID and sends it with every poll. The daemon uses this to evict any ghost waiters still referencing the old session, so a reanimated extension doesn't receive replies meant for a dead one.

In-memory state does not persist across SW restarts

This one I only hit last week, three months into the project. I had a Map at module scope that tracked all Chrome tabs Tap had opened. Every thirty seconds the service worker died and the Map re-initialized empty. The Chrome tabs, however, lived in the browser process — they persisted. So over a single afternoon a user accumulated nine orphan tabs that Tap had forgotten about and could no longer clean up.

commit 7092f8e
fix: persist session map to storage.session, stop nav leaking chrome://newtab/

The fix, which I wrote about in detail in a previous post, is to persist the map to chrome.storage.session — Chrome's built-in per-browser-session in-memory KV store that is specifically designed to survive service worker restarts. Every mutation to the map writes through to storage. Every command handler that reads from the map waits for a rehydration promise before its first lookup. It's fifty lines of boilerplate that I should have written on day one.

Category 2: WebSockets and persistent connections die too

My first draft of the daemon-to-extension bridge was a WebSocket. Obvious choice — bidirectional, low-latency, idiomatic. It worked beautifully until the service worker went idle, at which point the WebSocket closed and the reconnect logic on the SW side couldn't execute because the SW was dead.

commit 24d68eb
feat: HTTP polling replaces WebSocket connection to daemon

HTTP long-polling doesn't care about persistent sockets. Each request is a discrete event that wakes the SW if it's asleep. Worse latency on the first request after idle, but it's measured in milliseconds and it works.

The rule in MV3 is: if your architecture needs a connection that outlives a single event handler, redesign your architecture. Chrome is not going to make exceptions for you.

Category 3: tab stealing across concurrent sessions

Tap supports multiple concurrent automation sessions. Each session owns one or more Chrome tabs. In the early extension, a subtle bug would occasionally cause session A's tap.click to execute in session B's tab — wrong tab, wrong page, data corruption.

The root cause was a helper called resolveTab(tabId) that, if given a falsy tabId, would fall back to Chrome's currently active tab — whatever the user happened to be looking at. Under concurrency, this meant commands issued just after a context switch would land in whichever tab was foregrounded at that millisecond, not the tab their session owned.

commit f2e7ba1
fix: prevent tab stealing — explicit tabId never falls back to active tab

commit 1455f06
fix: nav uses local tabId, not global activeTabId for daemon commands

commit fc414d4
fix: daemon session isolation — daemon commands never use activeTabId

The fix is a design rule: every command must carry an explicit tabId, and any fallback to "the active tab" is a bug. The SessionManager refactor (commit 8c32d78) formalized this by making each session's tabId a mandatory parameter on every internal call, so a missing value is a type error rather than a runtime fallback.

Creating tabs without stealing focus

A related issue: when Tap opens a new tab for a session, Chrome by default foregrounds it, which is annoying if the user is typing in a different window.

commit 88b6c30
fix: tabs created by tap never steal focus — active: false everywhere

One line: chrome.tabs.create({ url, active: false }). But this created its own subtle issue — several Tap APIs need the tab to be the "active" tab to work (chrome.scripting.executeScript works on any tab, but some older code paths didn't). Those all had to be updated to address tabs by explicit ID, never by "whatever is active."

Category 4: the CDP debugger-attached tax

Tap uses the Chrome DevTools Protocol for operations that require native input events — mouse clicks with real hardware cursor, keyboard events that fire exactly the DOM listeners a human would trigger. CDP is attached via chrome.debugger.attach. The tax you pay for this: Chrome shows a yellow bar at the top of every debugged tab that says "WebTap started debugging this browser." Users hate it. They file bugs about it. Some users even think the extension is malicious.

commit 20597c3
feat: click JS-first + dialog JS override — minimize CDP usage

The mitigation is to only attach CDP when strictly necessary. For tap.click, the first attempt is a native JavaScript element.click() synthesized from a real MouseEvent. If the target handler requires a genuine user gesture (OAuth popup, file input, etc.), that fails silently and Tap falls back to CDP. For most clicks on most pages, CDP is never touched, and no debugger bar appears.

commit aa6269c
fix: remove chrome.debugger.sendCommand from protocol.js — use injected cdp/cdpNav

The follow-up tightened the abstraction so that the built-in operation layer (tap.click, tap.type, tap.fetch) never touches chrome.debugger directly. Only the lowest layer of core operations can attach CDP, and it only does so when the JS path has been exhausted. This means roughly 80% of Tap operations never trigger the debugger bar.

Category 5: the unsafe-eval ban

MV2 let you opt into 'unsafe-eval' in the extension CSP. MV3 rejects the manifest at load time if you try. This sounds like a small thing, but it breaks any extension that dynamically loads code — in particular, browser automation tools that need to run user-provided scripts inside the page.

commit c29131c
fix: eval CSP fallback — use CDP Runtime.evaluate when eval blocked

commit a8b9417
fix: CDP eval returns correct values via indirect eval in async IIFE

The workaround is to route all dynamic code execution through chrome.debugger's Runtime.evaluate command. CDP is not subject to extension CSP, because it's running at the browser process level, not inside the extension page context. The downside is the tax from Category 4 — every dynamic evaluation pops the debugger bar. Combined with the JS-first mitigation above, this is fine for occasional evals, but it means extensions built around heavy dynamic code execution pay a UX cost on every call.

The meta-lesson: MV3 is hostile to code, not to extensions

After three months and forty-ish commits of fighting this, the pattern finally clicked. Every single category above has the same root cause: MV3 is aggressively hostile to arbitrary code execution, and aggressively opinionated about how state should be persisted. The rules Chrome is enforcing, collectively, are:

  1. Your extension must ship its code as static files in the package. You cannot download, generate, or evaluate code at runtime. (unsafe-eval ban.)
  2. Your extension code executes only in response to events. There is no long-running process. (Service worker termination.)
  3. Your state must be persisted to chrome.storage, not held in JS variables. (storage.session for ephemeral, storage.local for durable.)
  4. Your extension must not take actions that a reasonable user would not expect. (CDP debugger bar as transparency mechanism.)
  5. Your extension must be isolated from other code running in the same browser. (No global state, no tab stealing, explicit references everywhere.)

Tap's original design broke all five of these rules in various ways. Every commit in the forty-something I cited above is a concession to one of them — usually discovered the hard way, in production, after a user complained.

If you're about to start building an MV3 extension, read those five rules first. Design your architecture around them, not in spite of them. Your future self will thank you. Mine did not get the warning.


Taprun: your agent runs the browser task — you keep the audit trail

Tell your agent a browser task on any site that needs your login — it runs in your real, already-logged-in Chrome and compiles it once into a deterministic, auditable .plan.json program: a versioned, reviewable record of exactly what it did. Every replay after is local, zero tokens, same result every time. Cookies and sessions never leave your machine — by architecture, not policy. Cloud browser SDKs can't match this; they need your session in their database to function. tap verify catches substrate drift before your data goes stale. Works with Claude Code, Cursor, Cline, Windsurf, and any MCP host. 70+ community taps.

curl -fsSL https://taprun.dev/install.sh | sh

taprun.dev · GitHub · More posts

Follow new engineering notes: RSS · Watch on GitHub