Go to file

Nikketryhard 3fdd0368a0 fix: block ALL LS follow-up requests across connections

Move the in-flight blocking check to the top of the LLM request flow,
BEFORE request modification. This catches follow-ups on ALL connections
(the LS opens multiple parallel TLS connections). Only the very first
modified request reaches Google — all others get fake STOP responses.

Previously, each new connection independently allowed one request
through before blocking, letting 4-5 requests leak per turn.

2026-02-16 00:57:33 -06:00

.cargo

feat: MITM request modification — strip bloat from LLM API requests

2026-02-14 18:35:07 -06:00

.config

feat: MITM request modification — strip bloat from LLM API requests

2026-02-14 18:35:07 -06:00

.gemini/plans

feat: sync all endpoints with MITM LS bypass + real-time thinking streaming

2026-02-15 01:03:39 -06:00

docs

feat: implement headless LS authentication via state sync

2026-02-15 21:40:35 -06:00

scripts

fix: resolve symlink in proxyctl before deriving PROJECT_DIR

2026-02-15 17:36:45 -06:00

src

fix: block ALL LS follow-up requests across connections

2026-02-16 00:57:33 -06:00

.gitignore

feat: MITM request modification — strip bloat from LLM API requests

2026-02-14 18:35:07 -06:00

Cargo.lock

feat: initial commit — antigravity proxy with MITM, standalone LS, and snapshot tooling

2026-02-14 02:24:35 -06:00

Cargo.toml

feat: initial commit — antigravity proxy with MITM, standalone LS, and snapshot tooling

2026-02-14 02:24:35 -06:00

GEMINI.md

feat: implement headless LS authentication via state sync

2026-02-15 21:40:35 -06:00

KNOWN_ISSUES.md

feat: forward generation params via MITM + add usageMetadata to Gemini

2026-02-15 14:23:05 -06:00

README.md

chore: code cleanup and documentation overhaul

2026-02-15 18:27:53 -06:00

README.md

Antigravity Proxy

OpenAI-compatible proxy that intercepts and relays requests to Google's Antigravity language server, impersonating the real Electron webview. Supports the Responses API, Chat Completions API, and a native Gemini endpoint with full streaming, multi-turn conversations, tool calling, image uploads, web search grounding, and real token usage capture via MITM interception.

Architecture

%%{init: {'theme': 'dark', 'themeVariables': {'primaryColor': '#1a1a2e', 'primaryTextColor': '#e0e0e0', 'primaryBorderColor': '#7c3aed', 'lineColor': '#7c3aed', 'secondaryColor': '#16213e', 'tertiaryColor': '#0f3460', 'edgeLabelBackground': '#1a1a2e', 'nodeTextColor': '#e0e0e0'}}}%%
graph TB
    subgraph client["Client Layer"]
        style client fill:#1a1a2e,stroke:#7c3aed,stroke-width:2px,color:#e0e0e0
        APP["OpenAI SDK / curl / Any HTTP Client"]
    end

    subgraph proxy["Proxy Layer :8741"]
        style proxy fill:#16213e,stroke:#7c3aed,stroke-width:2px,color:#e0e0e0
        API["API Router<br/>responses | completions | gemini | search"]
        STORE["MitmStore<br/>tools | images | errors | usage"]
        PROTO["Protobuf Encoder<br/>byte-exact webview match"]
    end

    subgraph ls["Language Server"]
        style ls fill:#0f3460,stroke:#7c3aed,stroke-width:2px,color:#e0e0e0
        STANDALONE["Standalone LS<br/>isolated process, UID: antigravity-ls"]
    end

    subgraph mitm["MITM Layer :8742"]
        style mitm fill:#1a1a2e,stroke:#e94560,stroke-width:2px,color:#e0e0e0
        INTERCEPT["TLS Intercept<br/>decrypt + modify + re-encrypt"]
        MODIFY["Request Modifier<br/>inject tools, images, params"]
        PARSE["Response Parser<br/>usage, errors, function calls"]
    end

    subgraph google["Google API"]
        style google fill:#0f3460,stroke:#7c3aed,stroke-width:2px,color:#e0e0e0
        GAPI["daily-cloudcode-pa.googleapis.com<br/>v1internal:streamGenerateContent"]
    end

    APP -->|"HTTP POST"| API
    API --> STORE
    API --> PROTO
    PROTO -->|"gRPC"| STANDALONE
    STANDALONE -->|"HTTPS :443"| INTERCEPT
    INTERCEPT --> MODIFY
    MODIFY -->|"inject tools, images,<br/>generation params"| GAPI
    GAPI -->|"SSE response"| PARSE
    PARSE -->|"usage, errors,<br/>function calls"| STORE
    INTERCEPT -.->|"iptables REDIRECT<br/>UID-scoped"| STANDALONE

    classDef highlight fill:#7c3aed,stroke:#e94560,stroke-width:2px,color:#fff

Request Flow

Client sends an OpenAI-compatible request to the proxy
Proxy encodes the message as a protobuf matching the real webview format
Proxy sends it to the standalone Language Server via gRPC
LS makes an HTTPS request to Google's API
iptables redirects the LS's traffic (UID-scoped) to the MITM proxy
MITM decrypts TLS, modifies the request (injects tools, images, params), re-encrypts and forwards to Google
Google's SSE response flows back through MITM, which captures usage, errors, and function calls
Proxy polls the LS for cascade state, supplementing with MITM-captured data
Client receives the response in OpenAI-compatible format

Quick Start

# First-time setup (creates user + iptables for MITM)
sudo ./scripts/mitm-redirect.sh install

# Start as daemon (builds if needed)
proxyctl start

# Or run directly
RUST_LOG=info ./target/release/antigravity-proxy

Default port: 8741

Endpoints

Method	Path	Description
`POST`	`/v1/responses`	Responses API (primary) -- supports `stream: true/false`
`POST`	`/v1/chat/completions`	Chat Completions API (OpenAI compat)
`POST`	`/v1/gemini`	Native Gemini API
`GET/POST`	`/v1/search`	Web Search via Google Search grounding
`GET`	`/v1/models`	List available models
`GET`	`/v1/sessions`	List active sessions
`DELETE`	`/v1/sessions/:id`	Delete a session
`POST`	`/v1/token`	Set OAuth token at runtime
`GET`	`/v1/usage`	MITM-intercepted token usage stats
`GET`	`/v1/quota`	LS quota -- credits, per-model rate limits, reset timers
`GET`	`/health`	Health check

Available Models

Name	Label
`opus-4.6`	Claude Opus 4.6 (Thinking) -- default
`opus-4.5`	Claude Opus 4.5 (Thinking)
`gemini-3-pro-high`	Gemini 3 Pro (High)
`gemini-3-pro`	Gemini 3 Pro (Low)
`gemini-3-flash`	Gemini 3 Flash

Features

Core

Sync and streaming on all endpoints
Multi-turn conversations via conversation session ID (cascade reuse)
Full message history forwarded for Chat Completions
Thinking/reasoning exposed in both sync and streaming modes
Thinking signatures preserved for multi-turn thinking model chains

Tool Calling

OpenAI-format tools auto-converted to Gemini format via MITM injection
tool_choice support (auto, none, required, named function)
max_tool_calls limit on tool calls per response
Function call results (function_call_output) routed back correctly
Native Gemini tools passed through on the /v1/gemini endpoint

Image Uploads

Images are injected directly into Google's API request via MITM (the LS does not forward images natively).

Supported input formats:

Responses API: {type: "input_image", image_url: "data:image/png;base64,..."}
Chat Completions: {type: "image_url", image_url: {url: "data:image/png;base64,..."}}
Gemini API: {type: "input_image", image_url: "data:image/png;base64,..."}

Web Search

Google Search grounding can be enabled on any endpoint:

Completions: "web_search": true
Responses: "tools": [{"type": "web_search_preview"}]
Gemini: "google_search": true
Dedicated: GET/POST /v1/search returns structured results with citations

Generation Parameters

All parameters are forwarded to Google via MITM injection:

Parameter	Endpoints
`temperature`	All
`top_p` / `topP`	All
`top_k` / `topK`	Gemini
`max_output_tokens`	All
`stop` / `stopSequences`	All
`frequency_penalty`	Completions
`presence_penalty`	Completions
`reasoning_effort`	All (mapped to `thinkingLevel`)
`response_format`	Completions, Responses (`json_object`, `json_schema`)

Error Propagation

When Google's API returns an error (400, 429, 500, etc.), the MITM proxy captures it and the API handler returns it immediately to the client instead of hanging until timeout.

Error status mapping:

Google Status	HTTP Code	OpenAI Error Type
`INVALID_ARGUMENT`	400	`invalid_request_error`
`RESOURCE_EXHAUSTED`	429	`rate_limit_error`
`PERMISSION_DENIED`	403	`authentication_error`
`INTERNAL`	500	`server_error`
`UNAVAILABLE`	503	`server_error`

Usage Examples

Responses API (sync)

curl -s http://localhost:8741/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash",
    "input": "Say hello in exactly 3 words",
    "stream": false,
    "timeout": 60
  }' | jq .

Responses API (streaming)

curl -N http://localhost:8741/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash",
    "input": "Say hello in exactly 3 words",
    "stream": true,
    "timeout": 60
  }'

Multi-turn Conversation

curl -s http://localhost:8741/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash",
    "input": "What is 2+2?",
    "conversation": "my-session-1",
    "stream": false
  }' | jq .

# Follow-up in same cascade
curl -s http://localhost:8741/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash",
    "input": "Now multiply that by 10",
    "conversation": "my-session-1",
    "stream": false
  }' | jq .

Image Upload

curl -s http://localhost:8741/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash",
    "input": [
      {"type": "input_image", "image_url": "data:image/png;base64,iVBORw0KGgo..."},
      {"type": "input_text", "text": "What is in this image?"}
    ],
    "stream": false
  }' | jq .

Web Search

# Dedicated search endpoint
curl -s 'http://localhost:8741/v1/search?q=latest+rust+news' | jq .

# Inline grounding on any endpoint
curl -s http://localhost:8741/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash",
    "messages": [{"role": "user", "content": "What happened in tech today?"}],
    "web_search": true
  }' | jq .

Tool Calling

curl -s http://localhost:8741/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3-flash",
    "input": "What is the weather in Tokyo?",
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get weather for a location",
        "parameters": {
          "type": "object",
          "properties": {"location": {"type": "string"}},
          "required": ["location"]
        }
      }
    }],
    "stream": false
  }' | jq .

Authentication

The proxy needs an OAuth token. Three ways to provide it:

Environment variable: export ANTIGRAVITY_OAUTH_TOKEN=ya29.xxx
Token file: echo 'ya29.xxx' > ~/.config/antigravity-proxy-token
Runtime API: curl -X POST http://localhost:8741/v1/token -d '{"token":"ya29.xxx"}'

Stealth Features

TLS fingerprint -- BoringSSL with Chrome JA3/JA4 + H2 fingerprint via wreq (version auto-detected)
Protobuf -- Hand-rolled encoder producing byte-exact match to real webview traffic
Warmup -- Mimics real webview startup RPC calls
Heartbeat -- Periodic keep-alive matching real webview lifecycle
Reactive streaming -- StreamCascadeReactiveUpdates for real-time state diffs (polling fallback)
Jitter -- Randomized intervals to avoid automation fingerprint
Session reuse -- Cascades reused for multi-turn, matching real webview behavior
Version detection -- Auto-detects Antigravity/Chrome/Electron versions from installed app

CLI Reference

`proxyctl` -- Daemon Manager

Symlinked to ~/.local/bin/proxyctl for global access.

Command	Description
`proxyctl start`	Start the proxy daemon
`proxyctl stop`	Stop the proxy daemon
`proxyctl restart`	Rebuild + restart
`proxyctl rebuild`	Build release binary only
`proxyctl status`	Service status + quota + usage
`proxyctl logs [N]`	Tail last N lines (default 30) + follow
`proxyctl logs-all`	Full log dump (no follow)
`proxyctl test [msg]`	Quick test request (gemini-3-flash)
`proxyctl health`	Health check

`mitm-redirect.sh` -- MITM Setup

One-time setup script for UID-scoped iptables traffic redirection.

sudo ./scripts/mitm-redirect.sh install    # create user + iptables rule
sudo ./scripts/mitm-redirect.sh uninstall  # remove user + iptables rule
sudo ./scripts/mitm-redirect.sh status     # check current state

Proxy Binary

antigravity-proxy [OPTIONS]

Options:
  --port <PORT>          API server port (default: 8741)
  --no-standalone        Attach to existing LS instead of spawning standalone
  --no-mitm              Disable MITM proxy entirely
  --mitm-port <PORT>     Override MITM proxy port (default: auto-assign)

MITM Proxy

How It Works

%%{init: {'theme': 'dark', 'themeVariables': {'primaryColor': '#1a1a2e', 'primaryTextColor': '#e0e0e0', 'primaryBorderColor': '#e94560', 'lineColor': '#e94560', 'secondaryColor': '#16213e', 'tertiaryColor': '#0f3460'}}}%%
graph LR
    subgraph proxy_layer["Proxy :8741"]
        style proxy_layer fill:#16213e,stroke:#7c3aed,stroke-width:2px,color:#e0e0e0
        P["API Handler"]
        S["MitmStore"]
    end

    subgraph ls_layer["Standalone LS"]
        style ls_layer fill:#0f3460,stroke:#7c3aed,stroke-width:2px,color:#e0e0e0
        LS["language_server<br/>UID: antigravity-ls"]
    end

    subgraph mitm_layer["MITM :8742"]
        style mitm_layer fill:#1a1a2e,stroke:#e94560,stroke-width:2px,color:#e0e0e0
        M["TLS Decrypt"]
        MOD["Modify Request<br/>tools | images | params"]
        CAP["Capture Response<br/>usage | errors | calls"]
    end

    subgraph google_layer["Google API"]
        style google_layer fill:#0f3460,stroke:#7c3aed,stroke-width:2px,color:#e0e0e0
        G["streamGenerateContent"]
    end

    P -->|"image, tools,<br/>params"| S
    P -->|"protobuf"| LS
    LS -->|":443 traffic"| M
    M --> MOD
    MOD -->|"modified request"| G
    G -->|"SSE response"| CAP
    CAP -->|"usage, errors"| S
    S -->|"error or result"| P

    linkStyle 2 stroke:#e94560,stroke-width:2px

UID-scoped iptables -- only the standalone LS's traffic is intercepted (zero side effects)
Combined CA bundle -- system CAs + MITM CA written to /tmp/antigravity-mitm-combined-ca.pem
Google SSE parsing -- extracts promptTokenCount, candidatesTokenCount, thoughtsTokenCount
Request modification -- strips LS bloat, injects client tools/images/params (97%+ size reduction typical)
Error capture -- upstream errors stored in MitmStore for instant client forwarding
Init metadata -- protobuf field 34 detect_and_use_proxy set to ENABLED (1)

Development

Dev/testing model: gemini-3-flash -- use for all development and iterative testing
Production model: opus-4.6 -- use sparingly (quota limited)
See docs/ls-binary-analysis.md for reverse-engineered model catalog and proto enum mappings
See docs/endpoint-gap-analysis.md for full API coverage audit
See docs/mitm-interception-status.md for MITM technical details

License

Private. Do not distribute.

README.md

Antigravity Proxy

Architecture

Request Flow

Quick Start

Endpoints

Available Models

Features

Core

Tool Calling

Image Uploads

Web Search

Generation Parameters

Error Propagation

Usage Examples

Responses API (sync)

Responses API (streaming)

Multi-turn Conversation

Image Upload

Web Search

Tool Calling

Authentication

Stealth Features

CLI Reference

proxyctl -- Daemon Manager

mitm-redirect.sh -- MITM Setup

Proxy Binary

MITM Proxy

How It Works

Development

License

`proxyctl` -- Daemon Manager

`mitm-redirect.sh` -- MITM Setup