docs: comprehensive LS binary reverse engineering with model enum mapping

2026-02-14 04:19:48 -06:00
parent edad784bcd
commit 932214fd95
3 changed files with 880 additions and 0 deletions
--- a/docs/mitm-interception-status.md
+++ b/docs/mitm-interception-status.md
@@ -240,3 +240,36 @@ sudo update-ca-certificates
 ## Next Steps

 → See `docs/standalone-ls-todo.md` for standalone LS isolation work
+→ See `docs/ls-binary-analysis.md` for comprehensive binary reverse engineering
+
+## New Findings (from binary analysis)
+
+### Alternative to Polling: `StreamCascadeReactiveUpdates`
+
+The LS has a streaming gRPC method `StreamCascadeReactiveUpdates` that pushes
+cascade state changes in real-time via server-sent streaming. The extension uses
+this instead of polling `GetCascadeTrajectorySteps`.
+
+**Potential improvement:** If we switch from polling to this streaming RPC, we'd
+get lower latency and less backend traffic. However, our current polling approach
+works reliably and doesn't require maintaining a long-lived gRPC stream.
+
+### Quota Endpoint: `retrieveUserQuota`
+
+The `PredictionService/RetrieveUserQuota` gRPC method and
+`v1internal:retrieveUserQuota` REST endpoint provide quota/credit information.
+This could be used to implement a proper `/v1/quota` endpoint instead of
+scraping the LS's own quota tracking.
+
+### `internalAtomicAgenticChat`
+
+A REST endpoint that appears to handle the entire agentic chat loop atomically
+(tool calls + responses in one request?). Investigation needed to understand
+the request/response format.
+
+### Credits System
+
+The `google/internal/cloud/code/v1internal/credits` proto package exists with
+`Credits_CreditType` enum. The `CASCADE_ENFORCE_QUOTA` config key controls
+whether quotas are enforced. Related methods: `AddExtraFlexCreditsInternal`,
+`GetTeamCreditEntries`, `GetPlanStatus`.