feat: MITM request modification — strip bloat from LLM API requests
Intercepts streamGenerateContent requests and trims: - System instruction: strips web_application_development, knowledge_discovery, persistent_context, skills sections (~18KB saved) - Content messages: strips empty user_rules, workflows boilerplate, conversation summaries (~4.5KB saved) - Tools: keeps 12 essential coding tools, strips 8 non-essential (browser_subagent, generate_image, search_web, etc. ~6KB saved) Total: ~55% reduction in request size while keeping identity, user info, and all coding-relevant tools intact. Only modifies 'agent' type requests, checkpoint requests pass through unmodified. Also: - Standalone mode is now the default (use --no-standalone to attach to existing LS) - Enable request modification by default - Add mold linker, sccache, nextest config (8 thread cap) - Add .cargo/config.toml and .config/nextest.toml
This commit is contained in:
7
.config/nextest.toml
Normal file
7
.config/nextest.toml
Normal file
@@ -0,0 +1,7 @@
|
||||
[store]
|
||||
# Cap test threads to 8
|
||||
threads = 8
|
||||
|
||||
[profile.default]
|
||||
retries = 0
|
||||
slow-timeout = { period = "30s" }
|
||||
Reference in New Issue
Block a user