feat: per-deployment concurrency (30 req/account) instead of global pipeline depth

Apps Script enforces 30 concurrent executions per account. The old pipeline used a single global semaphore sized to the number of deployments, meaning 1 in-flight batch per deployment. Now each deployment ID gets its own semaphore with 30 permits — matching the actual per-account limit. With N deployments the system can sustain 30×N concurrent batch requests instead of N. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-05-18 06:44:35 +03:00 · 2026-04-24 20:48:49 +02:00
parent 81d1883f45
commit 2715991afa
3 changed files with 76 additions and 46 deletions
@@ -163,7 +163,7 @@ Firefox keeps its own cert store; the installer also drops the CA into Firefox's
 Open the UI and fill in the form:
- **Apps Script ID** — the Deployment ID from Step 1. Add multiple IDs (one per line in the UI, or a JSON array in `config.json`) for higher quota **and** lower latency. In `apps_script` mode, IDs are round-robined. In `full` mode, more IDs directly increase the pipeline depth (see [Full tunnel mode](#full-tunnel-mode) below).
+- **Apps Script ID** — the Deployment ID from Step 1. Add multiple IDs (one per line in the UI, or a JSON array in `config.json`) for higher quota **and** lower latency. In `apps_script` mode, IDs are round-robined. In `full` mode, each deployment gets its own pool of 30 concurrent requests (the Apps Script per-account limit), so more IDs = more total throughput (see [Full tunnel mode](#full-tunnel-mode) below).
 - **Auth key** — the same secret you set in `Code.gs`.
 - **Google IP** — `216.239.38.120` is a solid default. Use the **scan** button to probe for a faster one from your network.
 - **Front domain** — keep `www.google.com`.
@@ -271,21 +271,20 @@ Full tunnel mode (`"mode": "full"`) routes **all** traffic end-to-end through Ap
 ### How deployment IDs affect performance
-Each Apps Script batch request takes ~2 seconds round-trip. In full mode, `mhrv-rs` runs a **pipelined batch multiplexer** that fires multiple batch requests concurrently without waiting for the previous one to return. The number of in-flight batches (the *pipeline depth*) scales directly with the number of deployment IDs you configure:
+Each Apps Script batch request takes ~2 seconds round-trip. In full mode, `mhrv-rs` runs a **pipelined batch multiplexer** that fires multiple batch requests concurrently without waiting for the previous one to return. Each deployment ID (= one Google account) gets its own concurrency pool of **30 in-flight requests** — matching the Apps Script per-account execution limit.
 ```
-pipeline_depth = number_of_script_ids  (minimum 2)
+max_concurrent = 30 × number_of_deployment_ids
 ```
-| Deployments | Pipeline depth | Effective batch interval | Notes |
+| Deployments | Concurrent requests | Notes |
-|-------------|---------------|------------------------|-------|
+|-------------|-------------------|-------|
-| 1 | 2 | ~1.0s | Minimum — still pipelines 2 batches |
+| 1 | 30 | Single account — plenty for light browsing |
-| 3 | 3 | ~0.7s | Good for light browsing |
+| 3 | 90 | Good for daily use |
-| 6 | 6 | ~0.3s | Recommended for daily use |
+| 6 | 180 | Recommended for heavy use |
-| 12 | 12 | ~0.17s | Sweet spot for most users |
+| 12 | 360 | Multi-account power setup |
 | 20 | 20 | ~0.10s | Multi-account setups |
-More deployments = more concurrent batches = lower per-session latency. Each batch round-robins across your deployment IDs, so the load is spread evenly and you're less likely to hit a single deployment's quota ceiling.
+More deployments = more total concurrency = lower per-session latency. Each batch round-robins across your deployment IDs, so the load is spread evenly and you're less likely to hit a single deployment's quota ceiling.
 **Resource guards** keep things safe:
 - **50 ops max** per batch — if more sessions are active, the mux splits into multiple batches
@@ -294,10 +293,10 @@ More deployments = more concurrent batches = lower per-session latency. Each bat
 ### Quick start
-1. Deploy [`CodeFull.gs`](assets/apps_script/CodeFull.gs) as **3–12 Web App deployments** (same steps as `Code.gs`, but use the full-mode script that forwards to your tunnel-node). You can create multiple deployments on a single Google account — each "New deployment" produces its own ID. Going multi-account only matters for the daily quota (each Google account gets its own 20 000 `UrlFetchApp` calls/day on the free tier / 100 000 on Workspace); the pipeline depth itself scales fine on one account up to Apps Script's simultaneous-execution ceiling. Rule of thumb:
+1. Deploy [`CodeFull.gs`](assets/apps_script/CodeFull.gs) as a **Web App deployment** on each Google account (same steps as `Code.gs`, but use the full-mode script that forwards to your tunnel-node). Use **one deployment per Google account** — the 30-concurrent-request limit is per account, so multiple deployments on the same account share the same pool and don't add concurrency. To scale, add more accounts:
-   - **Solo use** → 3–6 deployments on one account is plenty
+   - **Solo use** → 1–2 accounts is plenty
-   - **Shared with ~3 people** → 6 deployments on one account, bump to multi-account only if you start hitting quota alerts
+   - **Shared with ~3 people** → 3 accounts
-   - **Shared with a group** → one account per heavy user (each with 1–2 deployments) is the clean scaling path
+   - **Shared with a group** → one account per heavy user
 2. Deploy the [tunnel-node](tunnel-node/) on a VPS
 3. Set `"mode": "full"` in your config with all deployment IDs:
@@ -630,21 +629,20 @@ Donations cover hosting, self-hosted CI runner costs, and continued maintenance.
 #### چرا تعداد `Deployment ID` مهم است؟
-هر درخواست دسته‌ای (`batch`) به `Apps Script` حدود ۲ ثانیه طول می‌کشد. در حالت `full`، برنامه یک **لولهٔ موازی** (`pipeline`) اجرا می‌کند که چند درخواست دسته‌ای را همزمان می‌فرستد بدون اینکه منتظر پاسخ قبلی بماند. تعداد درخواست‌های همزمان مستقیماً با تعداد `Deployment ID`ها رابطه دارد:
+هر درخواست دسته‌ای (`batch`) به `Apps Script` حدود ۲ ثانیه طول می‌کشد. در حالت `full`، برنامه یک **لولهٔ موازی** (`pipeline`) اجرا می‌کند که چند درخواست دسته‌ای را همزمان می‌فرستد بدون اینکه منتظر پاسخ قبلی بماند. هر `Deployment ID` (= یک حساب گوگل) حوضچهٔ همزمانی مخصوص خودش با **۳۰ درخواست همزمان** دارد — مطابق سقف اجرای همزمان `Apps Script` به ازای هر حساب.
 ```
-عمق لوله = تعداد Deployment IDها  (حداقل ۲)
+حداکثر همزمانی = ۳۰ × تعداد Deployment IDها
 ```
-| تعداد Deployment | عمق لوله | فاصلهٔ مؤثر بین دسته‌ها | |
+| تعداد Deployment | درخواست‌های همزمان | |
-|-----------------|----------|------------------------|---|
+|-----------------|-------------------|---|
-| ۱ | ۲ | ~۱ ثانیه | حداقل |
+| ۱ | ۳۰ | یک حساب — برای مرور سبک کافیست |
-| ۳ | ۳ | ~۰.۷ ثانیه | مناسب مرور سبک |
+| ۳ | ۹۰ | مناسب استفادهٔ روزانه |
-| ۶ | ۶ | ~۰.۳ ثانیه | توصیه‌شده برای استفادهٔ روزانه |
+| ۶ | ۱۸۰ | توصیه‌شده برای استفادهٔ سنگین |
-| ۱۲ | ۱۲ | ~۰.۱۷ ثانیه | نقطهٔ بهینه |
+| ۱۲ | ۳۶۰ | چند حساب — حداکثر توان |
 | ۲۰ | ۲۰ | ~۰.۱ ثانیه | چند حساب |
-بیشتر `Deployment` = بیشتر درخواست همزمان = تأخیر کمتر برای هر نشست. هر دسته بین `ID`ها چرخش می‌کند (`round-robin`)، پس بار به‌طور یکنواخت توزیع می‌شود.
+بیشتر `Deployment` = بیشتر همزمانی = تأخیر کمتر برای هر نشست. هر دسته بین `ID`ها چرخش می‌کند (`round-robin`)، پس بار به‌طور یکنواخت توزیع می‌شود.
 ### اجرا روی OpenWRT (روتر)
@@ -700,7 +698,7 @@ logread -e mhrv-rs -f
 - **لینوکس:** فایل `/usr/local/share/ca-certificates/mhrv-rs.crt` را حذف و `sudo update-ca-certificates` اجرا کنید
 **چند `Deployment ID` لازم دارم؟**
-یکی برای استفادهٔ عادی کافی است. سهمیهٔ روزانه `UrlFetchApp` برای حساب رایگان گوگل **۲۰٬۰۰۰ درخواست در روز** است (برای `Workspace` پولی ۱۰۰٬۰۰۰)، با محدودیت پاسخ ۵۰ مگابایت به ازای هر `fetch`. برای اکثر کاربران چند ساعت یوتیوب هم با یک `Deployment` کافی است. می‌توانید چند `Deployment` **در همان حساب** بسازید (هر بار `New deployment` یک `ID` جدید می‌دهد) — این روش در حالت `full` پهنای باند بهتری می‌دهد چون `pipeline depth` افزایش پیدا می‌کند و هر `Deployment` یک اجرای همزمان جدا در `Apps Script` می‌گیرد (تا سقف ۳۰ اجرای همزمان هر حساب). برای سهمیهٔ روزانهٔ بیشتر، در حساب‌های گوگل دیگر هم `Deployment` بسازید — هر حساب سهمیهٔ ۲۰ هزار درخواستی خودش را دارد. همهٔ `ID`ها را در فیلد `Apps Script ID(s)` وارد کنید — برنامه خودکار بینشان می‌چرخد. مرجع: <https://developers.google.com/apps-script/guides/services/quotas>
+یکی برای استفادهٔ عادی کافی است. سهمیهٔ روزانه `UrlFetchApp` برای حساب رایگان گوگل **۲۰٬۰۰۰ درخواست در روز** است (برای `Workspace` پولی ۱۰۰٬۰۰۰)، با محدودیت پاسخ ۵۰ مگابایت به ازای هر `fetch`. از هر حساب گوگل **فقط یک `Deployment`** بسازید — سقف ۳۰ درخواست همزمان به ازای هر حساب است، پس چند `Deployment` روی یک حساب همزمانی اضافه نمی‌کند. برای افزایش همزمانی یا سهمیهٔ روزانه، در حساب‌های گوگل دیگر `Deployment` بسازید — هر حساب سهمیهٔ ۲۰ هزار درخواستی و ۳۰ اجرای همزمان خودش را دارد. همهٔ `ID`ها را در فیلد `Apps Script ID(s)` وارد کنید — برنامه خودکار بینشان می‌چرخد. مرجع: <https://developers.google.com/apps-script/guides/services/quotas>
 **یوتوب کار می‌کند؟ ویدیو پخش می‌شود؟**
 صفحهٔ یوتوب سریع باز می‌شود (چون مستقیم از لبهٔ گوگل می‌آید). اما `chunk`های ویدیوی اصلی از `googlevideo.com` از طریق `Apps Script` می‌آیند و روزانه سهمیه دارند. برای تماشای گاه‌به‌گاه خوب است، برای ۱۰۸۰p پخش طولانی دردناک.
@@ -336,6 +336,10 @@ impl DomainFronter {
        self.script_ids.len()
    }
    pub fn script_id_list(&self) -> &[String] {
        &self.script_ids
    }
    pub fn cache(&self) -> &ResponseCache {
        &self.cache
    }
@@ -344,7 +348,7 @@ impl DomainFronter {
        self.coalesced.load(Ordering::Relaxed)
    }
-    fn next_script_id(&self) -> String {
+    pub fn next_script_id(&self) -> String {
        let n = self.script_ids.len();
        let mut bl = self.blacklist.lock().unwrap();
        let now = Instant::now();
@@ -1169,6 +1173,18 @@ impl DomainFronter {
    pub async fn tunnel_batch_request(
        &self,
        ops: &[BatchOp],
    ) -> Result<BatchTunnelResponse, FronterError> {
        let script_id = self.next_script_id();
        self.tunnel_batch_request_to(&script_id, ops).await
    }
    /// Like `tunnel_batch_request` but targets a specific deployment ID.
    /// Used by the pipeline mux to pin a batch to a deployment whose
    /// per-account concurrency slot has already been acquired.
    pub async fn tunnel_batch_request_to(
        &self,
        script_id: &str,
        ops: &[BatchOp],
    ) -> Result<BatchTunnelResponse, FronterError> {
        let mut map = serde_json::Map::new();
        map.insert("k".into(), Value::String(self.auth_key.clone()));
@@ -1176,7 +1192,6 @@ impl DomainFronter {
        map.insert("ops".into(), serde_json::to_value(ops)?);
        let payload = serde_json::to_vec(&Value::Object(map))?;
        let script_id = self.next_script_id();
        let path = format!("/macros/s/{}/exec", script_id);
        let mut entry = self.acquire().await?;
@@ -2,9 +2,10 @@
 //!
 //! A central multiplexer collects pending data from ALL active sessions
 //! and fires batch requests without waiting for the previous one to return.
-//! Pipeline depth equals the number of script deployments (minimum 2),
+//! Each Apps Script deployment (account) gets its own concurrency pool of
-//! so users with more deployments get lower latency automatically.
+//! 30 in-flight requests — matching the per-account Apps Script limit.
 use std::collections::HashMap;
 use std::sync::Arc;
 use std::time::Duration;
@@ -16,8 +17,8 @@ use tokio::sync::{mpsc, oneshot, Semaphore};
 use crate::domain_fronter::{BatchOp, DomainFronter, TunnelResponse};
-/// Minimum pipeline depth even with a single script.
+/// Apps Script allows 30 concurrent executions per account / deployment.
-const MIN_PIPELINE_DEPTH: usize = 2;
+const CONCURRENCY_PER_DEPLOYMENT: usize = 30;
 /// Maximum total base64-encoded payload bytes in a single batch request.
 /// Apps Script accepts up to 50 MB per fetch, but the tunnel-node must
@@ -66,14 +67,14 @@ pub struct TunnelMux {
 impl TunnelMux {
    pub fn start(fronter: Arc<DomainFronter>) -> Arc<Self> {
-        let pipeline_depth = fronter.num_scripts().max(MIN_PIPELINE_DEPTH);
+        let n = fronter.num_scripts();
        tracing::info!(
-            "tunnel mux: pipeline_depth={} (from {} script deployments)",
+            "tunnel mux: {} deployment(s), {} concurrent per deployment",
-            pipeline_depth,
+            n,
-            fronter.num_scripts()
+            CONCURRENCY_PER_DEPLOYMENT
        );
        let (tx, rx) = mpsc::channel(512);
-        tokio::spawn(mux_loop(rx, fronter, pipeline_depth));
+        tokio::spawn(mux_loop(rx, fronter));
        Arc::new(Self { tx })
    }
@@ -85,9 +86,15 @@ impl TunnelMux {
 async fn mux_loop(
    mut rx: mpsc::Receiver<MuxMsg>,
    fronter: Arc<DomainFronter>,
    pipeline_depth: usize,
 ) {
-    let sem = Arc::new(Semaphore::new(pipeline_depth));
+    // One semaphore per deployment ID, each allowing 30 concurrent requests.
    let sems: Arc<HashMap<String, Arc<Semaphore>>> = Arc::new(
        fronter
            .script_id_list()
            .iter()
            .map(|id| (id.clone(), Arc::new(Semaphore::new(CONCURRENCY_PER_DEPLOYMENT))))
            .collect(),
    );
    loop {
        let mut msgs = Vec::new();
@@ -136,7 +143,7 @@ async fn mux_loop(
                            || batch_payload_bytes + op_bytes > MAX_BATCH_PAYLOAD_BYTES)
                    {
                        fire_batch(
-                            &sem,
+                            &sems,
                            &fronter,
                            std::mem::take(&mut data_ops),
                            std::mem::take(&mut data_replies),
@@ -176,22 +183,28 @@ async fn mux_loop(
            continue;
        }
-        fire_batch(&sem, &fronter, data_ops, data_replies).await;
+        fire_batch(&sems, &fronter, data_ops, data_replies).await;
    }
 }
-/// Acquire a pipeline slot and spawn a batch request task.
+/// Pick a deployment, acquire its per-account concurrency slot, and spawn
 /// a batch request task.
 ///
 /// The batch HTTP round-trip is bounded by `BATCH_TIMEOUT` so a slow or
 /// dead tunnel-node target cannot hold a pipeline slot (and block waiting
 /// sessions) forever.
 async fn fire_batch(
-    sem: &Arc<Semaphore>,
+    sems: &Arc<HashMap<String, Arc<Semaphore>>>,
    fronter: &Arc<DomainFronter>,
    data_ops: Vec<BatchOp>,
    data_replies: Vec<(usize, oneshot::Sender<Result<TunnelResponse, String>>)>,
 ) {
-    let permit = sem.clone().acquire_owned().await.unwrap();
+    let script_id = fronter.next_script_id();
    let sem = sems
        .get(&script_id)
        .cloned()
        .unwrap_or_else(|| Arc::new(Semaphore::new(CONCURRENCY_PER_DEPLOYMENT)));
    let permit = sem.acquire_owned().await.unwrap();
    let f = fronter.clone();
    tokio::spawn(async move {
@@ -201,8 +214,12 @@ async fn fire_batch(
        // Bounded-wait: if the batch takes longer than BATCH_TIMEOUT,
        // all sessions in this batch get an error and can retry.
-        let result = tokio::time::timeout(BATCH_TIMEOUT, f.tunnel_batch_request(&data_ops)).await;
+        let result = tokio::time::timeout(
-        tracing::info!("batch: {} ops, rtt={:?}", n_ops, t0.elapsed());
+            BATCH_TIMEOUT,
            f.tunnel_batch_request_to(&script_id, &data_ops),
        )
        .await;
        tracing::info!("batch: {} ops → {}, rtt={:?}", n_ops, &script_id[..script_id.len().min(8)], t0.elapsed());
        match result {
            Ok(Ok(batch_resp)) => {