Tag: multi-tenant
Recipes where more than one tenant shares the same fabric resource (GPU, ingress, scheduler quota) with isolation, fair-share, and per-tenant accounting.
11 recipes carry this tag, ordered by recipe number:
- Recipe 5 — An Inference Server That Shares GPUs Without Containers ·
gpu-and-inference/shared-inference - Recipe 26 — Multi-Tenant Compute With Preemption and Attestation ·
security-audit-and-cost - Recipe 53 — Per-Request Leased Inference Gateway ·
gpu-and-inference/shared-inference - Recipe 54 — Lease-Scoped Data Clean Room ·
security-audit-and-cost - Recipe 57 — Per-Project Fair-Share Policy ·
security-audit-and-cost - Recipe 59 — Cost Attribution With Accounting Tags ·
security-audit-and-cost - Recipe 60 — Tenant Audit Dashboard From the Typed Chain ·
security-audit-and-cost - Recipe 66 — Batching Four Tenants Into One Decode Forward Pass ·
gpu-and-inference/shared-inference - Recipe 67 — Hot-Rebind Inference Continuity After a Lease Revocation ·
gpu-and-inference/revocation-and-recovery - Recipe 68 — Continuous Batching With Per-Tenant Quotas ·
gpu-and-inference/shared-inference - Recipe 69 — Per-Request Audit Attribution for Inference ·
gpu-and-inference/audit-and-attribution