Capacity Explorer | AI Data Platform

Capacity Explorer Capacity Explorer

More Concurrent Users, Same GPU 相同 GPU，更多並行使用者

Drag SLA thresholds to find your workload's operating point. See how aiDAPTIV KV cache reuse doubles capacity from the same GPU.拖曳 SLA 門檻，找到你工作負載的最佳營運點。看 aiDAPTIV KV cache 重用如何從相同 GPU 獲得雙倍容量。

8K Short Q&A · chatbot replies 簡短問答・聊天回覆 16K Document summary · code review 文件摘要・程式碼審查 32K Multi-turn chat · long documents 多輪對話・長文件分析 64K Large codebase · report analysis 大型代碼庫・報告分析 128K Full book · deep research 完整書籍・深度研究

Upper bound 上限 ≤ 10.0 s

Lower bound 下限 ≥ 20 tok/s

Without aiDAPTIV Without aiDAPTIV

—

With aiDAPTIV With aiDAPTIV

—

— — Concurrent users 並行使用者 — increased 提升 GPU capacity GPU 算力 — saved 節省

Concurrent users 並行使用者

Without aiDAPTIV

With aiDAPTIV

SLA threshold

H200 ×1 32K context TTFT ≤ 10.0s TP ≥ 20 t/s

Want to run these numbers on your specific model and SLA requirements? 想針對你的特定模型與 SLA 需求進行計算？

Contact Sales 聯繫銷售