fix(admin_manual): Make capacity estimates more skeptical

Signed-off-by: Marcel Klehr <mklehr@gmx.net>
2026-01-03 02:09:45 +07:00 · 2024-03-26 10:39:50 +01:00
parent 26a60c4295
commit 91b8e2f3e5
3 changed files with 3 additions and 3 deletions
--- a/admin_manual/ai/app_context_chat.rst
+++ b/admin_manual/ai/app_context_chat.rst
@@ -50,7 +50,7 @@ Installation
 Scaling
 -------

-It is currently not possible to scale ExApps like Context Chat, we are working on this. One instance has a rough capacity of 1000 user queries per hour.
+It is currently not possible to scale ExApps like Context Chat, we are working on this. Based on our calculations an instance has a rough capacity of 1000 user requests per hour. However, this number is based on theory and we do appreciate real-world feedback on this.

 App store
 ---------
--- a/admin_manual/ai/app_llm2.rst
+++ b/admin_manual/ai/app_llm2.rst
@@ -52,7 +52,7 @@ This app allows supplying alternate LLM models as *gguf* files in the ``/nc_app_
 Scaling
 -------

-It is currently not possible to scale this app, we are working on this. One instance has a rough capacity of 1000 user queries per hour.
+It is currently not possible to scale this app, we are working on this. Based on our calculations an instance has a rough capacity of 1000 user requests per hour. However, this number is based on theory and we do appreciate real-world feedback on this.

 App store
 ---------
--- a/admin_manual/ai/app_stt_whisper2.rst
+++ b/admin_manual/ai/app_stt_whisper2.rst
@@ -54,7 +54,7 @@ This app allows supplying alternate LLM models as *gguf* files in the ``/nc_app_
 Scaling
 -------

-It is currently not possible to scale this app, we are working on this. One instance has a rough capacity of 4h of transcription throughput per minute (measured with 8 CPU threads on an Intel(R) Xeon(R) Gold 6226R).
+It is currently not possible to scale this app, we are working on this. Based on our calculations an instance has a rough capacity of 4h of transcription throughput per minute (measured with 8 CPU threads on an Intel(R) Xeon(R) Gold 6226R). It is unclear how close to real-world usage this number is, so we do appreciate real-world feedback on this.

 App store
 ---------