diff --git a/admin_manual/ai/app_context_chat.rst b/admin_manual/ai/app_context_chat.rst
index 11d6ac367..637c56d9c 100644
--- a/admin_manual/ai/app_context_chat.rst
+++ b/admin_manual/ai/app_context_chat.rst
@@ -50,7 +50,7 @@ Installation
 Scaling
 -------
 
-It is currently not possible to scale ExApps like Context Chat, we are working on this. One instance has a rough capacity of 1000 user queries per hour.
+It is currently not possible to scale ExApps like Context Chat, we are working on this. Based on our calculations an instance has a rough capacity of 1000 user requests per hour. However, this number is based on theory and we do appreciate real-world feedback on this.
 
 App store
 ---------
diff --git a/admin_manual/ai/app_llm2.rst b/admin_manual/ai/app_llm2.rst
index 2615e093f..1e4cb4d03 100644
--- a/admin_manual/ai/app_llm2.rst
+++ b/admin_manual/ai/app_llm2.rst
@@ -52,7 +52,7 @@ This app allows supplying alternate LLM models as *gguf* files in the ``/nc_app_
 Scaling
 -------
 
-It is currently not possible to scale this app, we are working on this. One instance has a rough capacity of 1000 user queries per hour.
+It is currently not possible to scale this app, we are working on this. Based on our calculations an instance has a rough capacity of 1000 user requests per hour. However, this number is based on theory and we do appreciate real-world feedback on this.
 
 App store
 ---------
diff --git a/admin_manual/ai/app_stt_whisper2.rst b/admin_manual/ai/app_stt_whisper2.rst
index bec66f42d..8237230bd 100644
--- a/admin_manual/ai/app_stt_whisper2.rst
+++ b/admin_manual/ai/app_stt_whisper2.rst
@@ -54,7 +54,7 @@ This app allows supplying alternate LLM models as *gguf* files in the ``/nc_app_
 Scaling
 -------
 
-It is currently not possible to scale this app, we are working on this. One instance has a rough capacity of 4h of transcription throughput per minute (measured with 8 CPU threads on an Intel(R) Xeon(R) Gold 6226R).
+It is currently not possible to scale this app, we are working on this. Based on our calculations an instance has a rough capacity of 4h of transcription throughput per minute (measured with 8 CPU threads on an Intel(R) Xeon(R) Gold 6226R). It is unclear how close to real-world usage this number is, so we do appreciate real-world feedback on this.
 
 App store
 ---------