diff --git a/admin_manual/ai/app_assistant.rst b/admin_manual/ai/app_assistant.rst
index 89796d6f1..a5e5656e9 100644
--- a/admin_manual/ai/app_assistant.rst
+++ b/admin_manual/ai/app_assistant.rst
@@ -111,6 +111,13 @@ In order to make use of our AI agent feature, offering the execution of actions

 You will also need a text processing provider as specified above (ie. *llm2* or *integration_openai*).

+Text-To-Speech
+~~~~~~~~~~~~~~
+
+In order to make use of Text-To-Speech, you will need an app that provides a Text-To-Speech backend:
+
+* *integration_openai* - Integrates with the OpenAI API to provide AI functionality from OpenAI servers (Customer support available upon request; see :ref:`AI as a Service`)
+
 Configuration
 -------------

diff --git a/admin_manual/ai/overview.rst b/admin_manual/ai/overview.rst
index cf1e2be87..2cf812f0d 100644
--- a/admin_manual/ai/overview.rst
+++ b/admin_manual/ai/overview.rst
@@ -66,6 +66,7 @@ Nextcloud uses modularity to separate raw AI functionality from the Graphical Us
    "Context Chat","`Nextcloud Assistant Context Chat `_","Yellow","Yes","Yes","No","Yes"
    "","`Nextcloud Assistant Context Chat (Backend) `_","Yellow","Yes","Yes","No","Yes"
    "Context Agent","`Nextcloud Context Agent `_","Green","Yes","Yes","Yes","Yes"
+   "Text To Speech","`OpenAI Text To Speech `_","Red","No","No","No","No"


 Ethical AI Rating
diff --git a/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst b/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst
index d960f5bb1..2a0f1b618 100644
--- a/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst
+++ b/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_32.rst
@@ -36,7 +36,7 @@ Back-end changes
 Added APIs
 ^^^^^^^^^^

-- TBD
+- New task type ``OCP\TaskProcessing\TaskTypes\TextToSpeech`` to convert text to speech.

 Changed APIs
 ^^^^^^^^^^^^

diff --git a/developer_manual/digging_deeper/task_processing.rst b/developer_manual/digging_deeper/task_processing.rst
index d6267dcba..7d3e30f5a 100644
--- a/developer_manual/digging_deeper/task_processing.rst
+++ b/developer_manual/digging_deeper/task_processing.rst
@@ -116,6 +116,11 @@ The following built-in task types are available:
      * ``input``: ``Text``
    * Output shape:
      * ``output``: ``Text``
+ * ``'core:text2speech'``: This task type is for generating speech audio from text. It is implemented by ``\OCP\TaskProcessing\TaskTypes\TextToSpeech``
+   * Input shape:
+     * ``input``: ``Text``
+   * Output shape:
+     * ``speech``: ``Audio``

 Task types can be disabled in the AI admin settings so they are not available for the Assistant or other apps even if they are implemented. All implemented Task types are enabled by default.
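
Note on the ``'core:text2speech'`` shapes: the sketch below is only an illustration of the input and output slots listed in the last hunk, written in Python rather than against the Nextcloud PHP API. The names ``validate`` and ``fake_provider`` are hypothetical, ``Audio`` is modelled as raw bytes as an assumption, and the "provider" returns placeholder bytes instead of real speech.

```python
# Hypothetical model of the shapes listed in the diff; this is not Nextcloud code.
TEXT2SPEECH_ID = "core:text2speech"
INPUT_SHAPE = {"input": str}      # ``input``: ``Text``
OUTPUT_SHAPE = {"speech": bytes}  # ``speech``: ``Audio`` (assumed here to be raw bytes)


def validate(payload: dict, shape: dict) -> None:
    """Raise ValueError unless payload has exactly the slots in shape, with matching types."""
    if set(payload) != set(shape):
        raise ValueError(f"expected slots {sorted(shape)}, got {sorted(payload)}")
    for name, expected in shape.items():
        if not isinstance(payload[name], expected):
            raise ValueError(f"slot {name!r} must be {expected.__name__}")


def fake_provider(task_input: dict) -> dict:
    """Stand-in for a Text-To-Speech backend: returns placeholder bytes, not real audio."""
    validate(task_input, INPUT_SHAPE)
    output = {"speech": task_input["input"].encode("utf-8")}
    validate(output, OUTPUT_SHAPE)
    return output
```

A caller would pass ``{"input": "Hello"}`` and read the result from the ``speech`` slot; a payload using any other slot name, such as ``text``, is rejected before the provider runs.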