mirror of https://github.com/langgenius/dify-docs.git
synced 2026-03-27 13:28:32 +07:00
add redirect url for load-balancing content & fix bold rendering (#573)
Co-authored-by: Riskey <riskey47@dify.ai>
docs.json (12 lines changed)
@@ -5030,6 +5030,18 @@
 {
   "source": "/self-host/:slug*",
   "destination": "/en/self-host/:slug*"
+},
+{
+  "source": "/en/guides/model-configuration/load-balancing",
+  "destination": "/en/use-dify/workspace/model-providers#configure-model-load-balancing"
+},
+{
+  "source": "/zh-hans/guides/model-configuration/load-balancing",
+  "destination": "/zh/use-dify/workspace/model-providers#配置模型负载均衡"
+},
+{
+  "source": "/ja-jp/guides/model-configuration/load-balancing",
+  "destination": "/ja/use-dify/workspace/model-providers#負荷分散の構成"
 }
 ],
 "navbar": {
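For readers unfamiliar with the redirect entries in this hunk, here is a minimal Python sketch of how such rules could be resolved. It assumes that `:slug*` captures the remainder of the path and is substituted into the destination, and that every other source must match exactly. `resolve_redirect` and `RULES` are hypothetical names, and the docs platform's actual resolver may behave differently.

```python
from typing import Optional

WILDCARD = ":slug*"  # assumed to mean "the rest of the path"


def resolve_redirect(path: str, rules: list[dict]) -> Optional[str]:
    """Return the destination of the first rule that matches `path`, else None."""
    for rule in rules:
        source, destination = rule["source"], rule["destination"]
        if source.endswith(WILDCARD):
            prefix = source[: -len(WILDCARD)]
            if path.startswith(prefix):
                # Carry the captured remainder over into the destination.
                remainder = path[len(prefix):]
                return destination.replace(WILDCARD, remainder)
        elif path == source:
            return destination
    return None


# Two of the rules from the hunk above, copied verbatim.
RULES = [
    {"source": "/self-host/:slug*", "destination": "/en/self-host/:slug*"},
    {
        "source": "/en/guides/model-configuration/load-balancing",
        "destination": "/en/use-dify/workspace/model-providers#configure-model-load-balancing",
    },
]
```

Under these assumptions, the old load-balancing URL resolves to the `#configure-model-load-balancing` anchor on the model-providers page, while any path under `/self-host/` keeps its tail and gains the `/en` prefix.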
@@ -40,20 +40,20 @@ icon: "brain-arrow-curved-right"

 ## Supported Providers

-**Large language models:**
+**Large language models**:
 - OpenAI (GPT-4, GPT-3.5-turbo)
 - Anthropic (Claude)
 - Google (Gemini)
 - Cohere
 - Local models via Ollama

-**Text embedding models:**
+**Text embedding models**:
 - OpenAI Embeddings
 - Cohere Embeddings
 - Azure OpenAI
 - Local text embedding models

-**Specialized models:**
+**Specialized models**:
 - Image generation (DALL-E, Stable Diffusion)
 - Speech (Whisper, ElevenLabs)
 - Content moderation APIs
@@ -62,25 +62,25 @@ icon: "brain-arrow-curved-right"

 <Tabs>
 <Tab title="OpenAI">
-**Required:**API key from the OpenAI platform
+**Required**: API key from the OpenAI platform

-**Optional:**Custom base URL for Azure OpenAI or a proxy; organization ID for organization-wide usage
+**Optional**: Custom base URL for Azure OpenAI or a proxy; organization ID for organization-wide usage

-**Available models:**GPT-4, GPT-3.5-turbo, DALL-E, Whisper, text embeddings
+**Available models**: GPT-4, GPT-3.5-turbo, DALL-E, Whisper, text embeddings
 </Tab>

 <Tab title="Anthropic">
-**Required:**API key from the Anthropic console
+**Required**: API key from the Anthropic console

-**Available models:**Claude 3 (Opus, Sonnet, Haiku), Claude 2.1, Claude Instant
+**Available models**: Claude 3 (Opus, Sonnet, Haiku), Claude 2.1, Claude Instant
 </Tab>

 <Tab title="Local (Ollama)">
-**Required:**Ollama server URL (typically http://localhost:11434)
+**Required**: Ollama server URL (typically http://localhost:11434)

-**Setup:**Install Ollama, pull a model (`ollama pull llama2`), and configure the connection in Dify
+**Setup**: Install Ollama, pull a model (`ollama pull llama2`), and configure the connection in Dify

-**Benefits:**Full data privacy, no external API costs, custom model fine-tuning
+**Benefits**: Full data privacy, no external API costs, custom model fine-tuning
 </Tab>
 </Tabs>

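The Ollama tab in the hunk above mentions a server URL and `ollama pull llama2`. Below is a small Python sketch of how one might confirm that the pulled model is visible on the server before configuring the connection in Dify. It assumes Ollama's `GET /api/tags` endpoint and a response shaped like `{"models": [{"name": ...}]}`; the helper names are hypothetical and nothing here is part of Dify itself.

```python
import json
from urllib.request import urlopen


def tags_url(base_url: str) -> str:
    """Build the URL of the (assumed) model-listing endpoint."""
    return base_url.rstrip("/") + "/api/tags"


def has_model(payload: dict, model: str) -> bool:
    """Check whether `model` appears in a tags payload, with or without a ':tag' suffix."""
    names = [entry.get("name", "") for entry in payload.get("models", [])]
    return any(name == model or name.startswith(model + ":") for name in names)


def fetch_tags(base_url: str = "http://localhost:11434", timeout: float = 5.0) -> dict:
    """Query a running Ollama server; raises URLError if it is unreachable."""
    with urlopen(tags_url(base_url), timeout=timeout) as response:
        return json.load(response)
```

With a server running locally, `has_model(fetch_tags(), "llama2")` would report whether the model from the setup step is available.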
@@ -160,7 +160,7 @@ icon: "brain-arrow-curved-right"
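 ## Configure Model Load Balancing

 <Info>
-Load balancing is a paid feature. You can[subscribe to the paid SaaS service or purchase the Enterprise edition](https://dify.ai/zh/pricing)to enable it.
+Load balancing is a paid feature. You can [subscribe to the paid SaaS service or purchase the Enterprise edition](https://dify.ai/zh/pricing) to enable it.
 </Info>

 Model providers typically limit how many times a user can call their API within a given period (the model rate limit) to keep the service stable and fairly used. For enterprise applications, high-concurrency requests from a single credential can easily hit that rate limit and disrupt user access.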
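The last paragraph of the hunk describes the problem that load balancing addresses: a single credential hitting the provider's rate limit. One way to picture the idea is to rotate requests across several credentials and temporarily skip any that was just rate limited. This is a minimal sketch, assuming round-robin selection and a fixed cooldown; `CredentialPool`, the 60-second default, and the key labels are hypothetical and are not taken from Dify's implementation.

```python
import itertools
import time


class CredentialPool:
    """Round-robin over credentials, skipping those that are cooling down."""

    def __init__(self, credentials, cooldown_seconds=60.0, clock=time.monotonic):
        if not credentials:
            raise ValueError("at least one credential is required")
        self._credentials = list(credentials)
        self._cooldown = cooldown_seconds
        self._clock = clock  # injectable so the behaviour can be tested
        self._blocked_until = {}
        self._cycle = itertools.cycle(self._credentials)

    def mark_rate_limited(self, credential):
        """Record that a credential hit its rate limit (e.g. an HTTP 429)."""
        self._blocked_until[credential] = self._clock() + self._cooldown

    def next_credential(self):
        """Return the next usable credential in rotation order."""
        now = self._clock()
        # Try each credential at most once per call.
        for _ in range(len(self._credentials)):
            candidate = next(self._cycle)
            if self._blocked_until.get(candidate, 0.0) <= now:
                return candidate
        raise RuntimeError("all credentials are cooling down")
```

A caller would ask the pool for a credential before each request and call `mark_rate_limited` whenever the provider rejects a request for exceeding the limit.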