diff --git a/docs.json b/docs.json
index 86cc4b30..776f2392 100644
--- a/docs.json
+++ b/docs.json
@@ -5030,6 +5030,18 @@
       {
         "source": "/self-host/:slug*",
         "destination": "/en/self-host/:slug*"
+      },
+      {
+        "source": "/en/guides/model-configuration/load-balancing",
+        "destination": "/en/use-dify/workspace/model-providers#configure-model-load-balancing"
+      },
+      {
+        "source": "/zh-hans/guides/model-configuration/load-balancing",
+        "destination": "/zh/use-dify/workspace/model-providers#配置模型负载均衡"
+      },
+      {
+        "source": "/ja-jp/guides/model-configuration/load-balancing",
+        "destination": "/ja/use-dify/workspace/model-providers#負荷分散の構成"
       }
     ],
     "navbar": {
diff --git a/zh/use-dify/workspace/model-providers.mdx b/zh/use-dify/workspace/model-providers.mdx
index 728a2404..18f6fb58 100644
--- a/zh/use-dify/workspace/model-providers.mdx
+++ b/zh/use-dify/workspace/model-providers.mdx
@@ -40,20 +40,20 @@ icon: "brain-arrow-curved-right"
 
 ## 支持的供应商
 
-**大型语言模型：**
+**大型语言模型**：
 - OpenAI (GPT-4, GPT-3.5-turbo)
 - Anthropic (Claude)  
 - Google (Gemini)
 - Cohere
 - 通过Ollama的本地模型
 
-**文本嵌入模型：**
+**文本嵌入模型**：
 - OpenAI Embeddings
 - Cohere Embeddings
 - Azure OpenAI
 - 本地文本嵌入模型
 
-**专用模型：**
+**专用模型**：
 - 图像生成 (DALL-E, Stable Diffusion)
 - 语音 (Whisper, ElevenLabs)
 - 内容审核API
@@ -62,25 +62,25 @@ icon: "brain-arrow-curved-right"
 
 <Tabs>
   <Tab title="OpenAI">
-    **必需：**来自OpenAI平台的API密钥
+    **必需**：来自OpenAI平台的API密钥
     
-    **可选：**用于Azure OpenAI或代理的自定义基础URL，用于组织范围使用的组织ID
+    **可选**：用于Azure OpenAI或代理的自定义基础URL，用于组织范围使用的组织ID
     
-    **可用模型：**GPT-4、GPT-3.5-turbo、DALL-E、Whisper、文本嵌入
+    **可用模型**：GPT-4、GPT-3.5-turbo、DALL-E、Whisper、文本嵌入
   </Tab>
   
   <Tab title="Anthropic">
-    **必需：**来自Anthropic控制台的API密钥
+    **必需**：来自Anthropic控制台的API密钥
     
-    **可用模型：**Claude 3 (Opus, Sonnet, Haiku)、Claude 2.1、Claude Instant
+    **可用模型**：Claude 3 (Opus, Sonnet, Haiku)、Claude 2.1、Claude Instant
   </Tab>
   
   <Tab title="本地 (Ollama)">
-    **必需：**Ollama服务器URL（通常为http://localhost:11434）
+    **必需**：Ollama服务器URL（通常为http://localhost:11434）
     
-    **设置：**安装Ollama，拉取模型（`ollama pull llama2`），配置Dify连接
+    **设置**：安装Ollama，拉取模型（`ollama pull llama2`），配置Dify连接
     
-    **优势：**完全数据隐私，无外部API成本，自定义模型微调
+    **优势**：完全数据隐私，无外部API成本，自定义模型微调
   </Tab>
 </Tabs>
 
@@ -160,7 +160,7 @@ icon: "brain-arrow-curved-right"
 ## 配置模型负载均衡
 
 <Info>
-负载均衡为付费特性，可通过[订阅 SaaS 付费服务或者购买企业版](https://dify.ai/zh/pricing)以启用。
+负载均衡为付费特性，可通过 [订阅 SaaS 付费服务或者购买企业版](https://dify.ai/zh/pricing) 以启用。
 </Info>
 
 通常，模型供应商会限制用户在指定时间内访问 API 服务的次数（模型速率限制），以确保服务的稳定性与合理使用。对于企业级应用，来自单一凭据的高并发请求容易触发模型速率限制，从而影响用户访问。