From 87dc40f5e71dd3bd1d63a40215b42077599cfd80 Mon Sep 17 00:00:00 2001 From: myhloli Date: Sat, 16 Nov 2024 02:29:56 +0800 Subject: [PATCH] docs: update feature description for table conversion - Changed the description for table conversion feature in both English and Chinese README files - Specified that tables are automatically converted to HTML format instead of LaTeX or HTML --- README.md | 2 +- README_zh-CN.md | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 39637ca6..99d47e0a 100644 --- a/README.md +++ b/README.md @@ -121,7 +121,7 @@ https://github.com/user-attachments/assets/4bea02c9-6d54-4cd6-97ed-dff14340982c - Preserve the structure of the original document, including headings, paragraphs, lists, etc. - Extract images, image descriptions, tables, table titles, and footnotes. - Automatically recognize and convert formulas in the document to LaTeX format. -- Automatically recognize and convert tables in the document to LaTeX or HTML format. +- Automatically recognize and convert tables in the document to HTML format. - Automatically detect scanned PDFs and garbled PDFs and enable OCR functionality. - OCR supports detection and recognition of 84 languages. - Supports multiple output formats, such as multimodal and NLP Markdown, JSON sorted by reading order, and rich intermediate formats. diff --git a/README_zh-CN.md b/README_zh-CN.md index 9b30b577..e1648dbd 100644 --- a/README_zh-CN.md +++ b/README_zh-CN.md @@ -121,7 +121,7 @@ https://github.com/user-attachments/assets/4bea02c9-6d54-4cd6-97ed-dff14340982c - 保留原文档的结构,包括标题、段落、列表等 - 提取图像、图片描述、表格、表格标题及脚注 - 自动识别并转换文档中的公式为LaTeX格式 -- 自动识别并转换文档中的表格为LaTeX或HTML格式 +- 自动识别并转换文档中的表格为HTML格式 - 自动检测扫描版PDF和乱码PDF,并启用OCR功能 - OCR支持84种语言的检测与识别 - 支持多种输出格式,如多模态与NLP的Markdown、按阅读顺序排序的JSON、含有丰富信息的中间格式等