mirror of
https://github.com/opendatalab/MinerU.git
synced 2026-03-30 04:28:33 +07:00
- Replace MuPDF with pdfminer for detecting invalid characters in PDFs - Uncomment and update the detect_invalid_chars function to use pdfminer - Update the check_invalid_chars function in pdf_meta_scan.py to use the new implementation