What is MinerU
20.02.2025, 09.04.2025 -
What is MinerU
Discover an Incredible Website - https://opendatalab.com/OpenSourceTools/Extractor/PDF
Description
MinerU is an open source high-quality data extraction tool developed by the OpenDataLab team of Shanghai Artificial Intelligence Laboratory, focusing on efficiently extracting content from complex PDF documents, web pages and e-books. It can convert multimodal PDFs containing pictures, formulas and tables into Markdown formats (such as markdown, json), and has a high-precision analysis toolchain, supports multiple input models, supports automatic identification of garbled codes, convert formulas to LaTex, retain document structure, and supports accurate recognition in 176 languages. It is suitable for academic, financial, legal and other fields, and is compatible with Windows/Linux/Mac platforms.