MinerU logo

MinerU

Computer VisionVerified95% conf

High-accuracy document parsing engine that converts PDFs, documents, and images into structured data using AI

mineru.net

📍 Shanghai, China

Verified Data

💰
Est. Revenue<$100K ARR

Open-source project with state funding and token-based API. Academic/research focus suggests minimal commercial revenue generation.

🚀
FundingState-funded by Shanghai Artificial Intelligence Laboratory

State-funded by the Shanghai Artificial Intelligence Laboratory (National AI Lab).

👥
Users58.7k+ GitHub stars
🔗github.com
🧑‍💻
Team Size96+ contributors
🔗github.com
📈
Growth58.7k+ GitHub stars
🔗github.com
🏷️
StageState-funded
📅
Founded2020

Company Profile

ModelOpen Core
VerticalAI/ML, Research, Document Processing
ClientsAI Agents, RAG Systems, Academic Labs
BuyersAI developers, researchers, and RAG system builders in the global technical community
PricingFree open source, Token-based Online API

Contact

Strategic Analysis

Strategy

Open-source first approach backed by state funding from Shanghai AI Laboratory. Focuses on becoming the standard document parsing solution for AI developers and researchers building RAG systems. Leverages academic research backing to drive adoption in the global AI community.

Tactics

GitHub-centric distribution with rapid release cycles and community engagement. Maintains dual presence on both international (.net) and Chinese (.org.cn) domains. Offers both free open-source access and commercial token-based API to monetize enterprise usage.

Competitive Positioning

Positioned as a high-performance, research-backed alternative to commercial document parsing solutions. Differentiates on parsing accuracy and open-source accessibility. Competes with proprietary solutions by offering transparency and customizability for AI developers.

Marketing Approach

Community-driven growth through GitHub engagement and open-source contributions. Leverages academic credibility from Shanghai AI Laboratory backing. Targets AI developer community through technical documentation and benchmark performance demonstrations.

Notable

Ranked #296 globally on GitHub, backed by Shanghai Artificial Intelligence Laboratory

🔗 Source ↗

Tech Stack

PythonPythonPyTorchPyTorchVLMDiffusion ModelsLaTeX
🔗 Source ↗

Recent News

Related Computer Vision Companies

Discovery Sources

opendatalab/MinerU
May 14, 2026

Signals

growth rate58.7k+ GitHub stars🔗 source ↗
team size96+ contributors🔗 source ↗
user count58.7k+ GitHub stars🔗 source ↗
funding raisedState-funded by Shanghai Artificial Intelligence Laboratory

State-funded by the Shanghai Artificial Intelligence Laboratory (National AI Lab).

trend indicatorOpen Source🔗 source ↗
trend indicatorMachine Learning🔗 source ↗
trend indicatorOCR🔗 source ↗
trend indicatorDocument Parsing🔗 source ↗

Evidence

github.com

Converts PDF · DOCX · PPTX · XLSX · Images · Web pages into structured Markdown / JSON · VLM+OCR dual engine · 109 languages

github.com

58.7k+ GitHub stars

github.com

96+ contributors

github.com

58.7k+ GitHub stars