What is Chunkr

 16.09.2024, 09.04.2025 -  

What is Chunkr

Discover an Incredible Website - https://github.com/lumina-ai-inc/chunkr  

Description

Chunkr is an open source PDF data extraction tool based on visual models, focusing on document layout analysis, OCR and chunking processing. It is able to convert PDF, DOC, PPT, and XLS files into structured data for RAG (retrieval enhanced generation) and LLM (large language model). Chunkr uses advanced visual models and OCR technology to extract bounding boxes and structured text from documents, supporting the processing of text, tables, images and handwritten content. Maintained by Lumina AI Inc., supports GPU and CPU environments, and provides free trial and pricing solutions.