extract chunk with vision LLM? #7203

Chunshan-Theta · 2025-04-22T08:22:21Z

I'm wondering whether LLMs with vision capabilities could be used to process documents. I'm having trouble with OCR (DeepDoc) on educational materials for young children: the layouts are often unusual, and the OCR frequently makes mistakes. Could a vision LLM extract the information more accurately?
