← Field Notes
EN/ES

When Your Documents Are More Than Text

April 24, 2026via github · @HKUDS
AIopen-sourceworkflowtools

The problem nobody talks about

Most AI tools that answer questions from your documents are quietly cheating. They read the words, yes — but when they hit a chart showing quarterly sales, or a table comparing supplier prices, or a diagram of a process flow, they either skip it or turn it into nonsense. You get confident answers built on incomplete information.

RAG-Anything, built by a research lab at the University of Hong Kong, takes a different approach. It treats every part of a document as meaningful — text, yes, but also images, tables, equations, and graphs. Everything goes in. Nothing gets silently dropped.

What makes it interesting isn't just that it handles more formats. It's that it understands the relationships between them. A caption under a chart connects to the chart. A footnote connects to the table it references. The system builds a kind of knowledge map, not just a pile of extracted words.

For a business owner, this matters most if you work with contracts, financial reports, technical specs, or any document where the meaning lives as much in a table as in a sentence.

The team behind it also built LightRAG, which has become one of the most trusted tools in this space. This isn't a side experiment.

Words worth knowing

RAG — Stands for "Retrieval-Augmented Generation." It's how AI tools answer questions from your documents instead of just general knowledge. Think of it as giving the AI a specific library to search before it speaks.

Multimodal — Able to understand multiple types of content at once: text, images, tables, not just words on a page.

Open-source — The code is public and free to use. Anyone can inspect it, host it themselves, or build on top of it.


If you rely on documents where the numbers in a table matter as much as the words around them, it's worth asking whether your current AI setup is actually reading all of it.

Check it out →

Written by David at AC0.AI. Follow on @ac0hero

Want us to audit your site? Takes 60 seconds →