infly
/

Infinity-Parser-7B

Model card Files Files and versions

zuminghuang commited on Oct 17

Commit

16d34bf

·

verified ·

1 Parent(s): d17c41b

Update README.md

Files changed (1) hide show

README.md +19 -3

README.md CHANGED Viewed

@@ -1,3 +1,19 @@
----
-license: apache-2.0
----

+# Infinity-Parser-7B
+<a><img src="assets/logo.png" height="16" width="16" style="display: inline"><b> Paper </b></a> |
+<a href="https://github.com/infly-ai/INF-MLLM/tree/main/Infinity-Parser"><img src="https://github.githubassets.com/images/modules/logos_page/GitHub-Mark.png" height="16" width="16" style="display: inline"><b> Github </b></a> |
+<a href="https://huggingface.co/spaces/infly/Infinity-Parser-Demo">💬<b> Web Demo </b></a>
+# Introduction
+We develop Infinity-Parser, an end-to-end scanned document parsing model trained with reinforcement learning. By incorporating verifiable rewards based on layout and content, Infinity-Parser maintains the original document's structure and content with high fidelity. Extensive evaluations on benchmarks in cluding OmniDocBench, olmOCR-Bench, PubTabNet, and FinTabNet show that Infinity-Parser consistently achieves state-of-the-art performance across a broad range of document types, languages, and structural complexities, substantially outperforming both specialized document parsing systems and general-purpose vision-language models.
+# Architecture
+Overview of Infinity-Parser training framework. Our model is optimized via reinforcement finetuning with edit distance, layout, and order-based rewards.
+![image](assets/architecture.png)
+# License
+This dataset is licensed under apache-2.0.