Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -37,6 +37,7 @@ It is a fine-tune of **Qwen 2.5-VL-7B** using ~10 k synthetic doc-to-Reasoning-t
 ### Arena ranking (using trueskill-2 ranking system):
 <p align="center">
 | Rank | Model                                   | μ     | σ    | μ − 3σ |
 | ---- | --------------------------------------- | ----- | ---- | ------ |
 | 🥇 1 | **gemini-flash-reasoning**              | 26.75 | 0.80 | 24.35  |
@@ -46,6 +47,7 @@ It is a fine-tune of **Qwen 2.5-VL-7B** using ~10 k synthetic doc-to-Reasoning-t
 | 5    | **gpt-4o**                              | 24.48 | 0.80 | 22.08  |
 | 6    | **gemini-flash-w/o\_reasoning**         | 24.11 | 0.79 | 21.74  |
 | 7    | **RolmoOCR**                            | 23.53 | 0.82 | 21.07  |
 </p>
 *we plan to realease a markdown arena, similar to llmArena, for complex document to markdown task to help evaluate different document to markdown solution*

 ### Arena ranking (using trueskill-2 ranking system):
 <p align="center">
 | Rank | Model                                   | μ     | σ    | μ − 3σ |
 | ---- | --------------------------------------- | ----- | ---- | ------ |
 | 🥇 1 | **gemini-flash-reasoning**              | 26.75 | 0.80 | 24.35  |
 | 5    | **gpt-4o**                              | 24.48 | 0.80 | 22.08  |
 | 6    | **gemini-flash-w/o\_reasoning**         | 24.11 | 0.79 | 21.74  |
 | 7    | **RolmoOCR**                            | 23.53 | 0.82 | 21.07  |
 </p>
 *we plan to realease a markdown arena, similar to llmArena, for complex document to markdown task to help evaluate different document to markdown solution*