ragflow / deepdoc / parser

Commit History

Update comments (#4569)
fa82d94

jinhai-2012 committed on

Refactor. (#4487)
83ee116

Kevin Hu committed on

Catch the exception while parsing pptx. (#4202)
74b229a

Kevin Hu committed on

fix chunk method "Table" losing content when the Excel file has multi… (#4123)
75faa66

ly0303521 committed on

Fix json file parse (#4004)
a11ffc5

jinhai-2012 committed on

Added static check at PR CI (#3921)
fe9b6b3

zhichyu committed on

Fix errors detected by Ruff (#3918)
0404a52

zhichyu committed on

Fix parsing JSON file error (#3829)
43bceb7

jinhai-2012 committed on

Fix: page_chars attribute does not exist in some formats of PDF (#3796)
3c857ed

cyhasuka committed on

Fix out of boundary. (#3786)
1275b47

Kevin Hu committed on

Edit chunk shall update instead of insert it (#3709)
1b2aab6

zhichyu committed on

Always open text file for write with UTF-8 (#3688)
9a73da2

zhichyu committed on

Added kb_id filter to knn. Fix #3458 (#3513)
aebd986

zhichyu committed on

Introduced beartype (#3460)
eae0334

zhichyu committed on

Move settings initialization after module init phase (#3438)
6101699

jinhai-2012 committed on

Use consistent log file names, introduced initLogger (#3403)
8bc2fc9

zhichyu committed on

fix bug about fetching knowledge graph (#3394)
6a9fa6b

Kevin Hu committed on

Rework logging (#3358)
22fe41e

zhichyu committed on

Fix:#3230 When parsing a docx file using the Book parsing method, to_page is always -1, resulting in a block count of 0 even if parsing is successful (#3249)
ff43695

kuschzzp Kevin Hu committed on

search between multiple indices for team function (#3079)
95863fc

Kevin Hu committed on

bigger resolution for OCR (#2919)
7b6220c

Kevin Hu committed on

fix: torch dependency start error (#2777)
0de98c4

chongcb chongchuanbing Kevin Hu committed on

trivial (#2650)
03a3e60

Kevin Hu committed on

rename get_txt to get_text (#2649)
d624a4e

Kevin Hu committed on

Add get_txt function (#2639)
82adb1c

yqkcn Kevin Hu committed on

Fix: renrank_model and pdf_parser bugs | Update: session API (#2601)
678763e

liuhua liuhua committed on

add lighten control (#2567)
dbcbb17

Kevin Hu committed on

fix parsing spaces in russian language PDFs (#1987) (#2427)
bac5213

Hyperb0t committed on

optimize text parser (#2144)
13b285d

Kevin Hu committed on

Format file format from Windows/dos to Unix (#1949)
aeb6dbc

jinhai-2012 committed on

fix parameter error (#1925)
1f75d02

Kevin Hu committed on

add sql to naive parser (#1908)
ef2a724

Kevin Hu committed on

add support for eml file parser (#1768)
6ed07a9

黄腾 aopstudio Kevin Hu committed on

Fix docx parser line bug (#1715)
dda4c86

H Kevin Hu committed on

fix: When parsing the bold content in PDF, the result is duplicated. (#1729)
971f83c

leecjnew committed on

Fix pdfparser content confusion (#1700)
eec0415

H committed on

pypdf2 to pypdf (#1684)
10534c3

Kevin Hu committed on

fix bug of ragflowdocxpparser (#1642)
fe5404c

Kevin Hu committed on

fix generate error (#1590)
b5e86a6

Kevin Hu committed on

fix bug about divided by zero (#1482)
4e6516f

Kevin Hu committed on

fix: Delete hardcode (#1464)
789efbc

Yuhao Tsui committed on

fix pdf_paser char content confusion (#1462)
1164cba

H committed on

fix pdf_parser content confusion (#1458)
ece4f03

H committed on

Support table for markdown file in general parser (#1278)
5bfd79c

aopstudio committed on

Fix occasional errors in pdf table recognition (#1277)
b4b278b

aopstudio committed on

feat: support json file (#1217)
77b7e10

WangBaoling KevinHuSh committed on

fix too long query exception (#1195)
9aa975e

KevinHuSh committed on

add self-rag (#1070)
a49657b

KevinHuSh committed on

Update readme and add license (#1018)
9cba22c

jinhai-2012 committed on

fix bug in pdf parser (#986)
eefeab4

KevinHuSh committed on