Commit History
Refactor. (#4487)
83ee116
Kevin Hu
commited on
Catch the exception while parsing pptx. (#4202)
74b229a
Kevin Hu
commited on
fix chunk method "Table" losing content when the Excel file has multi… (#4123)
75faa66
Fix json file parse (#4004)
a11ffc5
Added static check at PR CI (#3921)
fe9b6b3
Fix errors detected by Ruff (#3918)
0404a52
Fix parsing JSON file error (#3829)
43bceb7
Fix: page_chars attribute does not exist in some formats of PDF (#3796)
3c857ed
Fix out of boundary. (#3786)
1275b47
Kevin Hu
commited on
Edit chunk shall update instead of insert it (#3709)
1b2aab6
Always open text file for write with UTF-8 (#3688)
9a73da2
Added kb_id filter to knn. Fix #3458 (#3513)
aebd986
Introduced beartype (#3460)
eae0334
Move settings initialization after module init phase (#3438)
6101699
Use consistent log file names, introduced initLogger (#3403)
8bc2fc9
fix bug about fetching knowledge graph (#3394)
6a9fa6b
Kevin Hu
commited on
Rework logging (#3358)
22fe41e
Fix:#3230 When parsing a docx file using the Book parsing method, to_page is always -1, resulting in a block count of 0 even if parsing is successful (#3249)
ff43695
kuschzzp
Kevin Hu
commited on
search between multiple indiices for team function (#3079)
95863fc
Kevin Hu
commited on
bigger resolution for OCR (#2919)
7b6220c
Kevin Hu
commited on
fix: torch dependency start error (#2777)
0de98c4
trival (#2650)
03a3e60
Kevin Hu
commited on
rename get_txt to get_text (#2649)
d624a4e
Kevin Hu
commited on
Add get_txt function (#2639)
82adb1c
yqkcn
Kevin Hu
commited on
Fix: renrank_model and pdf_parser bugs | Update: session API (#2601)
678763e
liuhua
liuhua
commited on
add lighten control (#2567)
dbcbb17
Kevin Hu
commited on
fix parsing spaces in russian language PDFs (#1987) (#2427)
bac5213
optimize text parser (#2144)
13b285d
Kevin Hu
commited on
Format file format from Windows/dos to Unix (#1949)
aeb6dbc
fix parameter error (#1925)
1f75d02
Kevin Hu
commited on
add sql to naive parser (#1908)
ef2a724
Kevin Hu
commited on
add support for eml file parser (#1768)
6ed07a9
Fix docx parser line bug (#1715)
dda4c86
H
Kevin Hu
commited on
fix: When parsing the bold content in PDF, the result is duplicated. (#1729)
971f83c
Fix pdfparser content confusion (#1700)
eec0415
H
commited on
pypdf2 to pypdf (#1684)
10534c3
Kevin Hu
commited on
fix bug of ragflowdocxpparser (#1642)
fe5404c
Kevin Hu
commited on
fix generate error (#1590)
b5e86a6
Kevin Hu
commited on
fix bug about divided by zero (#1482)
4e6516f
Kevin Hu
commited on
fix: Delete hardcode (#1464)
789efbc
Yuhao Tsui
commited on
fix pdf_paser char content confusion (#1462)
1164cba
H
commited on
fix pdf_parser content confusion (#1458)
ece4f03
H
commited on
Support table for markdown file in general parser (#1278)
5bfd79c
Fix occasional errors in pdf table recognition (#1277)
b4b278b
feat: support json file (#1217)
77b7e10
fix too long query exception (#1195)
9aa975e
KevinHuSh
commited on
add self-rag (#1070)
a49657b
KevinHuSh
commited on
Update readme and add license (#1018)
9cba22c
fix bug in pdf parser (#986)
eefeab4
KevinHuSh
commited on