Saiful Haq committed on
Commit
6f51432
·
1 Parent(s): e4902a2

added readme

Files changed (1)
  1. README.md +47 -0
README.md CHANGED
@@ -1,3 +1,50 @@
  ---
  license: mit
  ---
+
+
+ # IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages
+
+
+ Authors: Saiful Haq, Ashutosh Sharma, Pushpak Bhattacharyya
+
+ Paper link: https://arxiv.org/abs/2312.09508
+
+ ## Kindly cite our paper if you are using our datasets or models:
+
+ @article{haq2023indicirsuite,
+ title={IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages},
+ author={Haq, Saiful and Sharma, Ashutosh and Bhattacharyya, Pushpak},
+ journal={arXiv preprint arXiv:2312.09508},
+ year={2023}
+ }
+
+ ## About
+
+ This repository contains multilingual ColBERT models for 11 Indian languages.
+
+ ## Language Code to Language Mapping
+
+ asm_Beng: Assamese Language
+
+ ben_Beng: Bengali Language
+
+ guj_Gujr: Gujarati Language
+
+ hin_Deva: Hindi Language
+
+ kan_Knda: Kannada Language
+
+ mal_Mlym: Malayalam Language
+
+ mar_Deva: Marathi Language
+
+ ory_Orya: Oriya Language
+
+ pan_Guru: Punjabi Language
+
+ tam_Taml: Tamil Language
+
+ tel_Telu: Telugu Language
+
+
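
The mapping above can be kept as a small lookup table when selecting a model by language. The following is a minimal sketch and not part of the commit: the names `LANGUAGE_CODES`, `split_code`, and `language_name` are hypothetical, and the split into a language part and a script part is an assumption based on the visible `xxx_Yyyy` pattern of the codes.

```python
from typing import Tuple

# Language codes and names taken from the README list.
# The dictionary name and the helpers below are hypothetical, for illustration only.
LANGUAGE_CODES = {
    "asm_Beng": "Assamese",
    "ben_Beng": "Bengali",
    "guj_Gujr": "Gujarati",
    "hin_Deva": "Hindi",
    "kan_Knda": "Kannada",
    "mal_Mlym": "Malayalam",
    "mar_Deva": "Marathi",
    "ory_Orya": "Oriya",
    "pan_Guru": "Punjabi",
    "tam_Taml": "Tamil",
    "tel_Telu": "Telugu",
}


def split_code(code: str) -> Tuple[str, str]:
    """Split a code such as 'hin_Deva' into its (language, script) parts.

    Assumes the 'language_Script' pattern seen in the list above.
    """
    language, script = code.split("_", 1)
    return language, script


def language_name(code: str) -> str:
    """Return the language name for a code, or raise ValueError if unknown."""
    try:
        return LANGUAGE_CODES[code]
    except KeyError:
        raise ValueError(f"unknown language code: {code!r}") from None


if __name__ == "__main__":
    for code in LANGUAGE_CODES:
        language, script = split_code(code)
        print(f"{code}: {language_name(code)} (language={language}, script={script})")
```

Raising on an unknown code, rather than returning a default, makes a mistyped code fail early instead of silently selecting the wrong model.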