rabindralamsal committed on
Commit
d37d80f
·
verified ·
1 Parent(s): 58ba24a

Create README.md

Files changed (1)
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
+ ---
+ base_model:
+ - crisistransformers/CT-M1-Complete
+ ---
+ ## CrisisTransformers Mini Models
+
+ This is the *Medium* variant among the mini models (Medium, Small, Tiny) that were introduced in the paper ["Actionable Help" in Crises: A Novel Dataset and Resource-Efficient Models for Identifying Request and Offer Social Media Posts](https://arxiv.org/abs/2502.16839).
+
+ These models are the first crisis-specific mini models optimized for deployment in resource-constrained settings. Across 13 crisis classification tasks, our mini models surpass BERT and outperform or match RoBERTa, MPNet, and BERTweet, while being significantly smaller and faster. The Medium model is 47% smaller with 3.8% higher accuracy at 3.5x the speed; the Small model is 68% smaller with a 1.8% accuracy gain at 7.7x the speed; and the Tiny model is 83% smaller and matches BERT's accuracy at 18.6x the speed. All models outperform existing distilled variants, setting new benchmarks. Refer to the [associated paper](https://arxiv.org/abs/2502.16839) for more details.
+
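The size reductions quoted above can be roughly reproduced from the parameter counts listed in the Architecture table. The sketch below assumes the baseline is BERT-base at about 110 million parameters; that figure is an assumption and does not appear in this README, and the helper name `size_reduction_percent` is purely illustrative.

```python
# Assumption: the baseline is BERT-base with roughly 110 million parameters.
# This number is not stated in the README.
BERT_BASE_PARAMS_M = 110

# Parameter counts in millions, copied from the Architecture table.
MINI_MODEL_PARAMS_M = {"Medium": 58, "Small": 35, "Tiny": 19}


def size_reduction_percent(params_m, baseline_m=BERT_BASE_PARAMS_M):
    """Return how much smaller a model is than the baseline, in percent."""
    return 100.0 * (1.0 - params_m / baseline_m)


# Round to whole percentages, the precision used in the text.
reductions = {
    name: round(size_reduction_percent(params))
    for name, params in MINI_MODEL_PARAMS_M.items()
}

for name, reduction in reductions.items():
    print(f"{name}: about {reduction}% smaller than the assumed baseline")
```

Under that assumption the rounded reductions agree with the 47%, 68%, and 83% figures given in the text.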
+ ## Architecture
+ | mini model | # attention heads | # layers | intermediate size | output size | # parameters | source |
+ |--|--|--|--|--|--|--|
+ | Medium | 8 | 8 | 2048 | 512 | 58 million | [crisistransformers/medium](https://huggingface.co/crisistransformers/medium) |
+ | Small | 6 | 6 | 1536 | 384 | 35 million | [crisistransformers/small](https://huggingface.co/crisistransformers/small) |
+ | Tiny | 4 | 4 | 1024 | 256 | 19 million | [crisistransformers/tiny](https://huggingface.co/crisistransformers/tiny) |
+
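Two regularities in the table can be read off with a little arithmetic. The sketch below assumes that "output size" is the encoder's hidden size, which the README does not state explicitly; the `MiniModelSpec` class and its property names are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MiniModelSpec:
    """One row of the Architecture table (hypothetical container)."""
    name: str
    num_attention_heads: int
    num_layers: int
    intermediate_size: int
    output_size: int  # assumed to be the hidden size

    @property
    def head_dim(self) -> int:
        # Width of each attention head if the hidden size is split evenly.
        return self.output_size // self.num_attention_heads

    @property
    def ffn_ratio(self) -> float:
        # Feed-forward expansion relative to the hidden size.
        return self.intermediate_size / self.output_size


# Values copied from the table above.
SPECS = [
    MiniModelSpec("Medium", 8, 8, 2048, 512),
    MiniModelSpec("Small", 6, 6, 1536, 384),
    MiniModelSpec("Tiny", 4, 4, 1024, 256),
]

for spec in SPECS:
    print(f"{spec.name}: head_dim={spec.head_dim}, ffn_ratio={spec.ffn_ratio}")
```

If that reading of the columns is right, all three variants use 64-dimensional attention heads and a 4x feed-forward expansion, so they differ only in width and depth.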
+ ## Uses
+ Like [BERT](https://huggingface.co/bert-base-cased) and [RoBERTa](https://huggingface.co/roberta-base), these models are meant to be fine-tuned for downstream tasks.
+
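As a starting point for fine-tuning, a model can be loaded with a fresh classification head through the Hugging Face `transformers` Auto classes. This is a minimal sketch, not the authors' training setup: it assumes the `transformers` library is installed, that the checkpoint is compatible with `AutoModelForSequenceClassification`, and that the task is binary (`num_labels=2`). The helper name `load_for_classification` is hypothetical.

```python
def load_for_classification(model_id="crisistransformers/medium", num_labels=2):
    """Load a tokenizer and a model with a newly initialised classification head.

    Assumptions: `transformers` is installed and the checkpoint works with the
    Auto classes. `num_labels=2` is only an illustrative default.
    """
    # Imported lazily so that defining this helper does not require the library.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_id, num_labels=num_labels
    )
    return tokenizer, model
```

Calling `tokenizer, model = load_for_classification()` downloads the checkpoint from the Hub. The classification head starts from random weights, so the model needs to be trained on labelled data before its predictions mean anything.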
+ ## Citation
+ If you use these models in your research or project, please cite the following paper:
+ ```
+ @article{lamsal2025actionable,
+   title={"Actionable Help" in Crises: A Novel Dataset and Resource-Efficient Models for Identifying Request and Offer Social Media Posts},
+   author={Rabindra Lamsal and
+           Maria Rodriguez Read and
+           Shanika Karunasekera and
+           Muhammad Imran},
+   year={2025},
+   eprint={2502.16839},
+   archivePrefix={arXiv},
+   primaryClass={cs.CL}
+ }
+ ```