Text Classification
Transformers
Safetensors
English
bert
kortukov commited on
Commit
5f17c32
·
verified ·
1 Parent(s): 58f2a1f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -0
README.md CHANGED
@@ -1,3 +1,30 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ datasets:
4
+ - kortukov/answer-equivalence-dataset
5
+ language:
6
+ - en
7
+ pipeline_tag: text-classification
8
  ---
9
+
10
+ # Overview
11
+ BEM - BERT Matching model from paper [Tomayto, Tomahto. Beyond Token-level Answer Equivalence for Question Answering Evaluation](https://arhttps://arxiv.org/abs/2202.07654xiv.org/abs/2202.07654) (reproduction).
12
+
13
+ It is a [bert-base-uncased](https://huggingface.co/bert-base-uncased) model trained on the [Answer Equivalence dataset](https://huggingface.co/datasets/kortukov/answer-equivalence-dataset)
14
+
15
+ Consider this example (pseudocode):
16
+ ```python
17
+ question = 'how is the weather in california'
18
+ reference answer = 'infrequent rain'
19
+ candidate answer = 'rain'
20
+ bem(question, reference, candidate) ~ 0
21
+ ```
22
+
23
+ This model can be used as a metric to evaluate automatic question answering systems: when the produced answer is different from the reference, it might still be equivalent to the reference and hence count as correct.
24
+
25
+ See the paper [Tomayto, Tomahto. Beyond Token-level Answer Equivalence for Question Answering Evaluation](https://arxiv.org/abs/2202.07654) for a detailed explanation of how the data was collected and how this metric compares to others such as exact match of F1.
26
+
27
+ # Example use
28
+
29
+ TODO
30
+