dennlinger commited on
Commit
d185ff4
·
1 Parent(s): 5923eb8

Include slight sectioning.

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -1,6 +1,10 @@
1
- This model has been fine-tuned for the task described in the paper *Topical Change Detection in Documents via Embeddings of Long Sequences* and is our best-performing base-transformer model. You can find more detailed information in our GitHub page for the paper [here](https://github.com/dennlinger/TopicalChange), or read the [paper itself](https://arxiv.org/abs/2012.03619).
 
2
 
 
3
  The training task is to determine whether two text segments (paragraphs) belong to the same topical section or not. This can be utilized to create a topical segmentation of a document by consecutively predicting the "togetherness" of two models.
4
 
 
 
5
 
6
  Note that this model is *not* trained to work on classifying single texts, but only works with two (separated) inputs.
 
1
+ # About this model: Topical Change Detection in Documents
2
+ This model has been fine-tuned for the task described in the paper *Topical Change Detection in Documents via Embeddings of Long Sequences* and is our best-performing base-transformer model. You can find more detailed information in our GitHub page for the paper [here](https://github.com/dennlinger/TopicalChange), or read the [paper itself](https://arxiv.org/abs/2012.03619). The weights are based on RoBERTa-base.
3
 
4
+ # Training objective
5
  The training task is to determine whether two text segments (paragraphs) belong to the same topical section or not. This can be utilized to create a topical segmentation of a document by consecutively predicting the "togetherness" of two models.
6
 
7
+ # Performance
8
+ The results of this model can be found in the paper. We average over models from five different random seeds, which is why the specific results for this model might be different from the exact values in the paper.
9
 
10
  Note that this model is *not* trained to work on classifying single texts, but only works with two (separated) inputs.