QQhahaha commited on
Commit
659b2d3
·
1 Parent(s): 1192454

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -0
README.md ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Text Summarization
2
+ This is a assignment of Applied Deep Learning which is a course of National Taiwan University(NTU).
3
+ ### Task Description:Chinese News Summarization (Title Generation)
4
+ input(news content):
5
+ ```
6
+ 從小就很會念書的李悅寧, 在眾人殷殷期盼下,以榜首之姿進入臺大醫學院, 但始終忘不了對天文的熱情。大學四年級一場遠行後,她決心遠赴法國攻讀天文博士。 從小沒想過當老師的她,再度跌破眾人眼鏡返台任教,......
7
+ ```
8
+ output(news title):
9
+ ```
10
+ 榜首進台大醫科卻休學 、27歲拿到法國天文博士 李悅寧跌破眾人眼鏡返台任教
11
+ ```
12
+ ### Objective
13
+ - Fine-tune a pre-trained model:[google/mt5-small](https://huggingface.co/google/mt5-small) to pass the baseline.
14
+ - Compare the difference between beam search, top k sampling, top p sampling, temperature.
15
+ ```
16
+ Baseline(f1-score):rouge-1: 22.0, rouge-2: 8.5, rouge-L: 20.5
17
+ ```
18
+ ### Experiments
19
+ - Greedy
20
+ After the model generate the probility of every token as result, Greedy is the simplest way to choose the next word with most probable word(argmax).
21
+ However, there is a problem that it's easy to choose the duplicate word with Greedy strategy.
22
+ ```
23
+ Greedy Result(f1-score):rouge-1: 15.7, rouge-2: 4.9, rouge-L: 14.8
24
+ ```
25
+ - Beam Search
26
+ Beam Search strategy is keeping track of the k most probable sentences and finding the best one as a result.
27
+ Therefore, if beam size is setting as 1, it becomes Greedy. We can say that beam search kind of solves the problem of Greedy.
28
+ However, if beam size is too large, the result will turn into too generic and less relevant though the result is safe and "correct".
29
+ For example
30
+ ```
31
+ I love to listen Taylor Swift's songs so I decide to participate the concert of Taylor.
32
+ ```
33
+ - Top k Sampling
34
+
35
+ - Top p Sampling
36
+
37
+ - Temperature
38
+