huh707 commited on
Commit
228cf23
·
1 Parent(s): c39f06b

Add vicuna prune 0.60 model

Browse files
vicuna_prune60/2024-05-01-03-26-22/description.txt ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ - Training Parameters:
2
+ - base_model: lmsys/vicuna-7b-v1.3
3
+ - save_ckpt_log_name: vicuna_prune60
4
+ - pruning_ratio: 0.6
5
+ - pruner_type: taylor
6
+ - temperature: 1.0
7
+ - top_p: 0.95
8
+ - max_seq_len: 128
9
+ - channel_wise: False
10
+ - block_wise: True
11
+ - layer_wise: False
12
+ - layer: 12
13
+ - block_attention_layer_start: 4
14
+ - block_attention_layer_end: 30
15
+ - block_mlp_layer_start: 4
16
+ - block_mlp_layer_end: 30
17
+ - iterative_steps: 1
18
+ - grouping_strategy: sum
19
+ - global_pruning: False
20
+ - taylor: param_first
21
+ - num_examples: 10
22
+ - device: cpu
23
+ - test_before_train: False
24
+ - eval_device: cuda
25
+ - test_after_train: True
26
+ - seed: 42
27
+ - save_model: True
28
+ - torch_version: 2.2
vicuna_prune60/2024-05-01-03-26-22/train.sh ADDED
@@ -0,0 +1 @@
 
 
1
+ python hf_prune.py --pruning_ratio 0.60 --block_wise --block_mlp_layer_start 4 --block_mlp_layer_end 30 --block_attention_layer_start 4 --block_attention_layer_end 30 --pruner_type taylor --test_after_train --device cpu --eval_device cuda --save_ckpt_log_name vicuna_prune60 --save_model --base_model lmsys/vicuna-7b-v1.3
vicuna_prune60/2024-05-01-03-26-22/training.log ADDED
@@ -0,0 +1,57 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2024-05-01 03:26:26 - INFO : Use taylor pruner...
2
+ 2024-05-01 03:26:26 - INFO : Pruning Attention Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
3
+ 2024-05-01 03:26:26 - INFO : Pruning MLP Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
4
+ 2024-05-01 03:26:32 - INFO : Start Pruning
5
+ 2024-05-01 03:26:43 - INFO : Start Backwarding in iterative steps = 0...
6
+ 2024-05-01 03:26:55 - INFO : Loss = 3.7534372806549072
7
+ 2024-05-01 03:27:46 - INFO : After Iter 1/1, #parameters: 3592204288
8
+ 2024-05-01 03:27:46 - INFO : #Param before: 6738415616, #Param after: 3592204288, Ratio = 53.3093%
9
+ 2024-05-01 03:28:03 - INFO :
10
+ ==================Generation Results After Pruning================
11
+
12
+ 2024-05-01 03:28:14 - INFO : <s>I believe the meaning of life is the following road, however sometimes it’s difficult to make the journey. We will continue to search and explore the unknown for a little while longer, until we find the right path to follow.”
13
+ During our conversation I recall the memory of the forest, when i walked so long in the pouring rain.
14
+ I continue my journey, the darkness of the unknown road ahead of me creating a blend of light and stars. In the distance, I can see your silhououlouout, I can sense the shake of your hand.
15
+ A light wind blows through my hair
16
+ 2024-05-01 03:28:23 - INFO : <s>Simply put, the theory of relativity states that 50% pounds off a whip, and the next at the bottom who’s gone to school.
17
+ He who does not, at the beginning learn to cut, will reap the blame.
18
+ To the brave who will stand with him.
19
+ An empty mind and a piece in hand, Stril every cheak to day a bitter truth.
20
+ This time was registered by my heart. “
21
+ And then, with tears in his eyes, he turned to me, telling the life-threiling tale.
22
+ The hard life of the poor,
23
+ 2024-05-01 03:28:33 - INFO : <s>Building a website can be done in 10 simple steps:
24
+ You want to ensure the efficiency of these actions and facilitate finding an audience. Therefore, you need to ensure that the website and Google will be looking at each other.
25
+ Before you start, take note of the people who are watching you in order to determine your mood. Whether your colleagues are at their desks, or when they suddenly find inspiration, head down, you will have to make another step towards their coffee, if they get in the way and block their colleagues in front. This will make your face and red lighting your eyes
26
+ 2024-05-01 03:28:36 - INFO : <s>Tweet: "I hate it when my phone battery dies."
27
+ Sentiment: Negative
28
+ ###
29
+ Tweet: "My day has been 👍"
30
+ Sentiment: Positive
31
+ ###
32
+ Tweet: "This is the link to the article"
33
+ Sentiment: Neutral
34
+ ###
35
+ Tweet: "This new music video was incredibile"
36
+ Sentiment:
37
+ AWE. I am amazed at how well I stay when the news isn'ters good. "Noise-free zone"
38
+ This is a major high to have control
39
+ 2024-05-01 03:28:42 - INFO : <s>Translate English to French:
40
+
41
+ sea otter => loutre de mer
42
+
43
+ peppermint => menthe poivrée
44
+
45
+ plush girafe => girafe peluche
46
+
47
+ cheese => ra Montague qui tala à chent me, i scared) – English – The Laran cels
48
+ Cry – The donkey
49
+ Ferozly – The animal cries
50
+ as the woman is lifted to the ground, and her killer growls.
51
+ The next night, a robber wearing a mask and red roat comes in and kicks
52
+ 2024-05-01 03:28:42 - INFO :
53
+ ==================Finish================
54
+
55
+ 2024-05-01 03:32:12 - INFO : PPL after pruning: {'wikitext2': 154.32526724792663, 'ptb': 507.9935496710831}
56
+ 2024-05-01 03:32:12 - INFO : Memory Requirement: 6995.72802734375 MiB
57
+
vicuna_prune60/2024-05-01-03-43-15/description.txt ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ - Training Parameters:
2
+ - base_model: lmsys/vicuna-7b-v1.3
3
+ - save_ckpt_log_name: vicuna_prune60
4
+ - pruning_ratio: 0.6
5
+ - pruner_type: taylor
6
+ - temperature: 1.0
7
+ - top_p: 0.95
8
+ - max_seq_len: 128
9
+ - channel_wise: False
10
+ - block_wise: True
11
+ - layer_wise: False
12
+ - layer: 12
13
+ - block_attention_layer_start: 4
14
+ - block_attention_layer_end: 30
15
+ - block_mlp_layer_start: 4
16
+ - block_mlp_layer_end: 30
17
+ - iterative_steps: 1
18
+ - grouping_strategy: sum
19
+ - global_pruning: False
20
+ - taylor: param_first
21
+ - num_examples: 10
22
+ - device: cpu
23
+ - test_before_train: False
24
+ - eval_device: cuda
25
+ - test_after_train: True
26
+ - seed: 42
27
+ - save_model: True
28
+ - torch_version: 2.2
vicuna_prune60/2024-05-01-03-43-15/train.sh ADDED
@@ -0,0 +1 @@
 
 
1
+ python hf_prune.py --pruning_ratio 0.60 --block_wise --block_mlp_layer_start 4 --block_mlp_layer_end 30 --block_attention_layer_start 4 --block_attention_layer_end 30 --pruner_type taylor --test_after_train --device cpu --eval_device cuda --save_ckpt_log_name vicuna_prune60 --save_model --base_model lmsys/vicuna-7b-v1.3
vicuna_prune60/2024-05-01-03-43-15/training.log ADDED
@@ -0,0 +1,57 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2024-05-01 03:43:24 - INFO : Use taylor pruner...
2
+ 2024-05-01 03:43:24 - INFO : Pruning Attention Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
3
+ 2024-05-01 03:43:24 - INFO : Pruning MLP Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
4
+ 2024-05-01 03:43:28 - INFO : Start Pruning
5
+ 2024-05-01 03:43:43 - INFO : Start Backwarding in iterative steps = 0...
6
+ 2024-05-01 03:43:54 - INFO : Loss = 3.7534372806549072
7
+ 2024-05-01 03:44:22 - INFO : After Iter 1/1, #parameters: 3592204288
8
+ 2024-05-01 03:44:23 - INFO : #Param before: 6738415616, #Param after: 3592204288, Ratio = 53.3093%
9
+ 2024-05-01 03:44:44 - INFO :
10
+ ==================Generation Results After Pruning================
11
+
12
+ 2024-05-01 03:44:53 - INFO : <s>I believe the meaning of life is the following road, however sometimes it’s difficult to make the journey. We will continue to search and explore the unknown for a little while longer, until we find the right path to follow.”
13
+ During our conversation I recall the memory of the forest, when i walked so long in the pouring rain.
14
+ I continue my journey, the darkness of the unknown road ahead of me creating a blend of light and stars. In the distance, I can see your silhououlouout, I can sense the shake of your hand.
15
+ A light wind blows through my hair
16
+ 2024-05-01 03:45:02 - INFO : <s>Simply put, the theory of relativity states that 50% pounds off a whip, and the next at the bottom who’s gone to school.
17
+ He who does not, at the beginning learn to cut, will reap the blame.
18
+ To the brave who will stand with him.
19
+ An empty mind and a piece in hand, Stril every cheak to day a bitter truth.
20
+ This time was registered by my heart. “
21
+ And then, with tears in his eyes, he turned to me, telling the life-threiling tale.
22
+ The hard life of the poor,
23
+ 2024-05-01 03:45:09 - INFO : <s>Building a website can be done in 10 simple steps:
24
+ You want to ensure the efficiency of these actions and facilitate finding an audience. Therefore, you need to ensure that the website and Google will be looking at each other.
25
+ Before you start, take note of the people who are watching you in order to determine your mood. Whether your colleagues are at their desks, or when they suddenly find inspiration, head down, you will have to make another step towards their coffee, if they get in the way and block their colleagues in front. This will make your face and red lighting your eyes
26
+ 2024-05-01 03:45:10 - INFO : <s>Tweet: "I hate it when my phone battery dies."
27
+ Sentiment: Negative
28
+ ###
29
+ Tweet: "My day has been 👍"
30
+ Sentiment: Positive
31
+ ###
32
+ Tweet: "This is the link to the article"
33
+ Sentiment: Neutral
34
+ ###
35
+ Tweet: "This new music video was incredibile"
36
+ Sentiment:
37
+ AWE. I am amazed at how well I stay when the news isn'ters good. "Noise-free zone"
38
+ This is a major high to have control
39
+ 2024-05-01 03:45:12 - INFO : <s>Translate English to French:
40
+
41
+ sea otter => loutre de mer
42
+
43
+ peppermint => menthe poivrée
44
+
45
+ plush girafe => girafe peluche
46
+
47
+ cheese => ra Montague qui tala à chent me, i scared) – English – The Laran cels
48
+ Cry – The donkey
49
+ Ferozly – The animal cries
50
+ as the woman is lifted to the ground, and her killer growls.
51
+ The next night, a robber wearing a mask and red roat comes in and kicks
52
+ 2024-05-01 03:45:12 - INFO :
53
+ ==================Finish================
54
+
55
+ 2024-05-01 03:47:39 - INFO : PPL after pruning: {'wikitext2': 154.32526724792663, 'ptb': 507.9935496710831}
56
+ 2024-05-01 03:47:39 - INFO : Memory Requirement: 6995.72802734375 MiB
57
+
vicuna_prune60/description.txt ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ - Training Parameters:
2
+ - base_model: lmsys/vicuna-7b-v1.3
3
+ - save_ckpt_log_name: vicuna_prune60
4
+ - pruning_ratio: 0.6
5
+ - pruner_type: taylor
6
+ - temperature: 1.0
7
+ - top_p: 0.95
8
+ - max_seq_len: 128
9
+ - channel_wise: False
10
+ - block_wise: True
11
+ - layer_wise: False
12
+ - layer: 12
13
+ - block_attention_layer_start: 4
14
+ - block_attention_layer_end: 30
15
+ - block_mlp_layer_start: 4
16
+ - block_mlp_layer_end: 30
17
+ - iterative_steps: 1
18
+ - grouping_strategy: sum
19
+ - global_pruning: False
20
+ - taylor: param_first
21
+ - num_examples: 10
22
+ - device: cpu
23
+ - test_before_train: False
24
+ - eval_device: cuda
25
+ - test_after_train: True
26
+ - seed: 42
27
+ - save_model: True
28
+ - torch_version: 2.2
vicuna_prune60/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18bb341f86cd97bc3de036278bb2083e34729129b5e1386ab36119a2242ac341
3
+ size 7218842434