Add vicuna prune 0.60 model
- vicuna_prune60/2024-05-01-03-26-22/description.txt +28 -0
- vicuna_prune60/2024-05-01-03-26-22/train.sh +1 -0
- vicuna_prune60/2024-05-01-03-26-22/training.log +57 -0
- vicuna_prune60/2024-05-01-03-43-15/description.txt +28 -0
- vicuna_prune60/2024-05-01-03-43-15/train.sh +1 -0
- vicuna_prune60/2024-05-01-03-43-15/training.log +57 -0
- vicuna_prune60/description.txt +28 -0
- vicuna_prune60/pytorch_model.bin +3 -0
vicuna_prune60/2024-05-01-03-26-22/description.txt
ADDED
@@ -0,0 +1,28 @@
+- Training Parameters:
+- base_model: lmsys/vicuna-7b-v1.3
+- save_ckpt_log_name: vicuna_prune60
+- pruning_ratio: 0.6
+- pruner_type: taylor
+- temperature: 1.0
+- top_p: 0.95
+- max_seq_len: 128
+- channel_wise: False
+- block_wise: True
+- layer_wise: False
+- layer: 12
+- block_attention_layer_start: 4
+- block_attention_layer_end: 30
+- block_mlp_layer_start: 4
+- block_mlp_layer_end: 30
+- iterative_steps: 1
+- grouping_strategy: sum
+- global_pruning: False
+- taylor: param_first
+- num_examples: 10
+- device: cpu
+- test_before_train: False
+- eval_device: cuda
+- test_after_train: True
+- seed: 42
+- save_model: True
+- torch_version: 2.2
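The layer bounds in this description appear to be half-open: the training log for the same run lists layers 4 through 29 for both attention and MLP pruning, which is what `range(4, 30)` produces. A minimal sketch of that reading follows; the helper name `pruned_layers` is hypothetical and is not taken from `hf_prune.py`.

```python
def pruned_layers(start: int, end: int) -> list[int]:
    """Return the layer indices in [start, end), assuming a half-open range."""
    return list(range(start, end))


# Values taken from description.txt (block_*_layer_start / block_*_layer_end).
attention_layers = pruned_layers(4, 30)
mlp_layers = pruned_layers(4, 30)

# First index, last index, and number of layers selected for pruning.
print(attention_layers[0], attention_layers[-1], len(attention_layers))
```

Under this assumption, 26 decoder layers are pruned and the layers before index 4 and from index 30 onward are left untouched.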
vicuna_prune60/2024-05-01-03-26-22/train.sh
ADDED
@@ -0,0 +1 @@
+python hf_prune.py --pruning_ratio 0.60 --block_wise --block_mlp_layer_start 4 --block_mlp_layer_end 30 --block_attention_layer_start 4 --block_attention_layer_end 30 --pruner_type taylor --test_after_train --device cpu --eval_device cuda --save_ckpt_log_name vicuna_prune60 --save_model --base_model lmsys/vicuna-7b-v1.3
vicuna_prune60/2024-05-01-03-26-22/training.log
ADDED
@@ -0,0 +1,57 @@
+2024-05-01 03:26:26 - INFO : Use taylor pruner...
+2024-05-01 03:26:26 - INFO : Pruning Attention Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
+2024-05-01 03:26:26 - INFO : Pruning MLP Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
+2024-05-01 03:26:32 - INFO : Start Pruning
+2024-05-01 03:26:43 - INFO : Start Backwarding in iterative steps = 0...
+2024-05-01 03:26:55 - INFO : Loss = 3.7534372806549072
+2024-05-01 03:27:46 - INFO : After Iter 1/1, #parameters: 3592204288
+2024-05-01 03:27:46 - INFO : #Param before: 6738415616, #Param after: 3592204288, Ratio = 53.3093%
+2024-05-01 03:28:03 - INFO :
+==================Generation Results After Pruning================
+
+2024-05-01 03:28:14 - INFO : <s>I believe the meaning of life is the following road, however sometimes it’s difficult to make the journey. We will continue to search and explore the unknown for a little while longer, until we find the right path to follow.”
+During our conversation I recall the memory of the forest, when i walked so long in the pouring rain.
+I continue my journey, the darkness of the unknown road ahead of me creating a blend of light and stars. In the distance, I can see your silhououlouout, I can sense the shake of your hand.
+A light wind blows through my hair
+2024-05-01 03:28:23 - INFO : <s>Simply put, the theory of relativity states that 50% pounds off a whip, and the next at the bottom who’s gone to school.
+He who does not, at the beginning learn to cut, will reap the blame.
+To the brave who will stand with him.
+An empty mind and a piece in hand, Stril every cheak to day a bitter truth.
+This time was registered by my heart. “
+And then, with tears in his eyes, he turned to me, telling the life-threiling tale.
+The hard life of the poor,
+2024-05-01 03:28:33 - INFO : <s>Building a website can be done in 10 simple steps:
+You want to ensure the efficiency of these actions and facilitate finding an audience. Therefore, you need to ensure that the website and Google will be looking at each other.
+Before you start, take note of the people who are watching you in order to determine your mood. Whether your colleagues are at their desks, or when they suddenly find inspiration, head down, you will have to make another step towards their coffee, if they get in the way and block their colleagues in front. This will make your face and red lighting your eyes
+2024-05-01 03:28:36 - INFO : <s>Tweet: "I hate it when my phone battery dies."
+Sentiment: Negative
+###
+Tweet: "My day has been 👍"
+Sentiment: Positive
+###
+Tweet: "This is the link to the article"
+Sentiment: Neutral
+###
+Tweet: "This new music video was incredibile"
+Sentiment:
+AWE. I am amazed at how well I stay when the news isn'ters good. "Noise-free zone"
+This is a major high to have control
+2024-05-01 03:28:42 - INFO : <s>Translate English to French:
+
+sea otter => loutre de mer
+
+peppermint => menthe poivrée
+
+plush girafe => girafe peluche
+
+cheese => ra Montague qui tala à chent me, i scared) – English – The Laran cels
+Cry – The donkey
+Ferozly – The animal cries
+as the woman is lifted to the ground, and her killer growls.
+The next night, a robber wearing a mask and red roat comes in and kicks
+2024-05-01 03:28:42 - INFO :
+==================Finish================
+
+2024-05-01 03:32:12 - INFO : PPL after pruning: {'wikitext2': 154.32526724792663, 'ptb': 507.9935496710831}
+2024-05-01 03:32:12 - INFO : Memory Requirement: 6995.72802734375 MiB
+
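The `Ratio` value in this log is the fraction of parameters that remain after pruning, not the fraction removed. A minimal sketch that recomputes it from the two logged parameter counts and parses the logged perplexity dictionary follows; the variable names are illustrative only, and the string handling assumes the log line format shown in this run.

```python
import ast

# Parameter counts copied from the "#Param before / #Param after" log line.
params_before = 6_738_415_616
params_after = 3_592_204_288

remaining = params_after / params_before
print(f"remaining: {remaining:.4%}")      # fraction of parameters kept
print(f"removed:   {1 - remaining:.4%}")  # fraction of parameters pruned

# The PPL line ends in a Python dict literal, so ast.literal_eval can read it.
ppl_line = "PPL after pruning: {'wikitext2': 154.32526724792663, 'ptb': 507.9935496710831}"
ppl = ast.literal_eval(ppl_line.split(": ", 1)[1])
for dataset, value in ppl.items():
    print(f"{dataset}: {value:.2f}")
```

The removed fraction comes out below the nominal `pruning_ratio` of 0.6, presumably because the ratio applies only to the blocks in layers 4 through 29 while the remaining layers and embeddings keep their full size.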
vicuna_prune60/2024-05-01-03-43-15/description.txt
ADDED
@@ -0,0 +1,28 @@
+- Training Parameters:
+- base_model: lmsys/vicuna-7b-v1.3
+- save_ckpt_log_name: vicuna_prune60
+- pruning_ratio: 0.6
+- pruner_type: taylor
+- temperature: 1.0
+- top_p: 0.95
+- max_seq_len: 128
+- channel_wise: False
+- block_wise: True
+- layer_wise: False
+- layer: 12
+- block_attention_layer_start: 4
+- block_attention_layer_end: 30
+- block_mlp_layer_start: 4
+- block_mlp_layer_end: 30
+- iterative_steps: 1
+- grouping_strategy: sum
+- global_pruning: False
+- taylor: param_first
+- num_examples: 10
+- device: cpu
+- test_before_train: False
+- eval_device: cuda
+- test_after_train: True
+- seed: 42
+- save_model: True
+- torch_version: 2.2
vicuna_prune60/2024-05-01-03-43-15/train.sh
ADDED
@@ -0,0 +1 @@
+python hf_prune.py --pruning_ratio 0.60 --block_wise --block_mlp_layer_start 4 --block_mlp_layer_end 30 --block_attention_layer_start 4 --block_attention_layer_end 30 --pruner_type taylor --test_after_train --device cpu --eval_device cuda --save_ckpt_log_name vicuna_prune60 --save_model --base_model lmsys/vicuna-7b-v1.3
vicuna_prune60/2024-05-01-03-43-15/training.log
ADDED
@@ -0,0 +1,57 @@
+2024-05-01 03:43:24 - INFO : Use taylor pruner...
+2024-05-01 03:43:24 - INFO : Pruning Attention Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
+2024-05-01 03:43:24 - INFO : Pruning MLP Layer = [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29]
+2024-05-01 03:43:28 - INFO : Start Pruning
+2024-05-01 03:43:43 - INFO : Start Backwarding in iterative steps = 0...
+2024-05-01 03:43:54 - INFO : Loss = 3.7534372806549072
+2024-05-01 03:44:22 - INFO : After Iter 1/1, #parameters: 3592204288
+2024-05-01 03:44:23 - INFO : #Param before: 6738415616, #Param after: 3592204288, Ratio = 53.3093%
+2024-05-01 03:44:44 - INFO :
+==================Generation Results After Pruning================
+
+2024-05-01 03:44:53 - INFO : <s>I believe the meaning of life is the following road, however sometimes it’s difficult to make the journey. We will continue to search and explore the unknown for a little while longer, until we find the right path to follow.”
+During our conversation I recall the memory of the forest, when i walked so long in the pouring rain.
+I continue my journey, the darkness of the unknown road ahead of me creating a blend of light and stars. In the distance, I can see your silhououlouout, I can sense the shake of your hand.
+A light wind blows through my hair
+2024-05-01 03:45:02 - INFO : <s>Simply put, the theory of relativity states that 50% pounds off a whip, and the next at the bottom who’s gone to school.
+He who does not, at the beginning learn to cut, will reap the blame.
+To the brave who will stand with him.
+An empty mind and a piece in hand, Stril every cheak to day a bitter truth.
+This time was registered by my heart. “
+And then, with tears in his eyes, he turned to me, telling the life-threiling tale.
+The hard life of the poor,
+2024-05-01 03:45:09 - INFO : <s>Building a website can be done in 10 simple steps:
+You want to ensure the efficiency of these actions and facilitate finding an audience. Therefore, you need to ensure that the website and Google will be looking at each other.
+Before you start, take note of the people who are watching you in order to determine your mood. Whether your colleagues are at their desks, or when they suddenly find inspiration, head down, you will have to make another step towards their coffee, if they get in the way and block their colleagues in front. This will make your face and red lighting your eyes
+2024-05-01 03:45:10 - INFO : <s>Tweet: "I hate it when my phone battery dies."
+Sentiment: Negative
+###
+Tweet: "My day has been 👍"
+Sentiment: Positive
+###
+Tweet: "This is the link to the article"
+Sentiment: Neutral
+###
+Tweet: "This new music video was incredibile"
+Sentiment:
+AWE. I am amazed at how well I stay when the news isn'ters good. "Noise-free zone"
+This is a major high to have control
+2024-05-01 03:45:12 - INFO : <s>Translate English to French:
+
+sea otter => loutre de mer
+
+peppermint => menthe poivrée
+
+plush girafe => girafe peluche
+
+cheese => ra Montague qui tala à chent me, i scared) – English – The Laran cels
+Cry – The donkey
+Ferozly – The animal cries
+as the woman is lifted to the ground, and her killer growls.
+The next night, a robber wearing a mask and red roat comes in and kicks
+2024-05-01 03:45:12 - INFO :
+==================Finish================
+
+2024-05-01 03:47:39 - INFO : PPL after pruning: {'wikitext2': 154.32526724792663, 'ptb': 507.9935496710831}
+2024-05-01 03:47:39 - INFO : Memory Requirement: 6995.72802734375 MiB
+
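Both runs in this commit use seed 42 and log the same loss, parameter counts, generations, and perplexities; only the timestamps differ. One way to check that on the full log files is to strip the timestamp prefix and compare the remaining lines. The sketch below does this on two short excerpts copied from the logs; the helper name `strip_timestamps` and the regular expression are assumptions about the log format, not part of the pruning script.

```python
import re

# Matches the "YYYY-MM-DD HH:MM:SS - " prefix seen in these logs.
TIMESTAMP = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2} - ")


def strip_timestamps(log_text: str) -> list[str]:
    """Drop the leading timestamp from each line, leaving the message."""
    return [TIMESTAMP.sub("", line) for line in log_text.splitlines()]


# Excerpts from the 03-26-22 and 03-43-15 runs respectively.
run_a = (
    "2024-05-01 03:26:55 - INFO : Loss = 3.7534372806549072\n"
    "2024-05-01 03:27:46 - INFO : After Iter 1/1, #parameters: 3592204288"
)
run_b = (
    "2024-05-01 03:43:54 - INFO : Loss = 3.7534372806549072\n"
    "2024-05-01 03:44:22 - INFO : After Iter 1/1, #parameters: 3592204288"
)

print(strip_timestamps(run_a) == strip_timestamps(run_b))
```

Lines without a timestamp, such as the generated text, pass through unchanged, so the same comparison should work on the complete files.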
vicuna_prune60/description.txt
ADDED
@@ -0,0 +1,28 @@
+- Training Parameters:
+- base_model: lmsys/vicuna-7b-v1.3
+- save_ckpt_log_name: vicuna_prune60
+- pruning_ratio: 0.6
+- pruner_type: taylor
+- temperature: 1.0
+- top_p: 0.95
+- max_seq_len: 128
+- channel_wise: False
+- block_wise: True
+- layer_wise: False
+- layer: 12
+- block_attention_layer_start: 4
+- block_attention_layer_end: 30
+- block_mlp_layer_start: 4
+- block_mlp_layer_end: 30
+- iterative_steps: 1
+- grouping_strategy: sum
+- global_pruning: False
+- taylor: param_first
+- num_examples: 10
+- device: cpu
+- test_before_train: False
+- eval_device: cuda
+- test_after_train: True
+- seed: 42
+- save_model: True
+- torch_version: 2.2
vicuna_prune60/pytorch_model.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:18bb341f86cd97bc3de036278bb2083e34729129b5e1386ab36119a2242ac341
+size 7218842434
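The three lines added for `pytorch_model.bin` are a Git LFS pointer rather than the weights themselves: they record the spec version, the SHA-256 of the real file, and its size in bytes. A minimal sketch of how a downloaded checkpoint could be checked against such a pointer follows; the function names are hypothetical, and the local file path passed to `matches_pointer` is whatever the reader has downloaded.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict[str, str]:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    return dict(line.split(" ", 1) for line in text.strip().splitlines())


def matches_pointer(path: str, pointer: dict[str, str]) -> bool:
    """Hash a local file in chunks and compare digest and size to the pointer."""
    algorithm, _, expected_digest = pointer["oid"].partition(":")
    hasher = hashlib.new(algorithm)
    size = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return hasher.hexdigest() == expected_digest and size == int(pointer["size"])


pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:18bb341f86cd97bc3de036278bb2083e34729129b5e1386ab36119a2242ac341
size 7218842434
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["oid"], pointer["size"])
```

Calling `matches_pointer("pytorch_model.bin", pointer)` on a local copy would then report whether the download is complete and uncorrupted.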