Update README.md

README.md

```diff
@@ -3,7 +3,7 @@ language: en
 license: apache-2.0
 ---
 
-# Shears Model Card: shears-llama-7b-50-
+# Shears Model Card: shears-llama-7b-50-cs-super-adapter
 
 The super-adapter fine-tuned on sparsified LLaMA-7B with some commonsense reasoning datasets using Shears.
 
@@ -13,7 +13,7 @@ The release of the super-network is to facilitate users to apply their own searc
 
 ### Information
 
-- **Model name:** shears-llama-7b-50-
+- **Model name:** shears-llama-7b-50-cs-super-adapter
 - **Base model:** [IntelLabs/shears-llama-7b-50-base](https://huggingface.co/IntelLabs/shears-llama-7b-50-base)
 - **Sparsity:** 50%
 - **Domain:** Commonsense
@@ -65,7 +65,7 @@ def generate_prompt(instruction):
 """
 
 base_model = AutoModelForCausalLM.from_pretrained("IntelLabs/shears-llama-7b-50-base")
-model = PeftModel.from_pretrained(base_model, "IntelLabs/shears-llama-7b-50-
+model = PeftModel.from_pretrained(base_model, "IntelLabs/shears-llama-7b-50-cs-super-adapter")
 model.eval()
 
 non_zero_params = sum([(param.data != 0).sum().item() for _, param in model.named_parameters()])
@@ -101,7 +101,7 @@ Results of the heuristic sub-network discoverd from the super-network:
 |----------------------|-----------|---------|--------|--------|------------|--------|--------|---------|--------|----------|
 | ChatGPT              | -         | 73.1    | 85.4   | 68.5   | 78.5       | 66.1   | 89.8   | 79.9    | 74.8   | 77.0     |
 | LLaMA-7B-LoRA        | -         | 68.9    | 80.7   | 77.4   | 78.1       | 78.8   | 77.8   | 61.3    | 74.8   | 74.7     |
-| [**LLaMA-7B-Shears**](https://huggingface.co/IntelLabs/shears-llama-7b-50-
+| [**LLaMA-7B-Shears**](https://huggingface.co/IntelLabs/shears-llama-7b-50-cs-heuristic-adapter) | **50%** | 67.3 | 79.1 | 77.5 | 73.3 | 77.7 | 74.4 | 57.9 | 72.8 | 72.5 |
 
 ## Model Sources
 
```
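The README's usage snippet verifies the stated 50% sparsity by counting non-zero entries across all parameter tensors. A minimal sketch of that same counting logic, using plain Python lists with hypothetical values as stand-ins for the torch tensors that `model.named_parameters()` would yield:

```python
# Toy stand-ins for weight tensors, flattened to lists
# (hypothetical values; the model card's snippet iterates
# model.named_parameters() on real torch tensors instead).
params = {
    "dense.weight": [0.5, 0.0, 0.0, -1.2],
    "dense.bias": [0.1, 0.0],
}

# Same idea as the card's non_zero_params line: count entries != 0.
non_zero = sum(sum(1 for v in vals if v != 0) for vals in params.values())
total = sum(len(vals) for vals in params.values())
sparsity = 1 - non_zero / total

print(non_zero, total, sparsity)  # 3 6 0.5
```

Note that this measures *unstructured* sparsity of the merged weights: a tensor entry counts as active only if it is exactly zero after sparsification, which is why the card's table reports 50% against dense baselines.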