IntelLabs
/

shears-llama-7b-50-cs-heuristic-adapter

Model card Files Files and versions

jinjieyuan commited on 28 days ago

Commit

16f1d90

·

verified ·

1 Parent(s): 2bc63c1

Update README.md

Files changed (1) hide show

README.md +22 -7

README.md CHANGED Viewed

@@ -133,8 +133,11 @@ print(output)
 ## Model Sources
-- **Repository:** [https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears)
-- **Paper:** [Shears: Unstructured Sparsity with Neural Low-rank Adapter Search](https://arxiv.org/abs/2404.10934)
 ## Ethical Considerations
@@ -151,11 +154,23 @@ Intel is committed to respecting human rights and avoiding causing or contributi
 ## Citation
 ```bash
-@inproceedings{munoz2024shears,
-  title = {Shears: Unstructured Sparsity with Neural Low-rank Adapter Search},
-  author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
-  booktitle={The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-2024)},
-  year={2024}
 }
 ```

 ## Model Sources
+**Repository:** [https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/Shears)
+**Paper:**
+- [Shears: Unstructured Sparsity with Neural Low-rank Adapter Search](https://arxiv.org/abs/2404.10934)
+- [Low-Rank Adapters Meet Neural Architecture Search for LLM Compression](https://arxiv.org/abs/2501.16372)
 ## Ethical Considerations
 ## Citation
 ```bash
+@inproceedings{munoz-etal-2024-shears,
+    title = "Shears: Unstructured Sparsity with Neural Low-rank Adapter Search",
+    author = "Mu{\~n}oz, J. Pablo  and
+      Yuan, Jinjie  and
+      Jain, Nilesh",
+    editor = "Yang, Yi  and
+      Davani, Aida  and
+      Sil, Avi  and
+      Kumar, Anoop",
+    booktitle = "Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 6: Industry Track)",
+    month = jun,
+    year = "2024",
+    address = "Mexico City, Mexico",
+    publisher = "Association for Computational Linguistics",
+    url = "https://aclanthology.org/2024.naacl-industry.34",
+    doi = "10.18653/v1/2024.naacl-industry.34",
+    pages = "395--405",
 }
 ```