Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
# BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials
|
2 |
|
3 |
Reference: R. Luu and M.J. Buehler, "BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials," Adv. Science, 2023, DOI: https://doi.org/10.1002/advs.202306724
|
@@ -201,7 +208,7 @@ print(response)
|
|
201 |
|
202 |
### Notes and licenses
|
203 |
|
204 |
-
|
205 |
|
206 |
Orca 2 is licensed under the Microsoft Research License (https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE).
|
207 |
|
@@ -209,9 +216,9 @@ Llama 2 is licensed under the LLAMA 2 Community License (https://ai.meta.com/lla
|
|
209 |
|
210 |
#### Bias, Risks, and Limitations
|
211 |
|
212 |
-
Like in all techniques of modeling, there are possibilities of errors. The base models Llama 2 and Orca 2 models were aligned to not spread misinformation and produce safer responses.
|
213 |
|
214 |
-
|
215 |
|
216 |
Data Biases: Large language models, trained on extensive data, can inadvertently carry biases present in the source data. Consequently, the models may generate outputs that could be potentially biased or unfair.
|
217 |
|
@@ -225,4 +232,4 @@ Hallucination: It is important to be aware and cautious not to entirely rely on
|
|
225 |
|
226 |
Potential for Misuse: Without suitable safeguards, there is a risk that these models could be maliciously used for generating disinformation or harmful content.
|
227 |
|
228 |
-
This model is solely designed for research settings, and its testing has only been carried out in such environments. It should not be used in downstream applications, as additional analysis is needed to assess potential harm or bias in the proposed application.
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
tags:
|
5 |
+
- biology
|
6 |
+
- text-generation-inference
|
7 |
+
---
|
8 |
# BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials
|
9 |
|
10 |
Reference: R. Luu and M.J. Buehler, "BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials," Adv. Science, 2023, DOI: https://doi.org/10.1002/advs.202306724
|
|
|
208 |
|
209 |
### Notes and licenses
|
210 |
|
211 |
+
BioinspiredLLM was fine-tuned based on: https://huggingface.co/microsoft/Orca-2-13b. Details see: https://onlinelibrary.wiley.com/doi/full/10.1002/advs.202306724
|
212 |
|
213 |
Orca 2 is licensed under the Microsoft Research License (https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE).
|
214 |
|
|
|
216 |
|
217 |
#### Bias, Risks, and Limitations
|
218 |
|
219 |
+
Like in all techniques of modeling, there are possibilities of errors. The base models Llama 2 and Orca 2 models were aligned to not spread misinformation and produce safer responses. As a result, BioinspiredLLM has inherited these traits and performs reasonably well in these dimensions. However, it is still of utmost importance for researchers to also verify responses and avoid propagating errors. To minimize risk of mistakes, employing chain-of-thought prompting and RAG methods, as introduced, proves beneficial. Additionally, the system prompt of BioinspiredLLM can be edited to guide context. Further details see the main paper.
|
220 |
|
221 |
+
BioinspiredLLM, built upon the LLaMA 2 and Orca-2 model family, retains many of its limitations, as well as the common limitations of other large language models or limitation caused by its training process, including:
|
222 |
|
223 |
Data Biases: Large language models, trained on extensive data, can inadvertently carry biases present in the source data. Consequently, the models may generate outputs that could be potentially biased or unfair.
|
224 |
|
|
|
232 |
|
233 |
Potential for Misuse: Without suitable safeguards, there is a risk that these models could be maliciously used for generating disinformation or harmful content.
|
234 |
|
235 |
+
This model is solely designed for research settings, and its testing has only been carried out in such environments. It should not be used in downstream applications, as additional analysis is needed to assess potential harm or bias in the proposed application.
|