mjbuehler commited on
Commit
5113d4f
·
1 Parent(s): 6b8db6f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -4
README.md CHANGED
@@ -1,3 +1,10 @@
 
 
 
 
 
 
 
1
  # BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials
2
 
3
  Reference: R. Luu and M.J. Buehler, "BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials," Adv. Science, 2023, DOI: https://doi.org/10.1002/advs.202306724
@@ -201,7 +208,7 @@ print(response)
201
 
202
  ### Notes and licenses
203
 
204
- This model was fine-tuned based on: https://huggingface.co/microsoft/Orca-2-13b (details in https://onlinelibrary.wiley.com/doi/full/10.1002/advs.202306724)/
205
 
206
  Orca 2 is licensed under the Microsoft Research License (https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE).
207
 
@@ -209,9 +216,9 @@ Llama 2 is licensed under the LLAMA 2 Community License (https://ai.meta.com/lla
209
 
210
  #### Bias, Risks, and Limitations
211
 
212
- Like in all techniques of modeling, there are possibilities of errors. The base models Llama 2 and Orca 2 models were aligned to not spread misinformation and produce safer responses. As a result, BioinspiredLLM has inherited these traits and performs reasonably well in these dimensions. However, it is still of utmost importance for researchers to also verify responses and avoid propagating errors, as discussed in recent literature[64] – a standard practice across all modeling techniques. To minimize risk of mistakes, employing chain-of-thought prompting and RAG methods, as introduced, proves beneficial. Additionally, the system prompt of BioinspiredLLM can be edited to guide context. Further details see the main paper.
213
 
214
- This model, built upon the LLaMA 2 and Orca-2 model family, retains many of its limitations, as well as the common limitations of other large language models or limitation caused by its training process, including:
215
 
216
  Data Biases: Large language models, trained on extensive data, can inadvertently carry biases present in the source data. Consequently, the models may generate outputs that could be potentially biased or unfair.
217
 
@@ -225,4 +232,4 @@ Hallucination: It is important to be aware and cautious not to entirely rely on
225
 
226
  Potential for Misuse: Without suitable safeguards, there is a risk that these models could be maliciously used for generating disinformation or harmful content.
227
 
228
- This model is solely designed for research settings, and its testing has only been carried out in such environments. It should not be used in downstream applications, as additional analysis is needed to assess potential harm or bias in the proposed application.
 
1
+ ---
2
+ language:
3
+ - en
4
+ tags:
5
+ - biology
6
+ - text-generation-inference
7
+ ---
8
  # BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials
9
 
10
  Reference: R. Luu and M.J. Buehler, "BioinspiredLLM: Conversational Large Language Model for the Mechanics of Biological and Bio-Inspired Materials," Adv. Science, 2023, DOI: https://doi.org/10.1002/advs.202306724
 
208
 
209
  ### Notes and licenses
210
 
211
+ BioinspiredLLM was fine-tuned based on: https://huggingface.co/microsoft/Orca-2-13b. Details see: https://onlinelibrary.wiley.com/doi/full/10.1002/advs.202306724
212
 
213
  Orca 2 is licensed under the Microsoft Research License (https://huggingface.co/microsoft/Orca-2-13b/blob/main/LICENSE).
214
 
 
216
 
217
  #### Bias, Risks, and Limitations
218
 
219
+ Like in all techniques of modeling, there are possibilities of errors. The base models Llama 2 and Orca 2 models were aligned to not spread misinformation and produce safer responses. As a result, BioinspiredLLM has inherited these traits and performs reasonably well in these dimensions. However, it is still of utmost importance for researchers to also verify responses and avoid propagating errors. To minimize risk of mistakes, employing chain-of-thought prompting and RAG methods, as introduced, proves beneficial. Additionally, the system prompt of BioinspiredLLM can be edited to guide context. Further details see the main paper.
220
 
221
+ BioinspiredLLM, built upon the LLaMA 2 and Orca-2 model family, retains many of its limitations, as well as the common limitations of other large language models or limitation caused by its training process, including:
222
 
223
  Data Biases: Large language models, trained on extensive data, can inadvertently carry biases present in the source data. Consequently, the models may generate outputs that could be potentially biased or unfair.
224
 
 
232
 
233
  Potential for Misuse: Without suitable safeguards, there is a risk that these models could be maliciously used for generating disinformation or harmful content.
234
 
235
+ This model is solely designed for research settings, and its testing has only been carried out in such environments. It should not be used in downstream applications, as additional analysis is needed to assess potential harm or bias in the proposed application.