Mykes committed
Commit f9fda8c · verified · 1 Parent(s): 857059d

Update README.md

Files changed (1):
1. README.md: +42 -2

README.md CHANGED

Previous version (the two changed hunks):

@@ -1,14 +1,19 @@
---
base_model: unsloth/gemma-7b-bnb-4bit
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- gemma
- gguf
---

# Uploaded model

@@ -20,3 +25,38 @@ tags:
This gemma model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

Updated version:

---
base_model: unsloth/gemma-7b-bnb-4bit
language:
- ru
license: apache-2.0
pipeline_tag: text-generation
tags:
- text-generation-inference
- transformers
- unsloth
- gemma
- trl
- sft
- chemistry
- biology
- medical
---

# Uploaded model

[… unchanged lines 20-24 not shown in the diff …]

This gemma model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

## Supervised Fine-tuning of gemma-7b

The model was fine-tuned with the Unsloth library, starting from unsloth/gemma-7b-bnb-4bit, on the rus_med_dialogues dataset: 10 epochs (2,560 training steps) on a single RTX 4090. Training took about 35 minutes, and the final training loss was 0.0939.
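
A minimal training sketch of this setup is shown below. It follows the standard Unsloth + TRL `SFTTrainer` recipe from the 2024 notebooks; the dataset repo id, column names, prompt formatting, LoRA settings, batch size, and learning rate are assumptions, not values reported in this card.

```python
# Hedged sketch of the fine-tuning setup described above (not the exact script).
# Assumptions are marked in comments: dataset id, column names, LoRA/optimizer settings.
from unsloth import FastLanguageModel
from datasets import load_dataset
from trl import SFTTrainer
from transformers import TrainingArguments

max_seq_length = 2048  # assumption

# 4-bit base model named in the card.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gemma-7b-bnb-4bit",
    max_seq_length=max_seq_length,
    load_in_4bit=True,
)

# Attach LoRA adapters; rank/alpha/target modules are placeholders.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# Dataset id and column names are guesses based on the card.
dataset = load_dataset("Mykes/rus_med_dialogues", split="train")

def to_text(example):
    # Hypothetical mapping into the prompt template shown in the examples below.
    return {
        "text": (
            "Below is the context which is your conversation history and the last user question. "
            "Write a response according the context and question.\n"
            f"### Context: {example['context']}\n"
            f"### Question: {example['question']}\n"
            f"### Response: {example['answer']}" + tokenizer.eos_token
        )
    }

dataset = dataset.map(to_text)

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,                # trl<=0.8-style arguments, as in the Unsloth notebooks
    train_dataset=dataset,
    dataset_text_field="text",
    max_seq_length=max_seq_length,
    args=TrainingArguments(
        output_dir="outputs",
        num_train_epochs=10,            # reported in the card
        per_device_train_batch_size=2,  # placeholder
        gradient_accumulation_steps=4,  # placeholder
        learning_rate=2e-4,             # placeholder
        bf16=True,                      # bfloat16, matching the environment below
        logging_steps=10,
    ),
)

trainer.train()
```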

## Training environment

`Unsloth: Fast Gemma patching release 2024.7`
`GPU: NVIDIA GeForce RTX 4090. Max memory: 23.643 GB. Platform = Linux.`
`Pytorch: 2.2.0+cu121. CUDA = 8.9. CUDA Toolkit = 12.1.`
`Bfloat16 = TRUE. FA [Xformers = 0.0.24. FA2 = True]`
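
The lines above are the environment banner that Unsloth prints when the base model is loaded. The same facts can be checked on another machine with generic PyTorch/xformers calls; this snippet is illustrative and not part of the original card:

```python
# Generic environment check; prints the same kind of information as the banner above.
import platform
import torch
import xformers

print("Pytorch:", torch.__version__, "| CUDA Toolkit:", torch.version.cuda)
print("GPU:", torch.cuda.get_device_name(0))
print("Max memory: %.3f GB" % (torch.cuda.get_device_properties(0).total_memory / 1024**3))
print("CUDA capability:", ".".join(map(str, torch.cuda.get_device_capability(0))))
print("Bfloat16 supported:", torch.cuda.is_bf16_supported())
print("Xformers:", xformers.__version__, "| Platform:", platform.system())
```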

### Example of an answer before fine-tuning:

**input:**
*<bos>Below is the context which is your conversation history and the last user question. Write a response according the context and question.*
*### Context: user: Недавно мне делали операцию, а рана до сих пор не заживает? assistant: Я понимаю, что это может быть неприятно. Сколько времени прошло с момента операции.*
*### Question: Месяц. Это сильно плохо?*
*(Translation of the context: "user: I recently had surgery, and the wound still hasn't healed? assistant: I understand that this can be unpleasant. How much time has passed since the surgery." Question: "A month. Is that very bad?")*

**output:**
`### Response: 1. I'm sorry to hear that you're still experiencing pain from your recent surgery. 2. It's understandable that this can be a difficult situation to be in. 3. I'd like to help you find a solution to this problem. 4. Have you considered trying a different`

### After 10 epochs:

**input:**
*<bos>Below is the context which is your conversation history and the last user question. Write a response according the context and question.*
*### Context: user: Недавно мне делали операцию, а рана до сих пор не заживает? assistant: Я понимаю, что это может быть неприятно. Сколько времени прошло с момента операции.*
*### Question: Месяц. Это сильно плохо?*

**output:**
`### Response: 1 месяц - это не так уж и много. В среднем раны заживают за 2-3 месяца. Но если Вы чувствуете сильную боль или не литров, обязательно обратитесь к своему хирургу для проверки.`
*(Translation: "One month is not that long. On average, wounds heal in 2-3 months. But if you feel severe pain or [garbled text], be sure to contact your surgeon for a check-up.")*
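
To reproduce the comparison above, the prompt must be assembled in the same template (`### Context:` / `### Question:` / `### Response:`). A hedged inference sketch follows; the Hub repo id is a placeholder for this model's actual path:

```python
# Hedged inference sketch using the prompt template from the examples above.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="YOUR_USERNAME/THIS_MODEL",  # placeholder; substitute the real Hub repo id
    max_seq_length=2048,
    load_in_4bit=True,
)
FastLanguageModel.for_inference(model)  # enable Unsloth's faster generation path

# The Gemma tokenizer prepends <bos> automatically, matching the examples above.
prompt = (
    "Below is the context which is your conversation history and the last user question. "
    "Write a response according the context and question.\n"
    "### Context: user: Недавно мне делали операцию, а рана до сих пор не заживает? "
    "assistant: Я понимаю, что это может быть неприятно. Сколько времени прошло с момента операции.\n"
    "### Question: Месяц. Это сильно плохо?\n"
    "### Response:"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```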

# Disclaimer

The data provided is solely for research purposes. It may include inaccuracies, potentially harmful content, or biases, and we disclaim any responsibility for these issues. Users should assess the data's accuracy and appropriateness before utilizing it. The opinions and views represented in the data do not necessarily align with those of the author.