yannelli commited on
Commit
243a0f5
·
verified ·
1 Parent(s): d848ca2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +97 -1
README.md CHANGED
@@ -11,4 +11,100 @@ tags:
11
  - code
12
  library_name: transformers
13
  pipeline_tag: text-generation
14
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  - code
12
  library_name: transformers
13
  pipeline_tag: text-generation
14
+ ---
15
+ # Model Card for Laravel-11-Llama-3.2-1B-Instruct
16
+
17
+ This model is a fine-tuned version of Meta's Llama 3.2 1B Instruct model, specifically tailored for Laravel 11 documentation and related queries. It provides more accurate and concise responses to Laravel 11 questions, offering step-by-step instructions for complex queries.
18
+
19
+ ## Model Details
20
+
21
+ ### Model Description
22
+
23
+ - **Developed by:** Ryan Yannelli
24
+ - **Model type:** Language model fine-tuned for Laravel 11 documentation
25
+ - **Language(s) (NLP):** English
26
+ - **License:** MIT
27
+ - **Finetuned from model:** meta-llama/Llama-3.2-1B-Instruct
28
+
29
+ ## Uses
30
+
31
+ ### Direct Use
32
+
33
+ This model is designed to assist developers with Laravel 11 related queries. It can provide quick answers to simple questions about Laravel 11 in a few sentences. For more complex questions, it offers step-by-step instructions and may ask follow-up questions for clarity.
34
+
35
+ ### Out-of-Scope Use
36
+
37
+ This model is specifically trained for Laravel 11 documentation and may not perform well on queries outside this domain. It should not be used for general-purpose language tasks or for documentation of other PHP frameworks or Laravel versions.
38
+
39
+ ## Bias, Risks, and Limitations
40
+
41
+ - The model's knowledge is limited to Laravel 11 documentation up to October 3rd, 2024.
42
+ - Given the small 1B parameter size, attention is not great with longer contexts.
43
+ - The model may not perform well on tasks outside of Laravel 11 documentation.
44
+
45
+ ### Recommendations
46
+
47
+ Users should verify important information or code snippets with official Laravel 11 documentation. The model should be used as an assistant rather than a definitive source of information.
48
+
49
+ ## How to Get Started with the Model
50
+
51
+ To get started with the model locally, you can use one of the following tools:
52
+
53
+ - LM Studio
54
+ - Jan
55
+ - vLLM
56
+ - llama.cpp
57
+
58
+ These tools allow you to run the model on your local machine. Choose the one that best fits your system requirements and preferences.
59
+
60
+ ## Training Details
61
+
62
+ ### Training Data
63
+
64
+ The model was trained on three custom datasets:
65
+ - yannelli/laravel-11-qa
66
+ - yannelli/laravel-11-qa-long-form
67
+ - yannelli/laravel-11-code-samples *(private)*
68
+
69
+ These datasets contain Laravel 11 documentation and related question-answer pairs.
70
+
71
+ ### Training Procedure
72
+
73
+ The model used a multi-stage training approach, consisting of offline data and public datasets.
74
+
75
+ #### Training Hyperparameters
76
+
77
+ - **Training regime:** SFTTrainer
78
+ - **Optimizer:** AdamW 8-Bit
79
+ - **Learning Rate Scheduler Type:** Cosine
80
+ - **Learning Rate:** 0.000095
81
+ - **GPU:** NVIDIA 3070 Ti
82
+
83
+ ## Evaluation
84
+
85
+ Evaluation results have not been published yet.
86
+
87
+ ## Environmental Impact
88
+
89
+ - **Hardware Type:** NVIDIA 3070 Ti GPU
90
+ - **Hours used:** 121
91
+ - **Infrastructure:** Private
92
+ - **Carbon Emitted:** 18.14 kg CO2 eq.
93
+
94
+ Carbon emissions were calculated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute).
95
+
96
+ ## Technical Specifications
97
+
98
+ ### Model Architecture and Objective
99
+
100
+ The model is based on the Meta Llama 3.2 1B Instruct architecture and is fine-tuned for Laravel 11 documentation tasks.
101
+
102
+ ### Compute Infrastructure
103
+
104
+ #### Hardware
105
+
106
+ NVIDIA 3070 Ti GPU
107
+
108
+ ## Model Card Authors
109
+
110
+ Ryan Yannelli