belka13 commited on
Commit
5f6eaaa
·
verified ·
1 Parent(s): e6f0a2a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -137
README.md CHANGED
@@ -5,6 +5,7 @@ tags:
5
  metrics:
6
  - accuracy
7
  - perplexity
 
8
  ---
9
 
10
  # Model Card for Model ID
@@ -17,12 +18,10 @@ This is a 🤗 transformers model card that was pre-trained on a dataset built f
17
 
18
  The model is a language model pre-trained on Kotlin code from the JetBrains Kotlin repository on GitHub. It is designed to complete and understand code in these programming languages.
19
 
20
- - **Developed by:** [More Information Needed]
21
- - **Shared by [optional]:** [More Information Needed]
22
  - **Model type:** Language model
23
  - **Language(s) (NLP):** Kotlin, Python
24
- - **License:** [More Information Needed]
25
- - **Finetuned from model [optional]:** microsoft/phi-1_5
26
 
27
  ### Model Sources [optional]
28
 
@@ -30,25 +29,13 @@ The model is a language model pre-trained on Kotlin code from the JetBrains Kotl
30
 
31
  ## Uses
32
 
33
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
34
-
35
  ### Direct Use
36
 
37
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
38
-
39
- [More Information Needed]
40
-
41
- ### Downstream Use [optional]
42
-
43
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
44
-
45
- [More Information Needed]
46
 
47
  ### Out-of-Scope Use
48
 
49
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
50
-
51
- [More Information Needed]
52
 
53
  ## Bias, Risks, and Limitations
54
 
@@ -58,133 +45,21 @@ The model may have biases and limitations based on code in the JetBrains Kotlin
58
 
59
  Users should be aware of the risks, biases and limitations of the model. Model-generated code should always be tested by experienced developers before use in production. The model is best suited for auxiliary purposes, such as learning Kotlin/Python syntax and style, rather than for writing production code yourself.
60
 
61
- ## How to Get Started with the Model
62
-
63
- Use the code below to get started with the model.
64
-
65
- [More Information Needed]
66
-
67
- ## Training Details
68
-
69
- ### Training Data
70
-
71
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
72
-
73
- [More Information Needed]
74
-
75
- ### Training Procedure
76
-
77
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
78
-
79
- #### Preprocessing [optional]
80
-
81
- [More Information Needed]
82
-
83
-
84
- #### Training Hyperparameters
85
-
86
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
87
-
88
- #### Speeds, Sizes, Times [optional]
89
-
90
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
91
-
92
- [More Information Needed]
93
-
94
  ## Evaluation
95
 
96
- <!-- This section describes the evaluation protocols and provides the results. -->
97
-
98
  ### Testing Data, Factors & Metrics
99
 
100
- #### Testing Data
101
-
102
- <!-- This should link to a Dataset Card if possible. -->
103
-
104
- [More Information Needed]
105
-
106
- #### Factors
107
-
108
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
109
-
110
- [More Information Needed]
111
-
112
  #### Metrics
113
 
114
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
115
-
116
- [More Information Needed]
117
-
118
- ### Results
119
-
120
- [More Information Needed]
121
-
122
- #### Summary
123
-
124
-
125
-
126
- ## Model Examination [optional]
127
-
128
- <!-- Relevant interpretability work for the model goes here -->
129
-
130
- [More Information Needed]
131
-
132
- ## Environmental Impact
133
-
134
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
135
-
136
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
137
-
138
- - **Hardware Type:** [More Information Needed]
139
- - **Hours used:** [More Information Needed]
140
- - **Cloud Provider:** [More Information Needed]
141
- - **Compute Region:** [More Information Needed]
142
- - **Carbon Emitted:** [More Information Needed]
143
-
144
- ## Technical Specifications [optional]
145
-
146
- ### Model Architecture and Objective
147
-
148
- [More Information Needed]
149
-
150
- ### Compute Infrastructure
151
-
152
- [More Information Needed]
153
-
154
- #### Hardware
155
-
156
- [More Information Needed]
157
-
158
- #### Software
159
-
160
- [More Information Needed]
161
-
162
- ## Citation [optional]
163
-
164
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
165
-
166
- **BibTeX:**
167
-
168
- [More Information Needed]
169
-
170
- **APA:**
171
-
172
- [More Information Needed]
173
-
174
- ## Glossary [optional]
175
-
176
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
177
-
178
- [More Information Needed]
179
-
180
- ## More Information [optional]
181
 
182
- [More Information Needed]
 
 
 
 
 
183
 
184
- ## Model Card Authors [optional]
185
 
186
- [More Information Needed]
187
 
188
- ## Model Card Contact
189
 
190
- [More Information Needed]
 
5
  metrics:
6
  - accuracy
7
  - perplexity
8
+ license: mit
9
  ---
10
 
11
  # Model Card for Model ID
 
18
 
19
  The model is a language model pre-trained on Kotlin code from the JetBrains Kotlin repository on GitHub. It is designed to complete and understand code in these programming languages.
20
 
21
+ - **Developed by:** https://huggingface.co/belka13
 
22
  - **Model type:** Language model
23
  - **Language(s) (NLP):** Kotlin, Python
24
+ - **Finetuned from model:** microsoft/phi-1_5
 
25
 
26
  ### Model Sources [optional]
27
 
 
29
 
30
  ## Uses
31
 
 
 
32
  ### Direct Use
33
 
34
+ The model can be used to augment Kotlin and Python code, finding bugs and style inconsistencies in existing code. It can also help in understanding and documenting code bases.
 
 
 
 
 
 
 
 
35
 
36
  ### Out-of-Scope Use
37
 
38
+ The model is not intended to generate arbitrary text or code in programming languages other than Kotlin and Python. It should also not be used to automatically make changes to production code without human review.
 
 
39
 
40
  ## Bias, Risks, and Limitations
41
 
 
45
 
46
  Users should be aware of the risks, biases and limitations of the model. Model-generated code should always be tested by experienced developers before use in production. The model is best suited for auxiliary purposes, such as learning Kotlin/Python syntax and style, rather than for writing production code yourself.
47
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
48
  ## Evaluation
49
 
 
 
50
  ### Testing Data, Factors & Metrics
51
 
 
 
 
 
 
 
 
 
 
 
 
 
52
  #### Metrics
53
 
54
+ The following metrics were used to evaluate the model in Kotlin and Python:
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
55
 
56
+ | Model | Dataset | Accuracy | Perplexity |
57
+ |----------------|---------|----------|------------|
58
+ | Default | Kotlin | 0.0031 | 94.78 |
59
+ | | Python | 0.0067 | 6.67 |
60
+ | Fine-tuned | Kotlin | 0.0096 | 62.82 |
61
+ | | Python | 0.0180 | 5.71 |
62
 
 
63
 
 
64
 
 
65