Update README.md
README.md
CHANGED
@@ -45,7 +45,7 @@ The DocuMint model can be used directly to generate high-quality docstrings for
 
 ### Fine-tuning Data
 
-The fine-tuning data consists of 100,000 Python functions and their docstrings extracted from popular open-source repositories in the
+The fine-tuning data consists of 100,000 Python functions and their docstrings extracted from popular open-source repositories in the FOSS ecosystem. Repositories were filtered based on metrics such as number of contributors (> 50), commits (> 5k), stars (> 35k), and forks (> 10k) to focus on well-established and actively maintained projects.
 
 #### Fine-tuning Hyperparameters
 
@@ -88,7 +88,7 @@ fine_tuned_model = PeftModel.from_pretrained(model, "documint/CodeGemma2B-fine-t
 
 #### Hardware
 
-Fine-tuning was performed using an Intel 12900K CPU,
+Fine-tuning was performed using an Intel 12900K CPU, a Nvidia RTX-3090 GPU, and 64 GB RAM. Total fine-tuning time was 48 GPU hours.
 
 
 ## Citation