OpenMOSE
/

RWKV-x070-2B9-CJE-Instruct

Model card Files Files and versions

OpenMOSE commited on Jan 17

Commit

6e08c30

·

verified ·

1 Parent(s): a2274c6

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ license: apache-2.0
 ## Fine-tuning Details
 ### Training Configuration
-- **Trainer**: RWKV-LM-RLHF
 - **PEFT Mode**: Hybrid Training combining frozen embeddings and Bone (Block Affine Transformation) + full parameter training
 - **SFT Method**: SmoothingLoss SFT
 - **Context Window**: 5120 (trained with 1024 token overlap)
@@ -31,12 +31,12 @@ license: apache-2.0
   - Chain-of-Thought reasoning tasks
 ### How to use
-- Install latest RWKV-Infer (Linux,WSL)
 - make folder 'models'
 ```
 curl http://127.0.0.1:9000/loadmodel -X POST -H "Content-Type: application/json" -d '{"model_filename":"models/rwkv-x070-2b9-cje-instruct-1.pth","model_viewname":"RWKV x070 2B9 CJE Instruct-1","model_strategy":"fp16","endtoken":"\\n\\n\\x17"}'
 ```
-- Enjou with openai compatible api http://127.0.0.1:9000/v1 :)
 ### Important Note
 - Set the end token as '\n\n\x17'

 ## Fine-tuning Details
 ### Training Configuration
+- **Trainer**: RWKV-LM-RLHF (https://github.com/OpenMOSE/RWKV-LM-RLHF)
 - **PEFT Mode**: Hybrid Training combining frozen embeddings and Bone (Block Affine Transformation) + full parameter training
 - **SFT Method**: SmoothingLoss SFT
 - **Context Window**: 5120 (trained with 1024 token overlap)
   - Chain-of-Thought reasoning tasks
 ### How to use
+- Install latest RWKV-Infer (Linux,WSL) (https://github.com/OpenMOSE/RWKV-Infer)
 - make folder 'models'
 ```
 curl http://127.0.0.1:9000/loadmodel -X POST -H "Content-Type: application/json" -d '{"model_filename":"models/rwkv-x070-2b9-cje-instruct-1.pth","model_viewname":"RWKV x070 2B9 CJE Instruct-1","model_strategy":"fp16","endtoken":"\\n\\n\\x17"}'
 ```
+- Enjoy with openai compatible api http://127.0.0.1:9000/v1 :)
 ### Important Note
 - Set the end token as '\n\n\x17'