Ray2333/gpt2-large-harmless-reward_model

Text Classification

text-generation-inference

Inference Endpoints

Ray2333 committed on Jan 15, 2024

Commit 70ba1c0 (verified) · 1 Parent(s): 2f4c969

Update README.md

Files changed (1)

README.md +1 -1

@@ -6,7 +6,7 @@ metrics:
 - accuracy
 ---
-GPT2 large model trained on Anthropic/hh-rlhf harmless dataset. It is specifically used for harmful response detection.
+GPT2 large model trained on Anthropic/hh-rlhf harmless dataset. It is specifically used for harmful response detection or RLHF.
 It achieves an accuracy of 0.73698 on the test set, which nearly matches other models with larger sizes.
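
The README reports an accuracy figure without defining it on this page. For reward models trained on preference pairs such as Anthropic/hh-rlhf, accuracy is commonly the fraction of test pairs in which the model scores the chosen response higher than the rejected one. The following is a minimal sketch under that assumption; the function name `pairwise_accuracy` and the example scores are hypothetical and do not come from the model card.

```python
from typing import Sequence


def pairwise_accuracy(
    chosen_rewards: Sequence[float],
    rejected_rewards: Sequence[float],
) -> float:
    """Return the fraction of pairs where the chosen response scores strictly higher.

    Assumption: each index i holds the reward model's scalar scores for the
    chosen and rejected responses of the same preference pair.
    """
    if len(chosen_rewards) != len(rejected_rewards):
        raise ValueError("chosen and rejected score lists must have equal length")
    if len(chosen_rewards) == 0:
        raise ValueError("at least one preference pair is required")

    # A pair counts as correct only when the chosen score is strictly higher.
    correct = sum(c > r for c, r in zip(chosen_rewards, rejected_rewards))
    return correct / len(chosen_rewards)


# Hypothetical reward scores for four preference pairs (illustration only).
chosen = [1.2, 0.3, -0.5, 2.0]
rejected = [0.4, 0.9, -1.1, 1.5]

accuracy = pairwise_accuracy(chosen, rejected)
print(accuracy)  # 3 of 4 pairs are ranked correctly -> 0.75
```

In practice the two score lists would come from running the reward model over the chosen and rejected responses of a held-out preference set.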