ShelterW
ShelterW
·
AI & ML interests
None yet
Recent Activity
published
a model
about 18 hours ago
ShelterW/Light-R1-7B-DS-AWQ
updated
a model
16 days ago
ShelterW/TinyR1-32B-Preview-AWQ
published
a model
16 days ago
ShelterW/TinyR1-32B-Preview-AWQ
Organizations
None yet
ShelterW's activity
If the response length exceeds 4096, is a sliding window used, or is it simply truncated?
#6 opened about 2 months ago
by
ShelterW
"<extra_0>" is not special token ? I got 5 token_ids ,is it right?
5
#4 opened about 2 months ago
by
ShelterW
What is the accuracy of the Skywork/Skywork-Reward-Gemma-2-27B-v0.2? How much is the correct sample of 273K?
1
#5 opened about 2 months ago
by
ShelterW
reward is None
1
#3 opened 2 months ago
by
ShelterW
hidden state is nan
1
#2 opened 7 months ago
by
ShelterW
Update README.md
#1 opened almost 2 years ago
by
ShelterW
Update README.md
#1 opened almost 2 years ago
by
ShelterW
Update README.md
#1 opened almost 2 years ago
by
ShelterW