Leo's picture

Leo

elephant-bear

AI & ML interests

None yet

Recent Activity

Organizations

None yet

elephant-bear's activity

view reply

True, but it seems like there’s nothing to be evaluated as of right now. I assume the ultimate goal is to train a new reasoning model and then use the same evaluation metrics as o1 and the DeepSeek-R1.