DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 346
NILE: Internal Consistency Alignment in Large Language Models Paper • 2412.16686 • Published Dec 21, 2024 • 8
DynamicSuperb/CodeSwitchingSemanticGrammarAcceptabilityComparison_CSZS-zh-en Viewer • Updated Jul 25, 2024 • 200 • 73