lixiaonan commited on
Commit
176ffe9
·
verified ·
1 Parent(s): 403a1b3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -1,3 +1,8 @@
 
 
 
 
 
1
  Reka Flash 3.1 is a 21B general-purpose reasoning model that was trained from scratch. It was trained in synthetic and public datasets for supervised finetuning, followed by large-scale RLOO with rule-based rewards. Reka Flash 3.1 is an improved version of Reka Flash 3 due to significant advances in our reinforcement learning stack and curated high-qaulity RL data. Reka Flash 3.1 is particularly strong on coding and as a base model to be finetuned on agentic tasks.
2
  Reka Flash 3.1 improves by 10 points on LiveCodeBench v5 (Full set) from Reka Flash 3. For coding related tasks, Reka Flash 3.1 is competitive with models such as Qwen3-32B. o3-mini, and Gemini 2.5 Flash Thinking. If you want to learn more about how we do reinforcement learning for Reka Flash 3.1 that results in these improvements, please check out this post.
3
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ # Reka Flash 3.1
6
  Reka Flash 3.1 is a 21B general-purpose reasoning model that was trained from scratch. It was trained in synthetic and public datasets for supervised finetuning, followed by large-scale RLOO with rule-based rewards. Reka Flash 3.1 is an improved version of Reka Flash 3 due to significant advances in our reinforcement learning stack and curated high-qaulity RL data. Reka Flash 3.1 is particularly strong on coding and as a base model to be finetuned on agentic tasks.
7
  Reka Flash 3.1 improves by 10 points on LiveCodeBench v5 (Full set) from Reka Flash 3. For coding related tasks, Reka Flash 3.1 is competitive with models such as Qwen3-32B. o3-mini, and Gemini 2.5 Flash Thinking. If you want to learn more about how we do reinforcement learning for Reka Flash 3.1 that results in these improvements, please check out this post.
8