Here's a quick walkthrough of the first drop of material that works toward the use case:
- A fundamental introduction to reinforcement learning, answering questions like 'what is a reward?' and 'how do we create an environment for a language model?'
- Then it focuses on DeepSeek-R1, walking through the paper and highlighting its key aspects. This is an old-school way to learn ML topics, but it always works.
- Next, it takes you to Transformers Reinforcement Learning (TRL) and demonstrates potential reward functions you could use. This is cool because it uses Marimo notebooks to visualise the rewards.
- Finally, Maxime walks us through a real training notebook that uses GRPO to reduce generation length. I'm really into this because it works, and Maxime took the time to validate it, sharing assets and logs from his own runs for you to compare against.
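As a rough illustration of the kind of reward a length-reduction run uses, here is a minimal sketch in the shape TRL expects for a custom reward function (a callable taking `completions` and returning a list of floats). The name `length_reward` and the word-count budget are my own assumptions, not Maxime's actual implementation:

```python
def length_reward(completions, target_words=100, **kwargs):
    """Toy reward: full score at or under the word budget,
    decaying linearly to zero as completions run long."""
    rewards = []
    for text in completions:
        n = len(text.split())
        if n <= target_words:
            rewards.append(1.0)
        else:
            # linear penalty: hits 0.0 at twice the budget
            rewards.append(max(0.0, 1.0 - (n - target_words) / target_words))
    return rewards

print(length_reward(["short and sweet", " ".join(["word"] * 250)]))  # → [1.0, 0.0]
```

GRPO then pushes the policy toward completions that score higher within each sampled group, so a reward like this steadily shortens generations.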
Maxime's work and notebooks have been a major part of the open-source community over the last few years. I, like everyone else, have learnt so much from them.
So a cool thing happened: Nomic/GPT4All released a "Reasoning/Thinking" (QwQ/o1/o3-style) model that uses JavaScript functions to calculate things like the haversine distance between two places. It's very cool to see such complex calculative/recursive AI in such a small package.
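The model calls small JavaScript helpers at inference time rather than doing the arithmetic in-weights; for reference, the haversine great-circle distance such a helper computes looks like this (sketched here in Python, not the actual GPT4All helper):

```python
import math

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points in kilometres."""
    R = 6371.0  # mean Earth radius in km
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * R * math.asin(math.sqrt(a))

print(round(haversine_km(48.8566, 2.3522, 51.5074, -0.1278)))  # Paris → London, ~344 km
```

Offloading this kind of exact formula to a tool call is what lets a small model answer distance questions reliably.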
I was able to adapt their methods to one of my small models, "Replicant" (2 GB), and created a new model with importance-matrix quantization, using the "THE_KEY" dataset for better inference in the coding model I pulled from Whiterabbitneo's Qwen2.5 model... I give you Reasoning Rabbit. Enjoy!