Metal Whale

metalwhale

https://blog.metalwhale.dev/

AI & ML interests

None yet

Organizations

None yet

metalwhale's activity

upvoted an article 3 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

3 days ago

• 477

liked a model 9 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 498k • 5.38k

liked a model 21 days ago

vikhyatk/moondream2

Image-Text-to-Text • Updated 22 days ago • 156k • 1.01k

upvoted a paper about 2 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

liked a model about 2 months ago

tencent/HunyuanVideo

Text-to-Video • Updated 10 days ago • 7.74k • 1.54k

upvoted a collection 2 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 24 days ago • 293

upvoted an article 3 months ago

Article

Releasing the largest multilingual open pretraining dataset

Nov 13, 2024

• 98

upvoted a paper 3 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 169

liked 6 models 4 months ago

upvoted a collection 4 months ago

Qwen2.5

Collection

Qwen2.5 language models, both pretrained and instruction-tuned, in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 488

liked 2 models 4 months ago

Qwen/Qwen2.5-7B-Instruct

Text Generation • Updated 19 days ago • 1.27M • 459

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated Dec 11, 2024 • 4.95k • 688

liked 2 models 5 months ago

fishaudio/fish-speech-1.4

Text-to-Speech • Updated Nov 5, 2024 • 1.4k • 448

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 1.54M • 8.37k

upvoted a paper 8 months ago

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Paper • 2406.07522 • Published Jun 11, 2024 • 38