Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
16
51
143
Krishna Kaasyap
KrishnaKaasyap
Follow
Juanelopo's profile picture
victor's profile picture
21world's profile picture
3 followers
·
24 following
krishnakaasyap
krishnakaasyap.bsky.social
AI & ML interests
Test Time Training Multimodal & Inter-Modality Transfer Learning Mechanistic Interpretability Evolutionary Model Merging Swarm Intelligence of multiple models with different architectures and different algorithms MuZero approach to general tasks
Recent Activity
liked
a model
4 days ago
deepseek-ai/Janus-Pro-7B
new
activity
11 days ago
deepseek-ai/DeepSeek-R1-Distill-Llama-70B:
SFT (Non-RL) distillation is this good on a sub-100B model?
liked
a model
11 days ago
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
View all activity
Organizations
KrishnaKaasyap
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
4 days ago
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
4 days ago
•
79.3k
•
2.11k
liked
3 models
11 days ago
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation
•
Updated
5 days ago
•
92.7k
•
395
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
5 days ago
•
498k
•
5.36k
deepseek-ai/DeepSeek-R1-Zero
Text Generation
•
Updated
5 days ago
•
17.1k
•
631
liked
a model
16 days ago
MiniMaxAI/MiniMax-Text-01
Text Generation
•
Updated
14 days ago
•
5.9k
•
489
liked
a Space
29 days ago
Running
783
🦀
InstantCoder
liked
a dataset
about 1 month ago
PowerInfer/QWQ-LONGCOT-500K
Viewer
•
Updated
Dec 26, 2024
•
286k
•
1.9k
•
117
liked
4 models
about 1 month ago
PowerInfer/SmallThinker-3B-Preview
Text Generation
•
Updated
15 days ago
•
110k
•
377
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
Updated
19 days ago
•
178k
•
529
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
7 days ago
•
409k
•
2.88k
deepseek-ai/DeepSeek-V3-Base
Updated
7 days ago
•
23.4k
•
1.47k
liked
a Space
about 1 month ago
Running
873
🔍
QwQ-32B-Preview
QwQ-32B-Preview
liked
3 models
about 2 months ago
deepseek-ai/DeepSeek-V2.5-1210
Text Generation
•
Updated
Dec 11, 2024
•
310k
•
247
tencent/HunyuanVideo
Text-to-Video
•
Updated
10 days ago
•
7.74k
•
1.54k
Qwen/QwQ-32B-Preview
Text Generation
•
Updated
19 days ago
•
189k
•
1.6k
liked
a model
2 months ago
mistralai/Mistral-Large-Instruct-2411
Updated
Nov 19, 2024
•
1.28M
•
197
liked
a Space
3 months ago
Running
1.28k
🐢
Qwen2.5 Coder Artifacts
liked
a model
3 months ago
Etched/oasis-500m
Updated
Nov 4, 2024
•
165
•
437
liked
2 models
4 months ago
ssmits/Qwen2.5-95B-Instruct
Text Generation
•
Updated
Oct 31, 2024
•
56
•
3
mlabonne/BigLlama-3.1-681B-Instruct
Text Generation
•
Updated
Aug 4, 2024
•
14
•
11
Load more