Aritra Roy Gosthipaty's picture

Aritra Roy Gosthipaty PRO

ariG23498

AI & ML interests

Deep Representation Learning

Recent Activity

updated a model about 14 hours ago
ariG23498/layerskip-hf-smollm-135m-topv2
updated a Space about 15 hours ago
ariG23498/flux-edit
new activity about 15 hours ago
ariG23498/flux-edit:update seeding
View all activity

Articles

Organizations

Hugging Face's profile picture Google's profile picture Notebooks-explorers's profile picture PyTorch Image Models's profile picture Keras's profile picture Hugging Test Lab's profile picture Hugging Face Fellows's profile picture Probing ViTs's profile picture TrystAI's profile picture PyImageSearch's profile picture Keras Dreambooth Event's profile picture Hugging Face OSS Metrics's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture kotol's profile picture gg-hf's profile picture MLX Community's profile picture IBM Granite's profile picture Open Generative Fill's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture nltpt's profile picture nltpt-q's profile picture qrias's profile picture Hugging Face Science's profile picture open/ acc's profile picture wut?'s profile picture LLM from Scratch's profile picture

ariG23498's activity

upvoted an article about 20 hours ago
view article
Article

Mixture of Experts Explained

275
upvoted an article about 21 hours ago
view article
Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

By not-lain
19
upvoted an article 2 days ago
view article
Article

Welcome to Inference Providers on the Hub 🔥

171
upvoted an article 3 days ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

460
upvoted an article 7 days ago
view article
Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

95
upvoted an article 8 days ago
view article
Article

Mastering Long Contexts in LLMs with KVPress

By nvidia
58
upvoted an article 9 days ago
view article
Article

Unlocking Longer Generation with Key-Value Cache Quantization

40
upvoted an article 10 days ago
upvoted an article 14 days ago
view article
Article

Timm ❤️ Transformers: Use any timm model with transformers

37
upvoted 2 articles 15 days ago
view article
Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

61
view article
Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

40
upvoted an article 24 days ago