Qwen2.5-1M Collection • The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 12 days ago • 98
Chat With Janus-Pro-7B 🌍 • Running on Zero • 1.51k • A unified multimodal understanding and generation model.
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Paper • 2407.12327 • Published Jul 17, 2024 • 78
VideoAgent: Long-form Video Understanding with Large Language Model as Agent Paper • 2403.10517 • Published Mar 15, 2024 • 33
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Paper • 2403.08764 • Published Mar 13, 2024 • 36
Gemma: Open Models Based on Gemini Research and Technology Paper • 2403.08295 • Published Mar 13, 2024 • 48
Design2Code: How Far Are We From Automating Front-End Engineering? Paper • 2403.03163 • Published Mar 5, 2024 • 94
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 608