Marco
AI & ML interests
Recent Activity
Organizations
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition β’ 6B β’ Updated β’ 344k β’ 1.48k -
microsoft/Magma-8B
Image-Text-to-Text β’ 9B β’ Updated β’ 2.39k β’ 407 -
Runtime error4545
Magma UI
πMagma-8B model for UI Agents
-
CohereLabs/aya-vision-32b
Image-Text-to-Text β’ 33B β’ Updated β’ 181 β’ β’ 215
-
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text β’ 0.6B β’ Updated β’ 58.5k β’ 212 -
Runtime error8282
GOT OCR Transformers
π·Demo of GOT-OCR 2.0's Transformers implementation
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 254k β’ 698 -
allenai/olmOCR-mix-0225
Viewer β’ Updated β’ 259k β’ 1.38k β’ 159
-
Running on CPU Upgrade11.4k11.4k
Stable Diffusion 2-1
π₯Generate images from text prompts
-
Running5656
SmolVLM 256M Instruct WebGPU
π¨Generate descriptions for images using WebGPU technology
-
Running3535
SmolVLM 500M Instruct WebGPU
π» -
deepseek-ai/Janus-Pro-7B
Any-to-Any β’ Updated β’ 193k β’ 3.48k
-
Running549549
DeepSeek-R1 WebGPU
π§Next-generation reasoning model that runs locally in-browser
-
Running8888
Qwen2.5-1M Demo
π»Upload documents and ask questions
-
mistralai/Mistral-Small-24B-Base-2501
24B β’ Updated β’ 22.7k β’ 257 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text β’ 16B β’ Updated β’ 24.6k β’ 163
-
RunningMCP116116
Consilium MCP Server
π’Multi-AI Expert Consensus Platform
-
SleepingMCP22
MCP Hackathon Deepfake Watchdog
π‘Upload your image and/or voice to scan for deepfake misuse o
-
Running3434
VulnBuster
π‘AI Security Agent: Multi-MCP Code Vulnerability Scanner
-
RunningMCP186186
AI Marketing Content Generator
π¨An AI-powered tool made for content creators and marketers
-
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition β’ Updated β’ 270k β’ 1.32k -
Running on T4426426
Parakeet-TDT-0.6b-V2
ΒTranscribe audio to text with timestamps
-
Running on CPU Upgrade3333
Blazing Fast Whisper
πBlazing Fast Whisper Deployed on HF Inference Endpoints
-
Running on CPU Upgrade1.05k1.05k
Open ASR Leaderboard
πView and request speech recognition model benchmarks
-
Running on T45454
RF-DETR
π₯SOTA real-time object detection model
-
Running on CPU Upgrade4949
YOLO ARENA
πcompare performance of top object detectors
-
Running on Zero8888
D-Fine - SOTA Real-Time Object Detector
β‘Object Detection on Images and Video
-
Running on ZeroMCP2828
Gaze LLE
πGaze Target Estimation
-
Running on ZeroMCP504504
LatentSync
πAudio Conditioned LipSync with Latent Diffusion Models
-
Running on Zero197197
BEN2
πRemove background from images and videos
-
Build error8080
SmolVLM
πGenerate answers by combining text and images
-
Runtime error5656
SmolVLM2 HighlightGenerator
π¨Generate video highlights from uploaded video
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text β’ 8B β’ Updated β’ 3.84k β’ 160 -
kyutai/hibiki-2b-pytorch-bf16
Translation β’ Updated β’ 1.05k β’ 55 -
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech β’ Updated β’ 50.5k β’ 1.1k -
Running on Zero625625
DiβͺβͺRhythm
πΆBlazingly Fast and Embarrassingly Simple Song Generation
-
onnx-community/Kokoro-82M-ONNX
Text-to-Speech β’ Updated β’ 14.5k β’ 149 -
Running209209
Kokoro Text-to-Speech
π£High-quality speech synthesis powered by Kokoro TTS
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text β’ 8B β’ Updated β’ 3.84k β’ 160 -
jonatasgrosman/wav2vec2-large-xlsr-53-english
Automatic Speech Recognition β’ 0.3B β’ Updated β’ 161k β’ 474
-
RunningMCP116116
Consilium MCP Server
π’Multi-AI Expert Consensus Platform
-
SleepingMCP22
MCP Hackathon Deepfake Watchdog
π‘Upload your image and/or voice to scan for deepfake misuse o
-
Running3434
VulnBuster
π‘AI Security Agent: Multi-MCP Code Vulnerability Scanner
-
RunningMCP186186
AI Marketing Content Generator
π¨An AI-powered tool made for content creators and marketers
-
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition β’ Updated β’ 270k β’ 1.32k -
Running on T4426426
Parakeet-TDT-0.6b-V2
ΒTranscribe audio to text with timestamps
-
Running on CPU Upgrade3333
Blazing Fast Whisper
πBlazing Fast Whisper Deployed on HF Inference Endpoints
-
Running on CPU Upgrade1.05k1.05k
Open ASR Leaderboard
πView and request speech recognition model benchmarks
-
Running on T45454
RF-DETR
π₯SOTA real-time object detection model
-
Running on CPU Upgrade4949
YOLO ARENA
πcompare performance of top object detectors
-
Running on Zero8888
D-Fine - SOTA Real-Time Object Detector
β‘Object Detection on Images and Video
-
Running on ZeroMCP2828
Gaze LLE
πGaze Target Estimation
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition β’ 6B β’ Updated β’ 344k β’ 1.48k -
microsoft/Magma-8B
Image-Text-to-Text β’ 9B β’ Updated β’ 2.39k β’ 407 -
Runtime error4545
Magma UI
πMagma-8B model for UI Agents
-
CohereLabs/aya-vision-32b
Image-Text-to-Text β’ 33B β’ Updated β’ 181 β’ β’ 215
-
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text β’ 0.6B β’ Updated β’ 58.5k β’ 212 -
Runtime error8282
GOT OCR Transformers
π·Demo of GOT-OCR 2.0's Transformers implementation
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 254k β’ 698 -
allenai/olmOCR-mix-0225
Viewer β’ Updated β’ 259k β’ 1.38k β’ 159
-
Running on ZeroMCP504504
LatentSync
πAudio Conditioned LipSync with Latent Diffusion Models
-
Running on Zero197197
BEN2
πRemove background from images and videos
-
Build error8080
SmolVLM
πGenerate answers by combining text and images
-
Runtime error5656
SmolVLM2 HighlightGenerator
π¨Generate video highlights from uploaded video
-
Running on CPU Upgrade11.4k11.4k
Stable Diffusion 2-1
π₯Generate images from text prompts
-
Running5656
SmolVLM 256M Instruct WebGPU
π¨Generate descriptions for images using WebGPU technology
-
Running3535
SmolVLM 500M Instruct WebGPU
π» -
deepseek-ai/Janus-Pro-7B
Any-to-Any β’ Updated β’ 193k β’ 3.48k
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text β’ 8B β’ Updated β’ 3.84k β’ 160 -
kyutai/hibiki-2b-pytorch-bf16
Translation β’ Updated β’ 1.05k β’ 55 -
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech β’ Updated β’ 50.5k β’ 1.1k -
Running on Zero625625
DiβͺβͺRhythm
πΆBlazingly Fast and Embarrassingly Simple Song Generation
-
Running549549
DeepSeek-R1 WebGPU
π§Next-generation reasoning model that runs locally in-browser
-
Running8888
Qwen2.5-1M Demo
π»Upload documents and ask questions
-
mistralai/Mistral-Small-24B-Base-2501
24B β’ Updated β’ 22.7k β’ 257 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text β’ 16B β’ Updated β’ 24.6k β’ 163
-
onnx-community/Kokoro-82M-ONNX
Text-to-Speech β’ Updated β’ 14.5k β’ 149 -
Running209209
Kokoro Text-to-Speech
π£High-quality speech synthesis powered by Kokoro TTS
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text β’ 8B β’ Updated β’ 3.84k β’ 160 -
jonatasgrosman/wav2vec2-large-xlsr-53-english
Automatic Speech Recognition β’ 0.3B β’ Updated β’ 161k β’ 474