Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes, available in GGUF, dynamic 4-bit, and 16-bit formats. • 29 items • Updated about 2 hours ago • 30
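For readers who want to try one of the GGUF variants locally, here is a minimal sketch using llama-cpp-python; the repo id and quant filename pattern are assumptions for illustration, so check the collection for the exact artifacts.

```python
from llama_cpp import Llama

# Assumed repo id and 4-bit quant filename pattern -- verify against the collection.
llm = Llama.from_pretrained(
    repo_id="unsloth/gemma-3-4b-it-GGUF",  # hypothetical GGUF repo from the collection
    filename="*Q4_K_M.gguf",               # hypothetical 4-bit quantization file
    n_ctx=4096,                            # context window for local inference
)

# Run a single chat turn against the quantized model.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize what GGUF is in one sentence."}]
)
print(out["choices"][0]["message"]["content"])
```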
Sarashina2.2 Collection Large Language Models developed by SB Intuitions. Pretrained and instruction-tuned models are available in three sizes: 0.5B, 1B, and 3B. • 6 items • Updated 9 days ago • 4
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 4 items • Updated 12 days ago • 38
Falcon3 Collection The Falcon3 family of open foundation models is a set of pretrained and instruction-tuned LLMs ranging from 1B to 10B parameters. • 40 items • Updated 29 days ago • 83
AceMath Collection We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated Jan 17 • 11
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published 24 days ago • 36
InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning Paper • 2502.11573 • Published 25 days ago • 9
Article Small Language Models (SLMs): A Comprehensive Overview • By jjokah • 20 days ago • 15