ParScale
community

Base models trained on 1T high-quality tokens, demonstrating strong competitiveness among existing SOTA small models (<2B).
Instruct models from the ParScale-1.8B base models, trained on SmolTalk-1M to enable conversational capabilities.
- ParScale/ParScale-1.8B-P8-Inst
  Text Generation • 2B • Updated • 63 • 2
- ParScale/ParScale-1.8B-P4-Inst
  Text Generation • 2B • Updated • 16 • 1
- ParScale/ParScale-1.8B-P2-Inst
  Text Generation • 2B • Updated • 6
- ParScale/ParScale-1.8B-P1-Inst
  Text Generation • 2B • Updated • 75 • 1
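A minimal loading sketch for one of the instruct checkpoints is shown below. It assumes the models work with the standard transformers causal-LM API and that trust_remote_code=True is required for the custom ParScale architecture; the prompt text is only an example.

```python
# Minimal sketch: load a ParScale instruct checkpoint and generate a chat reply.
# Assumptions: standard transformers causal-LM interface; trust_remote_code=True
# because ParScale uses a custom model implementation.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ParScale/ParScale-1.8B-P8-Inst"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# Build a chat prompt with the tokenizer's chat template and generate.
messages = [{"role": "user", "content": "Summarize parallel scaling in one sentence."}]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```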
Continual pre-training of the Qwen-2.5-3B model.
Checkpoints for PEFT of Qwen-2.5; the backbone weights are frozen.
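The sketch below is a generic illustration of attaching an adapter to a frozen backbone with the peft library; the backbone id Qwen/Qwen2.5-3B and the adapter repo placeholder are assumptions, and the actual ParScale checkpoint format may differ.

```python
# Generic sketch of loading adapter weights on a frozen backbone.
# Assumptions: checkpoints follow the standard peft adapter format;
# "Qwen/Qwen2.5-3B" is the assumed backbone; the adapter repo id below
# is a hypothetical placeholder, not an actual checkpoint name.
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-3B")

# Freeze the backbone; only adapter parameters would be updated during training.
for p in base.parameters():
    p.requires_grad = False

# Attach the adapter weights from a (placeholder) checkpoint repo.
model = PeftModel.from_pretrained(base, "ParScale/<peft-checkpoint>")
```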