Jellyfish042's picture

4 2 15

Jellyfish042

Jellyfish042

·

AI & ML interests

None yet

Recent Activity

updated a Space 17 days ago

Jellyfish042/UncheatableEval

liked a model 3 months ago

BlinkDL/rwkv-8-pile

updated a Space 3 months ago

Jellyfish042/RWKV-Compare

View all activity

Organizations

None yet

updated a Space 17 days ago

UncheatableEval

Explore LLM compression data with interactive filters

liked a model 3 months ago

BlinkDL/rwkv-8-pile

Updated Jun 30 • 16

updated a Space 3 months ago

RWKV Compare

Convert RWKV parameters to equivalent values

published a Space 3 months ago

RWKV Compare

Convert RWKV parameters to equivalent values

New activity in tiiuae/Falcon-H1-1.5B-Base 3 months ago

Hugging Face implementation is very slow during prefill

#2 opened 3 months ago by

liked a model 4 months ago

howard-hou/RWKV-X

Updated Apr 18 • 4

New activity in microsoft/bitnet-b1.58-2B-4T 5 months ago

configuration_bitnet.py missing

#4 opened 5 months ago by

upvoted a paper 6 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 153

commented a paper 6 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 153 •

liked a Space 6 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a model 6 months ago

BlinkDL/rwkv-7-world

Text Generation • Updated May 31 • 104

updated 3 datasets 9 months ago

Jellyfish042/parity_1m

Viewer • Updated Dec 1, 2024 • 5 • 30

Jellyfish042/parity_1m

Viewer • Updated Dec 1, 2024 • 5 • 30

Jellyfish042/sudoku_500k

Updated Nov 29, 2024 • 16

updated a Space 10 months ago

UncheatableEval

Explore LLM compression data with interactive filters

upvoted a paper about 1 year ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16, 2024 • 57

liked a Space over 1 year ago

UncheatableEval

Explore LLM compression data with interactive filters

liked a model over 1 year ago

BlinkDL/rwkv-6-world

Text Generation • Updated Nov 13, 2024 • 147

updated a model over 1 year ago

Jellyfish042/QLing-1.8B-Chat-V0

Text Generation • 2B • Updated Jan 17, 2024 • 6 • 1

liked a dataset over 1 year ago

Jellyfish042/Chinese-LIMA-V0

Viewer • Updated Jan 17, 2024 • 1k • 9 • 6