Weijing Huang

waleking

AI & ML interests

Language Models

Organizations

None yet

waleking's activity

liked a dataset 11 days ago

OpenStellarTeam/Chinese-SimpleQA

Viewer • Updated Dec 16, 2024 • 3k • 785 • 24

liked a dataset 17 days ago

allenai/olmOCR-mix-0225

Viewer • Updated 17 days ago • 259k • 4.34k • 90

upvoted a paper 28 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published about 1 month ago • 47

liked a dataset about 1 month ago

Anthropic/EconomicIndex

Viewer • Updated Feb 10 • 3.51k • 3.85k • 190

upvoted a paper about 1 month ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published Feb 5 • 15

upvoted an article about 1 month ago

Replicating DeepSeek R1 for Information Extraction

Article • Jan 31 • 38

upvoted a paper about 2 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

liked a Space 2 months ago

📈 Scaling test-time compute

Enhance math problem solving by scaling test-time compute • 535

upvoted an article 3 months ago

Deriving DPO's Loss

Article • Dec 24, 2024 • 26

liked a dataset 5 months ago

m-a-p/MAP-CC

Viewer • Updated Jul 11, 2024 • 1.77B • 9.88k • 67

liked a dataset 6 months ago

Lyte/Reasoner-1o1-v0.3-HQ

Viewer • Updated Sep 18, 2024 • 370 • 93 • 8

liked a dataset 7 months ago

pints-ai/Expository-Prose-V1

Viewer • Updated Aug 12, 2024 • 6.67M • 954 • 19

liked a model 7 months ago

PleIAs/OCRonos-Vintage

Text Generation • Updated Aug 8, 2024 • 478 • 79

liked a dataset 9 months ago

mikex86/stackoverflow-posts

Viewer • Updated Aug 1, 2023 • 58.3M • 3.49k • 50

liked 2 datasets 10 months ago

Replete-AI/code_bagel

Viewer • Updated Oct 8, 2024 • 2.22M • 255 • 95

kenhktsui/open-react-retrieval-multi-neg-result-new-kw

Viewer • Updated Aug 7, 2023 • 25.2k • 114 • 3

liked a dataset 11 months ago

PleIAs/Post-OCR-Correction

Viewer • Updated Apr 28, 2024 • 50.4k • 1.67k • 127

liked a model 11 months ago

shenzhi-wang/Llama3-8B-Chinese-Chat

Text Generation • Updated Jul 4, 2024 • 43.2k • 677

liked 2 datasets 11 months ago

YanweiLi/MGM-Pretrain

Viewer • Updated Apr 21, 2024 • 1.27M • 37 • 16

m-a-p/COIG-CQIA

Viewer • Updated Apr 18, 2024 • 44.7k • 5.79k • 611