Xiang Fu

craigxiangfu

https://fufoundation.co

AI & ML interests

None yet

craigxiangfu's activity

upvoted a paper 24 days ago

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Paper • 2502.13124 • Published Feb 18 • 6

liked a dataset 26 days ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 12.1k • 477

upvoted a paper 28 days ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 82

upvoted a collection 29 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 579

liked a model 29 days ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 1.11M • 2.22k

upvoted 3 papers about 1 month ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 27

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 95

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published Jan 6 • 44

liked a Space about 1 month ago

The Ultra-Scale Playbook

🌌 The ultimate guide to training LLM on large GPU Clusters • 2.4k

upvoted a paper about 2 months ago

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14 • 17

upvoted 4 papers 3 months ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 39

upvoted 2 papers 6 months ago

Can Models Learn Skill Composition from Examples?

Paper • 2409.19808 • Published Sep 29, 2024 • 10

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 175

upvoted a paper 7 months ago

Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11, 2024 • 31

liked a dataset 10 months ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jan 31 • 3.3B • 284k • 656

liked 2 models 10 months ago

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27, 2024 • 662k • 6.12k

meta-llama/Meta-Llama-3-70B

Text Generation • Updated Sep 27, 2024 • 18.3k • 855