Bo Zheng

bzheng

AI & ML interests

None yet

bzheng's activity

upvoted a paper 9 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 10 days ago • 61

authored a paper 25 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 29 days ago • 48

upvoted a paper 28 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 29 days ago • 48

authored a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 344

authored a paper 3 months ago

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation

Paper • 2410.21157 • Published Oct 28, 2024 • 6

authored a paper 6 months ago

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Paper • 2407.16154 • Published Jul 23, 2024 • 22

authored a paper 7 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

authored a paper 8 months ago

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Paper • 2405.15738 • Published May 24, 2024 • 44

New activity in Qwen/Qwen1.5-MoE-A2.7B-Chat 10 months ago

How does this version's 28 GB GPU memory consumption compare with the 14B model?

#7 opened 10 months ago by

RuntimeError: cutlassF: no kernel found to launch!

#1 opened 10 months ago by

authored a paper 11 months ago

AtomoVideo: High Fidelity Image-to-Video Generation

Paper • 2403.01800 • Published Mar 4, 2024 • 22