Bingxuan Wang's picture

3 3 1

Bingxuan Wang

YellowDoge

·

AI & ML interests

None yet

Recent Activity

new activity 13 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B:Vocab size in config.json mismatches the actual tokenizer size

authored a paper 15 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

upvoted a paper about 1 month ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

View all activity

Organizations

None yet

YellowDoge's activity

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-7B 13 days ago

Vocab size in config.json mismatches the actual tokenizer size

#4 opened 15 days ago by

authored a paper 15 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 16 days ago • 302

upvoted a paper about 1 month ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 40

upvoted a paper about 2 months ago

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published Dec 19, 2024 • 85

liked a dataset 3 months ago

blitt/SPoRC

Updated Nov 19, 2024 • 48 • 8

authored a paper 8 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 62

New activity in THUDM/glm-4-9b-chat 8 months ago

KeyError: '<|endoftext|>' when using the tokenizer

#3 opened 8 months ago by

upvoted a paper 11 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 43

authored a paper 11 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 43