Yuhui Xu

yuhuixu

https://yuhuixu1993.github.io/

yuhuixu1993

AI & ML interests

None yet

Recent Activity

authored a paper 9 days ago

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

authored a paper 9 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

upvoted a paper 9 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Organizations

None yet

yuhuixu's activity

authored 2 papers 9 days ago

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Paper • 2309.14717 • Published Sep 26, 2023 • 44

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 12 days ago • 34

upvoted a paper 9 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 12 days ago • 34

commented on a paper 9 days ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 12 days ago • 34

updated a model 20 days ago

yuhuixu/merged_model_linear_0.6_0.4

Text Generation • Updated 20 days ago • 9

published a model 20 days ago

yuhuixu/merged_model_linear_0.6_0.4

Text Generation • Updated 20 days ago • 9

updated a model 20 days ago

yuhuixu/merged_model_linear_0.5_0.5

Text Generation • Updated 20 days ago • 6

published a model 20 days ago

yuhuixu/merged_model_linear_0.5_0.5

Text Generation • Updated 20 days ago • 6

updated a model 20 days ago

yuhuixu/merged_model_linear_0.4_0.6

Text Generation • Updated 20 days ago • 9

published a model 20 days ago

yuhuixu/merged_model_linear_0.4_0.6

Text Generation • Updated 20 days ago • 9

upvoted an article 20 days ago

Mastering Long Contexts in LLMs with KVPress

Article • 20 days ago • 62

updated 3 models 29 days ago

upvoted 2 papers 4 months ago

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10, 2024 • 46

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

authored a paper 4 months ago

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

authored 3 papers 7 months ago

Latency-Aware Differentiable Neural Architecture Search

Paper • 2001.06392 • Published Jan 17, 2020

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

Paper • 1907.05737 • Published Jul 12, 2019

Trained Rank Pruning for Efficient Deep Neural Networks

Paper • 1812.02402 • Published Dec 6, 2018 • 1