GuoLiangTang

Tommy930

https://github.com/TommyTang930

AI & ML interests

LLM，NLP，ML

Recent Activity

upvoted a paper about 3 hours ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

upvoted a paper about 3 hours ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

upvoted a paper about 3 hours ago

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

View all activity

Organizations

None yet

Tommy930's activity

upvoted 6 papers about 3 hours ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published about 15 hours ago • 1

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published about 18 hours ago • 5

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published about 14 hours ago • 3

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published 1 day ago • 3

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published about 17 hours ago • 13

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Paper • 2502.04306 • Published about 14 hours ago • 7

upvoted 3 papers about 7 hours ago

Large Language Model Guided Self-Debugging Code Generation

Paper • 2502.02928 • Published 2 days ago • 6

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published 3 days ago • 6

Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation

Paper • 2502.00306 • Published 6 days ago • 2

upvoted 11 papers 1 day ago

Jailbreaking with Universal Multi-Prompts

Paper • 2502.01154 • Published 4 days ago • 5

A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods

Paper • 2502.01618 • Published 4 days ago • 5

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 1 day ago • 7

Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models

Paper • 2501.19054 • Published 7 days ago • 6

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Paper • 2502.01839 • Published 3 days ago • 3

Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models

Paper • 2501.19389 • Published 7 days ago • 2

Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?

Paper • 2502.00674 • Published 5 days ago • 8