Edmon02
's Collections
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Paper
•
2309.14717
•
Published
•
44
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper
•
2310.09199
•
Published
•
29
Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4
on mock CFA Exams
Paper
•
2310.08678
•
Published
•
14
MiniGPT-v2: large language model as a unified interface for
vision-language multi-task learning
Paper
•
2310.09478
•
Published
•
21
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper
•
2310.11453
•
Published
•
104
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Paper
•
2310.17631
•
Published
•
35
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme
Long Sequence Transformer Models
Paper
•
2309.14509
•
Published
•
20
Skywork: A More Open Bilingual Foundation Model
Paper
•
2310.19341
•
Published
•
6
UFOGen: You Forward Once Large Scale Text-to-Image Generation via
Diffusion GANs
Paper
•
2311.09257
•
Published
•
48
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world
APIs
Paper
•
2307.16789
•
Published
•
101
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
•
2312.15166
•
Published
•
59
MobileQuant: Mobile-friendly Quantization for On-device Language Models
Paper
•
2408.13933
•
Published
•
16
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page
Document Understanding
Paper
•
2409.03420
•
Published
•
27
Scaling Smart: Accelerating Large Language Model Pre-training with Small
Model Initialization
Paper
•
2409.12903
•
Published
•
23
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
141
Language Models Learn to Mislead Humans via RLHF
Paper
•
2409.12822
•
Published
•
11
MathCoder2: Better Math Reasoning from Continued Pretraining on
Model-translated Mathematical Code
Paper
•
2410.08196
•
Published
•
48
Transformer^2: Self-adaptive LLMs
Paper
•
2501.06252
•
Published
•
55
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System
Collaboration
Paper
•
2505.20256
•
Published
•
17
VideoREPA: Learning Physics for Video Generation through Relational
Alignment with Foundation Models
Paper
•
2505.23656
•
Published
•
24
SuperWriter: Reflection-Driven Long-Form Generation with Large Language
Models
Paper
•
2506.04180
•
Published
•
33
MemOS: A Memory OS for AI System
Paper
•
2507.03724
•
Published
•
153
CriticLean: Critic-Guided Reinforcement Learning for Mathematical
Formalization
Paper
•
2507.06181
•
Published
•
41
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed
Inference
Paper
•
2508.02193
•
Published
•
129
Memp: Exploring Agent Procedural Memory
Paper
•
2508.06433
•
Published
•
33
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning
Paper
•
2508.08221
•
Published
•
42
Pass@k Training for Adaptively Balancing Exploration and Exploitation of
Large Reasoning Models
Paper
•
2508.10751
•
Published
•
26
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust
GAIA Problem Solving
Paper
•
2508.09889
•
Published
•
32
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper
•
2508.06471
•
Published
•
169