3

Di Liu

diliu0349

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 4 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28

upvoted an article 9 months ago

Article

MInference 1.0: 10x Faster Million Context Inference with a Single GPU

Jul 11, 2024 • 13

upvoted a paper 9 months ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 44