Autellix: An Efficient Serving Engine for LLM Agents as General Programs Paper • 2502.13965 • Published 20 days ago • 18
Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding Paper • 2502.05609 • Published Feb 8 • 18
Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding Paper • 2502.05609 • Published Feb 8 • 18
Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding Paper • 2502.05609 • Published Feb 8 • 18 • 3
Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations Paper • 2404.13948 • Published Apr 22, 2024 • 1
Test-Time Self-Adaptive Small Language Models for Question Answering Paper • 2310.13307 • Published Oct 20, 2023
Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker Paper • 2305.13729 • Published May 23, 2023 • 1
Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering Paper • 2310.17490 • Published Oct 26, 2023
Augmenting Document Representations for Dense Retrieval with Interpolation and Perturbation Paper • 2203.07735 • Published Mar 15, 2022
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity Paper • 2403.14403 • Published Mar 21, 2024 • 6
Towards Effective Counter-Responses: Aligning Human Preferences with Strategies to Combat Online Trolling Paper • 2410.04164 • Published Oct 5, 2024
EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation Paper • 2412.12559 • Published Dec 17, 2024 • 1
Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations Paper • 2404.13948 • Published Apr 22, 2024 • 1
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published Jan 10 • 68
Revisiting In-Context Learning with Long Context Language Models Paper • 2412.16926 • Published Dec 22, 2024 • 30
Knowledge-Augmented Large Language Models for Personalized Contextual Query Suggestion Paper • 2311.06318 • Published Nov 10, 2023 • 2