DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking Paper • 2502.20730 • Published 14 days ago • 33
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 29 days ago • 143
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 29 days ago • 184
PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models Paper • 2412.18608 • Published Dec 24, 2024 • 15
TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing Paper • 2312.05605 • Published Dec 9, 2023 • 3
Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes Paper • 2312.06353 • Published Dec 11, 2023 • 7
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations Paper • 2312.06674 • Published Dec 7, 2023 • 8