SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper ā¢ 2501.17161 ā¢ Published 17 days ago ā¢ 105
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper ā¢ 2408.08872 ā¢ Published Aug 16, 2024 ā¢ 98
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper ā¢ 2407.12784 ā¢ Published Jul 17, 2024 ā¢ 49