LLM
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
OpenMOSS Team of SII
True Speech-to-Speech Language Model
MOSS-TTSD: Text to Spoken Dialogue Generation