| base_model: | |
| - thuml/bytesized32-world-model-sft | |
| datasets: | |
| - thuml/bytesized32-world-model-cot | |
| license: mit | |
| tags: | |
| - text-game | |
| - world-model | |
| - rlvr | |
| pipeline_tag: text-generation | |
| library_name: transformers | |
| [Project Page](https://thuml.github.io/RLVR-World/) | |
| [Github Repository](https://github.com/thuml/RLVR-World) | |
| See https://github.com/thuml/RLVR-World for examples for using this model. | |
| ## Citation | |
| ``` | |
| @article{wu2025rlvr, | |
| title={RLVR-World: Training World Models with Reinforcement Learning}, | |
| author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, | |
| journal={arXiv preprint arXiv:2505.13934}, | |
| year={2025}, | |
| } | |
| ``` |