thuml/bytesized32-world-model-rlvr-binary-reward

	---
	base_model:
	- thuml/bytesized32-world-model-sft
	datasets:
	- thuml/bytesized32-world-model-cot
	license: mit
	tags:
	- text-game
	- world-model
	- rlvr
	pipeline_tag: text-generation
	library_name: transformers
	---

	[Project Page](https://thuml.github.io/RLVR-World/)
	[GitHub Repository](https://github.com/thuml/RLVR-World)

	See https://github.com/thuml/RLVR-World for examples of how to use this model.
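Since the metadata lists `transformers` as the library and `text-generation` as the pipeline, the checkpoint can presumably be loaded with the standard causal-LM classes. The following is a minimal sketch under that assumption, not the authors' reference code: the `generate` helper, the placeholder prompt, and the `max_new_tokens` value are illustrative choices, and the actual prompt format expected by the model is defined in the GitHub repository.

```python
# Minimal sketch (assumption): load this checkpoint as a causal LM via transformers.
# The model ID is taken from this repository's name; everything else is illustrative.
MODEL_ID = "thuml/bytesized32-world-model-rlvr-binary-reward"


def generate(prompt: str, max_new_tokens: int = 512) -> str:
    """Load the checkpoint and return the model's continuation of `prompt`."""
    # Imported lazily so that importing this file does not pull in the
    # heavy dependencies until generation is actually requested.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto")
    model.eval()

    # Tokenize the prompt and move the tensors to the model's device.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


if __name__ == "__main__":
    # Hypothetical placeholder prompt; the real state-transition prompt
    # template should be taken from the RLVR-World repository.
    placeholder_prompt = "Describe the next game state given the current state and action."
    print(generate(placeholder_prompt))
```

Running the script downloads the weights on first use. For results comparable to those reported by the authors, use the prompt construction and evaluation scripts from the repository rather than the placeholder prompt above.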

	## Citation

	```bibtex
	@article{wu2025rlvr,
	title={RLVR-World: Training World Models with Reinforcement Learning},
	author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long},
	journal={arXiv preprint arXiv:2505.13934},
	year={2025},
	}
	```