Reza8848
/

MUFFIN-T5-11B

Text2Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

MUFFIN-T5-11B / README.md

Reza8848's picture

Update README.md

1d183f4 verified about 1 year ago

|

history blame contribute delete

1.25 kB

	---
	datasets:
	- Reza8848/MUFFIN_68k
	language:
	- en
	license: mit
	---

	<img src="https://cdn-uploads.huggingface.co/production/uploads/6434a6e8ea46c009904c617e/J_4FHXmtM6TuRnN3aL06y.png" width="38" height="38">


	This is the model weight of MUFFIN-T5-11B (Multi-Faceted Instructions).

	We fine-tune the [T5-11B](https://huggingface.co/t5-11b) model on our [MUFFIN dataset](https://arxiv.org/abs/2312.02436).

	We released both 3B and 11B models:
	\|Model\|Number of parameters\|
	\|-\|-\|
	\|[MUFFIN-T5-3B](https://huggingface.co/Reza8848/MUFFIN-T5-3B)\|3 billion\|
	\|[MUFFIN-T5-11B](https://huggingface.co/Reza8848/MUFFIN-T5-11B)\|11 billion\|

	Please refer to [MUFFIN-T5-3B](https://huggingface.co/Reza8848/MUFFIN-T5-3B) for detailed documentation.




	## 🥳 Citation

	Please kindly cite our paper if you use any resources in this repository:

	```bibtex
	@inproceedings{Lou2023MUFFIN,
	title={{MUFFIN}: Curating Multi-Faceted Instructions for Improving Instruction Following},
	author={Renze Lou and Kai Zhang and Jian Xie and Yuxuan Sun and Janice Ahn and Hanzi Xu and Yu su and Wenpeng Yin},
	booktitle={The Twelfth International Conference on Learning Representations},
	year={2024},
	url={https://openreview.net/forum?id=1vrS1zwekw}
	}
	```