---
license: mit
language:
  - en
---

Ettin Checkpoints

License: MIT | Paper | Models | GitHub

This repository contains the raw training checkpoints for the Ettin models. Each model has its own subdirectory (e.g. enc-150m for Ettin-Encoder-150m) with three subfolders, one per training phase: pretrain, ext, and decay.

These files work with Composer and contain all state needed to resume pre-training. Please see the ModernBERT repository for usage details.
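
Because each model lives in its own subdirectory with pretrain, ext, and decay subfolders, it may be convenient to fetch only the folders you need rather than the whole repository. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and that files sit under `<model>/<phase>/` as described above. The helper names `checkpoint_patterns` and `download_checkpoints`, the default `local_dir`, and any repository id you pass in are illustrative assumptions, not part of this repository.

```python
from pathlib import Path

# The three per-model subfolders named in this README.
PHASES = ("pretrain", "ext", "decay")


def checkpoint_patterns(model: str, phases=PHASES) -> list[str]:
    """Build glob patterns that select the given phase folders of one model."""
    unknown = [p for p in phases if p not in PHASES]
    if unknown:
        raise ValueError(f"unknown phase(s): {unknown}")
    return [f"{model}/{phase}/*" for phase in phases]


def download_checkpoints(
    repo_id: str,
    model: str,
    phases=PHASES,
    local_dir: str = "ettin-checkpoints",  # hypothetical target folder
) -> Path:
    """Download only the selected folders and return the local root path."""
    # Imported lazily so the pattern helper works without the dependency.
    from huggingface_hub import snapshot_download

    path = snapshot_download(
        repo_id=repo_id,  # assumption: the "<owner>/<name>" id of this repository
        allow_patterns=checkpoint_patterns(model, phases),
        local_dir=local_dir,
    )
    return Path(path)
```

For example, `download_checkpoints("<owner>/ettin-checkpoints", "enc-150m", phases=["decay"])` would fetch only the decay-phase files for Ettin-Encoder-150m, with `<owner>` replaced by this repository's actual namespace. Loading the downloaded state into Composer is covered by the ModernBERT repository and is not shown here.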

🔗 Related Resources

Citation

@misc{weller2025seqvsseqopen,
      title={Seq vs Seq: An Open Suite of Paired Encoders and Decoders}, 
      author={Orion Weller and Kathryn Ricci and Marc Marone and Antoine Chaffin and Dawn Lawrie and Benjamin Van Durme},
      year={2025},
      eprint={2507.11412},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2507.11412}, 
}