---
pipeline_tag: sentence-similarity
library_name: transformers
license: mit
---

# Model Card: Assisting Mathematical Formalization with A Learning-based Premise Retriever

## Model Description

This model is designed for **premise retrieval** in **Lean**, based on the **state representation** of Lean. The model follows the architecture described in the paper:

[Assisting Mathematical Formalization with A Learning-based Premise Retriever](https://arxiv.org/abs/2501.13959)

The model implementation and code are available at:

[GitHub Repository](https://github.com/ruc-ai4math/Premise-Retrieval)

[Try our model](https://premise-search.com)

---

## Available Models

The repository provides several models for different tasks:

### Pre-trained Models
Located in the `Pretrain_Model` folder:

- **410_stable_random**: Pre-trained model for **retrieval fine-tuning**.
- **410_stable_random_1024**: Pre-trained model for **reranking fine-tuning**.

### Fine-tuned Models

#### Retrieval Fine-tuned Models
Located in the `Finetune_Model` folder:

- Models fine-tuned on specific datasets for **retrieval tasks**.

#### Rerank Fine-tuned Models
Located in the `Rerank` folder:

- Models fine-tuned on specific datasets for **reranking tasks**.


## Citation
If you use this model, please cite the following paper:

```
@misc{tao2025assistingmathematicalformalizationlearningbased,
      title={Assisting Mathematical Formalization with A Learning-based Premise Retriever}, 
      author={Yicheng Tao and Haotian Liu and Shanwen Wang and Hongteng Xu},
      year={2025},
      eprint={2501.13959},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2501.13959}, 
}
```