This is the official checkpoint of feedback model trained using COFFEE-GYM with PPO strategy.
This model generates natural language feedback given an erroneous code.
For further detials, please see our paper.
- Downloads last month
- 9
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.