DailyChat-350M

A fine-tuned version of Codegen-350M-nl, trained on the 'daily_dialog' dataset. The goal is a model that is capable of holding a decent conversation.

Training Procedure

This model was trained on Kaggle's servers using 1x NVIDIA P100, for 1 epoch with a learning rate of 1e-2.
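As a rough illustration, the stated hyperparameters could be captured in a small config object and handed to the Hugging Face `Trainer`. This is only a sketch, not the actual training script: the Hub id `Salesforce/codegen-350M-nl`, the batch size, and the `FinetuneConfig` and `to_training_arguments` names are assumptions made for the example. Only the epoch count, learning rate, and dataset name come from this card.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FinetuneConfig:
    """Hypothetical container for the settings described above."""
    base_model: str = "Salesforce/codegen-350M-nl"  # assumed Hub id of Codegen-350M-nl
    dataset: str = "daily_dialog"                   # dataset named in this card
    num_train_epochs: int = 1                       # stated in this card
    learning_rate: float = 1e-2                     # stated in this card
    per_device_train_batch_size: int = 8            # assumption: not stated in this card


def to_training_arguments(cfg: FinetuneConfig, output_dir: str):
    """Translate the config into transformers.TrainingArguments.

    The import is local so the config can be inspected without
    transformers and torch installed.
    """
    from transformers import TrainingArguments

    return TrainingArguments(
        output_dir=output_dir,
        num_train_epochs=cfg.num_train_epochs,
        learning_rate=cfg.learning_rate,
        per_device_train_batch_size=cfg.per_device_train_batch_size,
    )
```

Calling `to_training_arguments(FinetuneConfig(), "out")` would give an object that can be passed to `Trainer` together with a tokenized copy of the dataset.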

Biases & Limitations

This model likely has the same biases and limitations as the original model it is based on, plus heavy biases from the dataset. It can generate offensive output when prompted, so user discretion is advised.

Intended Use

Dialog generation, chat agents.
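A minimal sketch of dialog generation with the `transformers` library might look like the following. The repository id `DarwinAnim8or/DailyChat-350M` is taken from this model's page. The prompt format (one turn per line), the sampling settings, and the helper names `build_prompt` and `generate_reply` are assumptions for illustration; the format actually used during fine-tuning is not documented here.

```python
def build_prompt(turns):
    """Join prior dialog turns, one per line, ending with a newline.

    Assumption: a newline-separated history is a workable prompt format.
    """
    return "\n".join(t.strip() for t in turns) + "\n"


def generate_reply(turns, model_id="DarwinAnim8or/DailyChat-350M", max_new_tokens=40):
    """Sample one reply line from the model given the dialog history."""
    # Imported here so build_prompt can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(build_prompt(turns), return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,   # sampling settings are illustrative, not tuned
        top_p=0.9,
        temperature=0.8,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Keep only the newly generated tokens, then the first line of text.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    text = tokenizer.decode(new_tokens, skip_special_tokens=True)
    return text.strip().split("\n")[0]
```

For example, `generate_reply(["Hi, how was your weekend?"])` would download the weights on first use and return a single sampled line. Given the limitations noted above, the output should be reviewed or filtered before it is shown to users.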

Model size (Safetensors): 441M params. Tensor types: F32, BOOL.
