---
title: Nano Gpt Shakespeare Data
emoji: 🐢
colorFrom: pink
colorTo: blue
sdk: gradio
sdk_version: 4.0.2
app_file: app.py
pinned: false
license: mit
---


## Task: Training a character-level GPT from scratch on the works of Shakespeare

# Contributors:

## Gosula Sunandini
GitHub profile: https://github.com/sunandhini96
## Katipally Vigneshwar Reddy
GitHub profile: https://github.com/katipallyvig8899


### Developed the nanoGPT application using Gradio.


# Some of the steps followed during training:

### Data preparation:

Download the single (about 1 MB) Shakespeare text file and turn the raw text into one large stream of integers. In a character-level model each character is a token, so tokenizing and encoding the data means mapping every character to an integer id.

`python data/shakespeare_char/prepare.py`

This creates `train.bin` and `val.bin` in that data directory.
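To make the encoding step concrete, here is a minimal sketch of character-level tokenization. It is not the contents of `prepare.py`: the sample string, the helper names `encode` and `decode`, and the 90/10 train/validation split are assumptions chosen for illustration.

```python
# Hypothetical stand-in for the downloaded Shakespeare text file.
text = "To be, or not to be"

# Vocabulary: every distinct character, sorted so ids are assigned stably.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}  # character -> integer id
itos = {i: ch for ch, i in stoi.items()}      # integer id -> character


def encode(s):
    """Map a string to a list of integer token ids, one per character."""
    return [stoi[c] for c in s]


def decode(ids):
    """Map a list of integer token ids back to the original string."""
    return "".join(itos[i] for i in ids)


ids = encode(text)

# Assumed 90/10 split; the ratio actually used is whatever prepare.py defines.
split = int(0.9 * len(ids))
train_ids, val_ids = ids[:split], ids[split:]
```

Because the mapping is one-to-one, `decode(encode(text))` returns the original text, which is what lets generated integer ids be turned back into readable characters.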

### Train the model:

To train the model, run the following command:

`python train.py config/train_shakespeare_char.py`
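As a rough illustration of what training on a stream of integers involves, the sketch below draws a batch for next-character prediction: each target sequence is the input sequence shifted one position to the right. The function name `get_batch` and the parameters `block_size`, `batch_size`, and `rng` are assumptions for this example; the actual `train.py` may organize this differently.

```python
import random


def get_batch(ids, block_size, batch_size, rng):
    """Return (inputs, targets) where each target is its input shifted by one."""
    xs, ys = [], []
    for _ in range(batch_size):
        # Pick a start so that the shifted target window still fits in ids.
        start = rng.randrange(len(ids) - block_size)
        xs.append(ids[start:start + block_size])
        ys.append(ids[start + 1:start + 1 + block_size])
    return xs, ys


# Hypothetical token stream standing in for the contents of train.bin.
ids = list(range(20))
rng = random.Random(0)
x, y = get_batch(ids, block_size=8, batch_size=4, rng=rng)
```

At every position the model sees the characters so far in a row of `x` and is trained to predict the character stored at the same position in the matching row of `y`.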

### Sample data generation:

To generate sample text from the trained model, run the following command:

`python sample.py --out_dir=out-shakespeare-char`