---
title: Nano Gpt Shakespeare Data
emoji: 🐢
colorFrom: pink
colorTo: blue
sdk: gradio
sdk_version: 4.0.2
app_file: app.py
pinned: false
license: mit
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

## Task: Training a character-level GPT from scratch on the works of Shakespeare

# Contributors:

## Gosula Sunandini

GitHub repository: https://github.com/sunandhini96

## Katipally Vigneshwar Reddy

GitHub repository: https://github.com/katipallyvig8899

### Developed the nano GPT application using Gradio.

# Steps followed during training:

### Data preparation:

Download the single (1MB) text file and turn it from raw text into one large stream of integers. In a character-level model each character becomes a token, so tokenizing the text means splitting it into characters and encoding each character as an integer.

`python data/shakespeare_char/prepare.py`

This creates `train.bin` and `val.bin` in that data directory.

### Train the model:

To train the model, run the following command:

`python train.py config/train_shakespeare_char.py`

### Sample data generation:

To generate sample data, run the following command:

`python sample.py --out_dir=out-shakespeare-char`
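
### Character-level encoding sketch:

The character-level encoding described under data preparation can be illustrated with a minimal sketch. This is not the contents of `prepare.py`; the function names (`build_vocab`, `encode`, `decode`, `split_ids`) and the 90/10 train/validation split are assumptions chosen for illustration, and the sketch keeps the IDs in Python lists rather than writing `.bin` files.

```python
# Minimal sketch of character-level tokenization (illustrative only).
# All function names and the 0.9 split fraction are assumptions,
# not taken from prepare.py.

def build_vocab(text):
    """Map each distinct character to an integer ID and back."""
    chars = sorted(set(text))                      # stable, sorted vocabulary
    stoi = {ch: i for i, ch in enumerate(chars)}   # character -> ID
    itos = {i: ch for ch, i in stoi.items()}       # ID -> character
    return stoi, itos

def encode(text, stoi):
    """Turn a string into a list of integer token IDs."""
    return [stoi[ch] for ch in text]

def decode(ids, itos):
    """Turn a list of integer token IDs back into a string."""
    return "".join(itos[i] for i in ids)

def split_ids(ids, train_fraction=0.9):
    """Split the token stream into training and validation parts."""
    n = int(len(ids) * train_fraction)
    return ids[:n], ids[n:]

if __name__ == "__main__":
    sample = "To be, or not to be"
    stoi, itos = build_vocab(sample)
    ids = encode(sample, stoi)
    train_ids, val_ids = split_ids(ids)
    print(len(stoi), "distinct characters")
    print(decode(ids, itos))
```

Because every character is its own token, the vocabulary stays very small, and decoding is an exact inverse of encoding for any text built from characters seen when the vocabulary was created.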