Spaces: prowriting/paraphraser-api

prowriting committed 12 days ago

Commit 79f6639 (verified) · 1 parent: 9c00f33

Fix tokenizer errors and restructure project for Space deployment


- Removed old conflicting main.py file that was causing runtime errors
- Added proper app.py entrypoint compatible with Hugging Face Spaces
- Updated requirements.txt to include sentencepiece and tiktoken
- Ensured T5 tokenizer loads correctly by supporting SentencePiece
- Packaged all files into a clean zip for upload
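Reviewer note: the tokenizer fix above depends on `sentencepiece` being installed in the Space. A minimal sketch of a pre-flight check follows, assuming the import names match the package names listed in the README (`missing_modules` and `REQUIRED_MODULES` are hypothetical names, not part of this commit):

```python
import importlib.util

# Import names assumed to match the pip package names listed in the README.
REQUIRED_MODULES = ["transformers", "torch", "sentencepiece", "tiktoken", "gradio"]


def missing_modules(names):
    """Return the top-level module names that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

Running this once before loading the model would surface a missing dependency as a readable message rather than a tokenizer traceback.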

Files changed (3)

README.md +31 -6
app.py +27 -4
requirements.txt +1 -1

README.md CHANGED

@@ -1,14 +1,39 @@
 ---
-title: My Hugging Face Space
-emoji: 🚀
-colorFrom: blue
-colorTo: green
+title: Paraphrasing App
+emoji: 🔄
+colorFrom: indigo
+colorTo: blue
 sdk: gradio
 sdk_version: "4.29.0"
 app_file: app.py
 pinned: false
 ---
-# My Hugging Face Space
-This is a demo space fixed with proper configuration.
+# 🔄 Paraphrasing App
+This Space uses a **T5 transformer model** to paraphrase input text into different variations.
+It is built with **Gradio** and **Hugging Face Transformers**.
+## 🚀 Features
+- Enter any sentence or paragraph
+- Get multiple paraphrased outputs
+- Powered by pretrained **T5 model**
+## 🛠️ Requirements
+All dependencies are listed in `requirements.txt`:
+- `transformers`
+- `torch`
+- `sentencepiece`
+- `tiktoken`
+- `gradio`
+## 💡 Example
+Input:
+> "The quick brown fox jumps over the lazy dog."
+Output:
+- "A fast brown fox leaps over a lazy dog."
+- "The lazy dog was jumped over by a quick brown fox."
+---
+Built with ❤️ using Hugging Face Spaces

app.py CHANGED

@@ -1,7 +1,30 @@
 import gradio as gr
-def greet(name):
-    return f"Hello {name}!"
-iface = gr.Interface(fn=greet, inputs="text", outputs="text")
-iface.launch()
+from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
+# Load model and tokenizer
+model_name = "t5-small"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
+def paraphrase(text, num_return_sequences=3, num_beams=5):
+    input_text = "paraphrase: " + text + " </s>"
+    inputs = tokenizer.encode(input_text, return_tensors="pt", max_length=512, truncation=True)
+    outputs = model.generate(
+        inputs,
+        max_length=512,
+        num_beams=num_beams,
+        num_return_sequences=num_return_sequences,
+        temperature=1.5
+    )
+    return [tokenizer.decode(output, skip_special_tokens=True, clean_up_tokenization_spaces=True) for output in outputs]
+demo = gr.Interface(
+    fn=paraphrase,
+    inputs=[gr.Textbox(lines=3, label="Enter text"), gr.Slider(1, 5, value=3, step=1, label="Number of outputs")],
+    outputs=gr.List(label="Paraphrased Sentences"),
+    title="🔄 Paraphrasing App",
+    description="Paraphrase any input text using a pretrained T5 transformer model."
+)
+if __name__ == "__main__":
+    demo.launch()
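Reviewer note: three details in `paraphrase` may be worth a second look. First, the slider value is passed positionally as `num_return_sequences` and may arrive as a float. Second, beam search in `generate` cannot return more sequences than it has beams. Third, `temperature` is only applied when sampling is enabled with `do_sample=True`, so the `temperature=1.5` above probably has no effect as written. The following is a minimal sketch of a helper that assembles the arguments with those points in mind. `build_generation_kwargs` and its `sample` flag are hypothetical names, not part of this commit:

```python
def build_generation_kwargs(num_return_sequences, num_beams=5, sample=False, temperature=1.0):
    """Assemble keyword arguments for model.generate() (hypothetical helper)."""
    n = int(num_return_sequences)  # a UI slider may deliver a float such as 3.0
    if n < 1:
        raise ValueError("num_return_sequences must be at least 1")
    kwargs = {
        "max_length": 512,
        # Beam search cannot return more sequences than it has beams.
        "num_beams": max(int(num_beams), n),
        "num_return_sequences": n,
    }
    if sample:
        # Temperature only takes effect when sampling is switched on.
        kwargs["do_sample"] = True
        kwargs["temperature"] = float(temperature)
    return kwargs
```

Inside `paraphrase`, the call would then read `model.generate(inputs, **build_generation_kwargs(num_return_sequences))`. Separately, the T5 tokenizer normally appends the end-of-sequence token itself, so the explicit `" </s>"` suffix is likely redundant. The stock `t5-small` checkpoint was also not trained on a `paraphrase:` prefix, so output quality may be limited without a fine-tuned model.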

requirements.txt CHANGED

@@ -1,5 +1,5 @@
-gradio==4.29.0
 transformers
 torch
 sentencepiece
 tiktoken
+gradio==4.29.0
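Reviewer note: the README front matter declares `sdk_version: "4.29.0"` and `requirements.txt` pins `gradio==4.29.0`, so the Gradio version is now stated in two places. A minimal sketch of a consistency check follows, using plain string parsing rather than a YAML library. `front_matter_value` and `pinned_version` are hypothetical helpers, not part of this commit:

```python
import re


def front_matter_value(readme_text, key):
    """Return the value of `key` from the leading '---' block, or None."""
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return None
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of the front matter block
        name, sep, value = line.partition(":")
        if sep and name.strip() == key:
            return value.strip().strip("\"'")
    return None


def pinned_version(requirements_text, package):
    """Return the version from a 'package==version' line, or None if unpinned or absent."""
    pattern = re.compile(rf"^{re.escape(package)}\s*==\s*([^\s#;]+)", re.IGNORECASE)
    for line in requirements_text.splitlines():
        match = pattern.match(line.strip())
        if match:
            return match.group(1)
    return None
```

Comparing `front_matter_value(readme, "sdk_version")` with `pinned_version(requirements, "gradio")` would flag the two files drifting apart in a later commit. The other four packages are unpinned, so `pinned_version` returns `None` for them.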