# RoBERTa Fine-Tuned Model for Question Answering

This repository hosts a fine-tuned version of the RoBERTa model optimized for question-answering tasks using the [SQuAD](w) dataset. The model is designed to efficiently perform question answering while maintaining high accuracy.

## Model Details

- **Model Architecture**: RoBERTa
- **Task**: Question Answering
- **Dataset**: [SQuAD](w) (Stanford Question Answering Dataset)
- **Quantization**: FP16
- **Fine-tuning Framework**: Hugging Face Transformers

## 🚀 Usage

### Installation

```bash
pip install transformers torch
```

### Loading the Model

```python
from transformers import RobertaTokenizer, RobertaForQuestionAnswering
import torch

# Select the GPU if one is available, otherwise fall back to the CPU
device = "cuda" if torch.cuda.is_available() else "cpu"

model_name = "AventIQ-AI/roberta-chatbot"
model = RobertaForQuestionAnswering.from_pretrained(model_name).to(device)
tokenizer = RobertaTokenizer.from_pretrained(model_name)
```

### Chatbot Inference

```python
from transformers import pipeline

# Load the QA pipeline (device=0 uses the first GPU; pass device=-1 to run on CPU)
qa_pipeline = pipeline("question-answering", model=model, tokenizer=tokenizer, device=0)

# Sample context and question for a flight-schedule example
context = "Flight AI101 departs from New York at 10:00 AM and arrives in San Francisco at 1:30 PM. The flight duration is 5 hours and 30 minutes."
question = "What is the duration of Flight AI101?"

# Run extractive QA; the result contains the answer span and a confidence score
result = qa_pipeline(question=question, context=context)
print(result)
```

## 📊 Evaluation Results

After fine-tuning the RoBERTa-base model for question answering, we evaluated the model's performance on the validation set from the SQuAD dataset using the standard SQuAD metrics, Exact Match (EM) and F1.
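
As a rough sketch of how these metrics can be computed, the Hugging Face `evaluate` library (installed separately via `pip install evaluate`) provides the official SQuAD metric. The single prediction/reference pair below is made up purely for illustration; a real evaluation loops over the whole validation set:

```python
import evaluate

# Load the official SQuAD metric (reports Exact Match and F1)
squad_metric = evaluate.load("squad")

# Illustrative example only; the id "ex1" and the answer text are made up
predictions = [{"id": "ex1", "prediction_text": "5 hours and 30 minutes"}]
references = [{
    "id": "ex1",
    "answers": {"text": ["5 hours and 30 minutes"], "answer_start": [113]},
}]

print(squad_metric.compute(predictions=predictions, references=references))
# {'exact_match': 100.0, 'f1': 100.0}
```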

## ⚡ Quantization Details

Post-training quantization was applied using PyTorch's built-in quantization framework. The model was quantized to Float16 (FP16) to reduce model size and improve inference efficiency while balancing accuracy.
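
For reference, a minimal way to produce an FP16 copy of the model with plain PyTorch is to cast the weights with `half()`. This is a generic sketch, not necessarily the exact script used for this checkpoint:

```python
from transformers import RobertaForQuestionAnswering

model = RobertaForQuestionAnswering.from_pretrained("AventIQ-AI/roberta-chatbot")

# Cast all floating-point parameters to FP16, roughly halving the model size
model = model.half()

# Recent transformers versions write the weights as model.safetensors by default
model.save_pretrained("roberta-chatbot-fp16")
```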

## 📂 Repository Structure

```
.
├── model/               # Contains the quantized model files
├── tokenizer_config/    # Tokenizer configuration and vocabulary files
├── model.safetensors    # Quantized model weights
└── README.md            # Model documentation
```

## ⚠️ Limitations

- The model may struggle with highly ambiguous sentences.
- Quantization may lead to slight degradation in accuracy compared to full-precision models.
- Performance may vary across different writing styles and sentence structures.

## 🤝 Contributing

Contributions are welcome! Feel free to open an issue or submit a pull request if you have suggestions or improvements.

## Contact

For any questions or suggestions, please feel free to reach out via [Issues](w) on GitHub or contact the maintainers at [[email protected]](mailto:[email protected]).

## Citation

If you use this model in your research or project, please cite the following paper:

```bibtex
@article{liu2019roberta,
  title={RoBERTa: A Robustly Optimized BERT Pretraining Approach},
  author={Liu, Yinhan and Ott, Myle and Goyal, Naman and Du, Jingfei and Joshi, Mandar and Chen, Danqi and Levy, Omer and Lewis, Mike and Zettlemoyer, Luke and Stoyanov, Veselin},
  journal={arXiv preprint arXiv:1907.11692},
  year={2019}
}
```

## Frequently Asked Questions (FAQ)

**Q: How can I fine-tune this model on a custom dataset?**

A: To fine-tune the model on your own dataset, you can follow these steps (a minimal sketch follows the list):

1. Preprocess your dataset into a format compatible with the Hugging Face `Trainer`.
2. Use the `RobertaForQuestionAnswering` class and set up the fine-tuning loop using the `Trainer` API from Hugging Face.
3. Train the model on your dataset and evaluate it using metrics like F1 and Exact Match.
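
A minimal sketch of those steps with the `Trainer` API is shown below. The dataset objects and hyperparameters are illustrative placeholders, and a real preprocessing step must map each answer's character offsets to token-level `start_positions` and `end_positions`:

```python
from transformers import RobertaForQuestionAnswering, Trainer, TrainingArguments

model = RobertaForQuestionAnswering.from_pretrained("AventIQ-AI/roberta-chatbot")

# Placeholder datasets: each example is assumed to already contain input_ids,
# attention_mask, start_positions and end_positions from your preprocessing.
args = TrainingArguments(
    output_dir="roberta-qa-finetuned",
    per_device_train_batch_size=8,
    num_train_epochs=2,      # illustrative hyperparameters
    learning_rate=3e-5,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,  # hypothetical preprocessed training split
    eval_dataset=eval_dataset,    # hypothetical preprocessed validation split
)
trainer.train()
```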

**Q: What if my model is running out of memory during inference?**

A: If you are running out of memory, try the following (an INT8 sketch follows the list):

- Use smaller batch sizes or batch the inference.
- Perform inference on CPU if GPU memory is insufficient.
- Quantize the model further (e.g., FP16 to INT8) to reduce the model size.
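
As one example of the last bullet, PyTorch's dynamic quantization can convert the model's linear layers to INT8 for CPU inference. This is a generic recipe and has not been validated for this particular checkpoint:

```python
import torch
from transformers import RobertaForQuestionAnswering

model = RobertaForQuestionAnswering.from_pretrained("AventIQ-AI/roberta-chatbot")

# Quantize all nn.Linear layers to INT8; the resulting model runs on CPU
quantized_model = torch.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)
```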

**Q: Can I use this model for other NLP tasks?**

A: This model is primarily fine-tuned for question answering. If you want to adapt it for other NLP tasks (such as sentiment analysis or text classification), you will need to replace the model head accordingly and fine-tune it on the relevant dataset, as in the sketch below.
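
For instance, the same checkpoint can be loaded with a classification head. This is a hypothetical adaptation: the new head is randomly initialized and must be fine-tuned before it is useful:

```python
from transformers import RobertaForSequenceClassification

# Reuses the fine-tuned encoder weights; the 2-class head starts untrained
clf_model = RobertaForSequenceClassification.from_pretrained(
    "AventIQ-AI/roberta-chatbot", num_labels=2
)
```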

---

We hope this model helps you in your NLP tasks! Feel free to contribute improvements or share your results with us!