---
title: Qwen2-VL Amazon Listing Generator
emoji: 🛒
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: apache-2.0
---

# 🛒 Qwen2-VL Amazon Listing Generator (LoRA)

This Hugging Face Space showcases a **fine-tuned Qwen2-VL-7B model with a LoRA adapter**, trained to generate **Amazon-style product listings** from product images.

## 🚀 Features

- **Vision-Language Model**: Qwen2-VL-7B-Instruct with a custom LoRA adapter
- **Amazon Listing Generation**: Creates structured product listings with:
  - Product title
  - Bullet points (key features)
  - Product description
  - Keywords
  - Product category
- **CPU Optimized**: Runs on free CPU hardware (may take 1-2 minutes per generation)

## 🔧 Model Details

- **Base Model**: [Qwen/Qwen2-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)
- **LoRA Adapter**: [soupstick/qwen2vl-amazon-ft-lora](https://huggingface.co/soupstick/qwen2vl-amazon-ft-lora)
- **Fine-tuning**: Specialized for e-commerce product listing generation

## 🎯 How to Use

1. **Upload Image**: Click the image upload area and select a product photo
2. **Optional Prompt**: Modify the instruction if needed (the default works well)
3. **Generate**: Click "Generate Listing" and wait for the result
4. **Review Output**: Get a structured Amazon-style listing in JSON format

## 📋 Expected Output Format

```json
{
  "title": "Product Title Here",
  "bullet_points": [
    "• Key feature 1",
    "• Key feature 2",
    "• Key feature 3"
  ],
  "description": "Detailed product description...",
  "keywords": "relevant, product, keywords",
  "category": "Product > Category > Subcategory"
}
```

## ⚡ Performance Notes

- **CPU Mode**: This demo runs on CPU hardware so that it stays free to access
- **Processing Time**: 1-2 minutes per generation due to CPU limitations
- **Image Size**: Images are automatically resized to 512px for faster processing
- **Memory Optimized**: Uses float32 and low-memory settings

## 🔗 Links

- [Model Repository](https://huggingface.co/soupstick/qwen2vl-amazon-ft-lora)
- [Base Model](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)
- [Qwen2-VL Paper](https://arxiv.org/abs/2409.12191)

## ⚠️ Limitations

- **Demo Purpose**: This is a prototype intended to demonstrate the concept
- **Accuracy**: Results depend on training data quality and model size
- **Speed**: CPU inference is slower than GPU inference (upgrade the Space hardware for faster results)
- **Languages**: Primarily trained on English product descriptions

## 🛠️ Technical Stack

- **Framework**: Transformers, PEFT (LoRA), Gradio
- **Model**: Qwen2-VL-7B with a custom LoRA adapter (Unsloth-AI)
- **Hardware**: CPU-optimized for the Hugging Face Spaces free tier

---

*Built with ❤️ using Hugging Face Spaces*
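
## 🧪 Checking the Output (Sketch)

The JSON structure shown under "Expected Output Format" can be checked programmatically before the listing is used elsewhere. The snippet below is a minimal sketch, not part of `app.py`: `parse_listing` and `EXPECTED_FIELDS` are hypothetical names, and it assumes the model may surround the JSON object with extra text, which this README does not confirm. It uses only the Python standard library.

```python
import json

# Field names and types taken from the "Expected Output Format" section.
EXPECTED_FIELDS = {
    "title": str,
    "bullet_points": list,
    "description": str,
    "keywords": str,
    "category": str,
}


def parse_listing(raw_text):
    """Extract the first JSON object from raw model text and validate it.

    Hypothetical helper: assumes the JSON may be wrapped in extra text.
    Raises ValueError if no object is found or a field is missing/mistyped.
    """
    decoder = json.JSONDecoder()
    start = raw_text.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value starting at index `start`
            # and ignores anything that follows it.
            listing, _end = decoder.raw_decode(raw_text, start)
            break
        except json.JSONDecodeError:
            # Not valid JSON from this brace; try the next one.
            start = raw_text.find("{", start + 1)
    else:
        raise ValueError("no JSON object found in model output")

    problems = []
    for field, expected_type in EXPECTED_FIELDS.items():
        if field not in listing:
            problems.append(f"missing field: {field}")
        elif not isinstance(listing[field], expected_type):
            problems.append(f"{field} should be {expected_type.__name__}")
    if problems:
        raise ValueError("; ".join(problems))
    return listing


if __name__ == "__main__":
    # Made-up sample text for illustration only, not real model output.
    sample = (
        "Here is the listing:\n"
        '{"title": "Product Title Here", '
        '"bullet_points": ["• Key feature 1", "• Key feature 2"], '
        '"description": "Detailed product description...", '
        '"keywords": "relevant, product, keywords", '
        '"category": "Product > Category > Subcategory"}'
    )
    print(parse_listing(sample)["title"])
```

Raising `ValueError` with all problems joined into one message keeps the check easy to surface in a UI; a stricter schema library could replace it if the output format grows.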