--- library_name: peft license: other base_model: Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4 tags: - llama-factory - lora - generated_from_trainer model-index: - name: Qwen2-VL-2B-Instruct-GPTQ-Int4-LoRA-SurveillanceVideo-Classification-250210 results: [] pipeline_tag: video-classification --- # Qwen2-VL-2B-Instruct-GPTQ-Int4-LoRA-SurveillanceVideo-Classification-250210 This model is a fine-tuned version of [Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4) on the Surveillance Video Classification dataset. ## Model description This model takes a video as input and classifies it into one of the following six classes [1. loitering, 2. breaking and entering, 3. abandonment, 4. falling down, 5. fighting, 6. arson] LLaMA-Factory was used for training, with the same hyperparameters as described below. ## Intended uses & limitations This Model Fine-tuned by the Prompt Below. The same is true when running inference. ```python messages = [ { "role": "user", "content": [ { "type": "video", "video": video_path, "max_pixels": 640 * 360, # "fps": 1.0 # maybe default fps = 1.0 }, { "type": "text", "text": ( "