Post
1933
I loved the idea of the Boxing by
sergiopaniego/vlm_object_understanding
And https://huggingface.co/spaces/webml-community/fastvlm-webgpu
So I tried to combine the two idea, unfortunately I can’t seem to get it consistent and I only worked on the File Upload side. You may have to change the prompt a bit to suite the video you upload but it seems to semi work. If anyone knows a better way to fix this, I really wanted to use this for a project but I can’t seem to figure it out.
Quazim0t0/FastVLMBoxes
I used videos from here and uploaded them to try it out.
https://pixabay.com/videos/search/branch+birds/
And https://huggingface.co/spaces/webml-community/fastvlm-webgpu
So I tried to combine the two idea, unfortunately I can’t seem to get it consistent and I only worked on the File Upload side. You may have to change the prompt a bit to suite the video you upload but it seems to semi work. If anyone knows a better way to fix this, I really wanted to use this for a project but I can’t seem to figure it out.
Quazim0t0/FastVLMBoxes
I used videos from here and uploaded them to try it out.
https://pixabay.com/videos/search/branch+birds/