Pavel Iakubovskii
qubvel-hf
AI & ML interests
Computer Vision models
Recent Activity
upvoted a collection 3 days ago
YOLOE
upvoted an article 3 days ago
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM
liked a model 12 days ago
xingyang1/Distill-Any-Depth
Organizations
qubvel-hf's activity

upvoted a collection 3 days ago
Collection • 10 items • Updated • 2

upvoted an article 3 days ago
Article
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM
By
and 3 others
•
•
298
upvoted a paper 12 days ago

upvoted an article 12 days ago
Article
SmolVLM2: Bringing Video Understanding to Every Device
•
209

reacted to clem's post with 🔥 12 days ago
Post
5888
Super happy to welcome Nvidia as our latest enterprise hub customer. They have almost 2,000 team members using Hugging Face, and close to 20,000 followers of their org. Can't wait to see what they'll open-source for all of us in the coming months!
Nvidia's org: https://huggingface.co/nvidia
Enterprise hub: https://huggingface.co/enterprise
Update code snippet
#6 opened 15 days ago by qubvel-hf

Update code snippet
#11 opened 15 days ago by qubvel-hf

Update code snippet
#8 opened 15 days ago by qubvel-hf


upvoted a paper 17 days ago
SigLip2 Does Not Reproduce Expected Results
3
#7 opened 20 days ago by dogukan-bg

commented on SigLIP 2: A better multilingual vision language encoder 19 days ago
btw, I also observed that "." and a capitalized template influence the confidence quite a bit

commented on SigLIP 2: A better multilingual vision language encoder 19 days ago
Not sure what's up, as I'm not familiar with this codebase (and have no time to dig in), but for SigLIP what you're supposed to do is sigmoid(zimg @ ztxt * temperature + bias).
From what you describe, I would bet the bias and/or temperature are missing?
The ground-truth reference code is https://colab.research.google.com/github/google-research/big_vision/blob/main/big_vision/configs/proj/image_text/SigLIP2_demo.ipynb
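To make the formula in the comment above concrete, here is a minimal sketch of sigmoid(zimg @ ztxt * temperature + bias) for a single image-text pair. It is not the reference implementation: the function names, the L2-normalization step, and the temperature and bias values (100.0 and -10.0) are illustrative assumptions, not values taken from any SigLIP 2 checkpoint.

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length (assumed preprocessing, not stated in the comment)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def siglip_probability(zimg, ztxt, temperature, bias):
    """Return sigmoid(zimg . ztxt * temperature + bias) for one image-text pair."""
    zimg = l2_normalize(zimg)
    ztxt = l2_normalize(ztxt)
    logit = sum(a * b for a, b in zip(zimg, ztxt)) * temperature + bias
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical embeddings: one well-aligned pair and one orthogonal pair.
image_embedding = [0.2, 0.9, 0.1]
matching_text = [0.1, 1.0, 0.0]
unrelated_text = [1.0, 0.0, -2.0]

# Illustrative temperature and bias; real checkpoints learn their own values.
match_scaled = siglip_probability(image_embedding, matching_text, 100.0, -10.0)
mismatch_scaled = siglip_probability(image_embedding, unrelated_text, 100.0, -10.0)

# Same pairs with the temperature and bias left out (temperature=1, bias=0).
match_plain = siglip_probability(image_embedding, matching_text, 1.0, 0.0)
mismatch_plain = siglip_probability(image_embedding, unrelated_text, 1.0, 0.0)

print(match_scaled, mismatch_scaled)
print(match_plain, mismatch_plain)
```

With the two terms applied, the matching pair scores close to 1 and the unrelated pair close to 0. Without them, every score stays in a narrow band around 0.5, which is one way that results can fail to match the expected confidences.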
Hey @giffmana, temperature and bias are applied under the hood, see