H company

Team

company

Verified

https://www.hcompany.ai/

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

marc-thibault-h new activity 19 days ago

Hcompany/Holo1-Localization:Update app.py and requirements following Mungert's proposed setup

marc-thibault-h updated a Space 19 days ago

Hcompany/Holo1-Localization

marc-thibault-h new activity 19 days ago

Hcompany/Holo1-Localization:Fixed demo

View all activity

Articles

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Jun 3

• 70

merve

posted an update about 12 hours ago

Post

196

large AI labs have dropped so many open models last week 🔥 don't miss out on them

→ Apple released on-device vision LMs apple/fastvlm-68ac97b9cd5cacefdd04872e & apple/mobileclip2-68ac947dcb035c54bcd20c47
→ OpenGVLab released InternVL3.5, 32 new vision LMs with one based on gpt-oss! (OS) OpenGVLab/internvl35-68ac87bd52ebe953485927fb
→ MSFT released a killer small TTS model (OS) microsoft/VibeVoice-1.5B

find more herehttps://huggingface.co/collections/merve/august-29-releases-68b5a3754cfb8abf59e2b486

sergiopaniego

posted an update 6 days ago

Post

320

It's now posible to do end-2-end ML without leaving the @huggingface Hub, by combining TRL + HF jobs + Trackio!!

🐡We just released a full guide explaining the process.

Go check it out!

📖 Guide: https://huggingface.co/docs/trl/main/en/jobs_training

💡 Reminder: HF Jobs is only available for Pro, Team, or Enterprise plans. Yet another reason to upgrade

merve

posted an update 7 days ago

Post

5783

first vision language model built off openai/gpt-oss-20b just dropped! 🔥

InternVL3.5 comes with 32 models 🤯 pre-trained, fine-tuned, aligned in various sizes OpenGVLab/internvl35-68ac87bd52ebe953485927fb
comes with gpt-oss or Qwen3 for LLM part ⤵️

1 reply

marc-thibault-h

in Hcompany/Holo1-Localization 19 days ago

Update app.py and requirements following Mungert's proposed setup

#4 opened 19 days ago by

marc-thibault-h

updated a Space 19 days ago

Holo1 Localization

📚

Web Localization powered by Holo1

marc-thibault-h

in Hcompany/Holo1-Localization 19 days ago

Fixed demo

#2 opened 22 days ago by

Mungert

sergiopaniego

posted an update 20 days ago

Post

2864

So you can now SFT a model with hf jobs + TRL in ONE command lol 🏎️💨

Without worrying about infrastructure since it runs entirely on HF!

docs: https://huggingface.co/docs/huggingface_hub/main/en/guides/jobs
blog: https://huggingface.co/blog/hf-cli

MatsLRichter

updated a Space 21 days ago

Holo1 Localization

📚

Web Localization powered by Holo1

MatsLRichter

in Hcompany/Holo1-Localization 21 days ago

Update requirements.txt

#3 opened 21 days ago by

MatsLRichter

sergiopaniego

posted an update 21 days ago

Post

388

New Zero-Shot Object Detectors in transformers! 🥽

We’ve added LLMDet and MM GroundingDINO, plus a demo Space to compare them with others 🖼️

Play with it: ariG23498/zero-shot-od

sergiopaniego

posted an update 22 days ago

Post

355

Missed last week's OpenAI GPT OSS release?

Here are 2 quick-start recipes we developed to get you up to speed:

🏃‍♀️ How to run gpt-oss-20b on Google Colab
https://cookbook.openai.com/articles/gpt-oss/run-colab

🧑‍🔧 Fine-tuning with gpt-oss and Hugging Face Transformers
https://cookbook.openai.com/articles/gpt-oss/fine-tune-transfomers

marc-thibault-h

updated a Space 24 days ago

Holo1 Navigation

🐠

Web Navigation powered by Holo1 Vision-Language Action Model

sergiopaniego

posted an update 26 days ago

Post

442

Latest TRL release brings major upgrades for multimodal alignment!

We dive into 3 new techniques to improve VLM post-training in our new blog:

🌋 GRPO
🎞️ GSPO
🐙 MPO
➕ vLLM integration for online training w/ transformers backend\

🐡 Blog: https://huggingface.co/blog/trl-vlm-alignment

merve

posted an update 26 days ago

Post

3229

GPT-4.1-mini level model right in your iPhone 🤯

openbmb/MiniCPM-V-4 is only 4B while surpassing GPT-4.1-mini in vision benchmarks 🔥

allows commercial use as well!

sergiopaniego

posted an update 27 days ago

Post

2171

OpenAI's open models are out! 💃

Try: https://www.gpt-oss.com/
Learn: https://huggingface.co/blog/welcome-openai-gpt-oss

1 reply

merve

posted an update 28 days ago

Post

1118

we're all sleeping on this OCR model rednote-hilab/dots.ocr 🔥

dots.ocr is a new 3B model with sota performance, support for 100 languages & allowing commercial use! 🤯

single e2e model to extract image, convert tables, formula, and more into markdown 📝
try it MohamedRashad/Dots-OCR

sergiopaniego

posted an update 28 days ago

Post

3404

Want to learn how to align a Vision Language Model (VLM) for reasoning using GRPO and TRL? 🌋

🧑‍🍳 We've got you covered!!

NEW multimodal post training recipe to align a VLM using TRL in @HuggingFace 's Cookbook.

Go to the recipe 👉https://huggingface.co/learn/cookbook/fine_tuning_vlm_grpo_trl

Powered by the latest TRL v0.20 release, this recipe shows how to teach Qwen2.5-VL-3B-Instruct to reason over images 🌋

merve

posted an update 28 days ago

Post

654

massive releases and tons of Flux 1. Krea LoRas past week!
here's some of the picks, find more models in collection 🫡 merve/releases-august-2-6890c14248203522b7d0267f

LLMs 💬
> Tencent dropped tencent/Hunyuan-7B-Instruct
> Qwen released Qwen/Qwen3-Coder-30B-A3B-Instruct, 30B MoE with 3B params for coding (OS)

vision/multimodal
> RedNote released rednote-hilab/dots.ocr - 3B OCR model (OS)
> Cohere released CohereLabs/command-a-vision-07-2025 - 112B (dense!) VLM for 6 languages
> StepFun-AI shipped stepfun-ai/step3 - 321B MoE VLM (OS)
> Skywork shipped Skywork/Skywork-UniPic-1.5B - new any-to-any model (image+text → image+text) (OS)

sergiopaniego

posted an update 29 days ago

Post

4500

Just included example scripts for aligning models using GSPO (including VLM example) 🙆‍♂️🙆‍♂️

GSPO is the latest RL alignment algo by @Alibaba_Qwen and it's already supported in the latest TRL v0.20 release.

Super-easy-to-get-started example scripts below, GO run them!👩‍💻👩‍💻

🧑‍🎨 Script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo.py
🦄 VLM script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo_vlm.py
🧩 More TRL examples: https://huggingface.co/docs/trl/main/en/example_overview
🧙‍♂️ GSPO paper: Group Sequence Policy Optimization (2507.18071)

merve

posted an update about 1 month ago

Post

2231

Cohere just dropped CohereLabs/command-a-vision-07-2025, a 112B (dense!) vision LM
> based on SigLIP2 & Command-A
> built for enterprise use cases 🔥
> use with Inference Providers or transformers 🤗
read their blog https://huggingface.co/blog/CohereLabs/introducing-command-a-vision-07-2025

2 replies

AI & ML interests

Recent Activity

Articles

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Team members 31

Hcompany's activity

Update app.py and requirements following Mungert's proposed setup

Holo1 Localization

Fixed demo

Holo1 Localization

Update requirements.txt

Holo1 Navigation