3 25 36

Brigitte Tousignant

BrigitteTousi

AI & ML interests

None yet

Recent Activity

reacted to fdaudens's post with 🚀 about 6 hours ago

💪 The open-source community is really unstoppable: +5M total downloads for DeepSeek models on @hf.co +4M are from the 700 models created by the community That's 30% more than yesterday!

reacted to davidberenstein1957's post with 👍 about 6 hours ago

tldr; Parquet is awesome, DuckDB too! Datasets on the Hugging Face Hub rely on parquet files. We can interact with these files using DuckDB as a fast in-memory database system. One of DuckDB’s features is vector similarity search which can be used with or without an index. blog: https://huggingface.co/learn/cookbook/vector_search_with_hub_as_backend

reacted to fdaudens's post with 🔥 about 6 hours ago

🎯 Kokoro TTS just hit v1.0! 🚀 Small but mighty: 82M parameters, runs locally, speaks multiple languages. The best part? It's Apache 2.0 licensed! This could unlock so many possibilities ✨ Check it out: https://huggingface.co/hexgrad/Kokoro-82M

View all activity

Articles

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

Jun 24, 2024

• 34

AI Watermarking 101: Tools and Techniques

Feb 26, 2024

• 15

Organizations

BrigitteTousi's activity

reacted to fdaudens's post with 🚀 about 6 hours ago

Post

582

💪 The open-source community is really unstoppable:

+5M total downloads for DeepSeek models on @hf .co
+4M are from the 700 models created by the community
That's 30% more than yesterday!

reacted to davidberenstein1957's post with 👍 about 6 hours ago

Post

702

tldr; Parquet is awesome, DuckDB too!

Datasets on the Hugging Face Hub rely on parquet files. We can interact with these files using DuckDB as a fast in-memory database system. One of DuckDB’s features is vector similarity search which can be used with or without an index.

blog:
https://huggingface.co/learn/cookbook/vector_search_with_hub_as_backend

reacted to fdaudens's post with 🔥 about 6 hours ago

Post

841

🎯 Kokoro TTS just hit v1.0! 🚀

Small but mighty: 82M parameters, runs locally, speaks multiple languages. The best part? It's Apache 2.0 licensed!
This could unlock so many possibilities ✨

Check it out: hexgrad/Kokoro-82M

1 reply

reacted to pagezyhf's post with 🔥 about 6 hours ago

Post

368

We published https://huggingface.co/blog/deepseek-r1-aws!

If you are using AWS, give a read. It is a running document to showcase how to deploy and fine-tune DeepSeek R1 models with Hugging Face on AWS.

We're working hard to enable all the scenarios, whether you want to deploy to Inference Endpoints, Sagemaker or EC2; with GPUs or with Trainium & Inferentia.

We have full support for the distilled models, DeepSeek-R1 support is coming soon!! I'll keep you posted.

Cheers

liked a dataset 1 day ago

HuggingFaceFW/fineweb

Viewer • Updated 28 days ago • 48.6B • 438k • 1.83k

reacted to AdinaY's post with 🔥 1 day ago

Post

2522

It’s not just a flood of model releases, papers are dropping just as fast 🚀

Here are the 10 most upvoted papers from the Chinese community:
👉 zh-ai-community/2025-january-papers-679933cbf0f3ced11f5a168a

reacted to davanstrien's post with 👀 1 day ago

Post

1279

Why choose between strong LLM reasoning and efficient models?

Use DeepSeek to generate high-quality training data, then distil that knowledge into ModernBERT answerdotai/ModernBERT-base for fast, efficient classification.

Blog post: https://danielvanstrien.xyz/posts/2025/deepseek/distil-deepseek-modernbert.html

liked 2 Spaces 1 day ago

Configuration error

😻

Like History

Configuration error

5.54k

🥑

DALL·E mini

reacted to m-ric's post with ➕🤗❤️🚀🔥 2 days ago

Post

2807

𝗧𝗵𝗲 𝗛𝘂𝗯 𝘄𝗲𝗹𝗰𝗼𝗺𝗲𝘀 𝗲𝘅𝘁𝗲𝗿𝗻𝗮𝗹 𝗶𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗽𝗿𝗼𝘃𝗶𝗱𝗲𝗿𝘀!

✅ Hosting our own inference was not enough: now the Hub 4 new inference providers: fal, Replicate, SambaNova Systems, & Together AI.

Check model cards on the Hub: you can now, in 1 click, use inference from various providers (cf video demo)

Their inference can also be used through our Inference API client. There, you can use either your custom provider key, or your HF token, then billing will be handled directly on your HF account, as a way to centralize all expenses.

💸 Also, PRO users get 2$ inference credits per month!

Read more in the announcement 👉 https://huggingface.co/blog/inference-providers

1 reply

reacted to odellus's post with 🧠 2 days ago

Post

1467

Tired: shitposting on bsky
Wired: shitposting on hf

1 reply

reacted to chansung's post with 👍 2 days ago

Post

1837

Simple summary on DeepSeek AI's Janus-Pro: A fresh take on multimodal AI!

It builds on its predecessor, Janus, by tweaking the training methodology rather than the model architecture. The result? Improved performance in understanding and generating multimodal data.

Janus-Pro uses a three-stage training strategy, similar to Janus, but with key modifications:
✦ Stage 1 & 2: Focus on separate training for specific objectives, rather than mixing data.
✦ Stage 3: Fine-tuning with a careful balance of multimodal data.

Benchmarks show Janus-Pro holds its own against specialized models like TokenFlow XL and MetaMorph, and other multimodal models like SD3 Medium and DALL-E 3.

The main limitation? Low image resolution (384x384). However, this seems like a strategic choice to focus on establishing a solid "recipe" for multimodal models. Future work will likely leverage this recipe and increased computing power to achieve higher resolutions.

reacted to fdaudens's post with 👍🚀 2 days ago

Post

1574

🚀 The open source community is unstoppable: 4M total downloads for DeepSeek models on Hugging Face, with 3.2M coming from the +600 models created by the community.

That's 30% more than yesterday!

1 reply

reacted to cfahlgren1's post with ❤️ 2 days ago

Post

1631

If you haven't seen yet, we just released Inference Providers 🔀

> 4 new serverless inference providers on the Hub 🤯
> Use your HF API key or personal key with all providers 🔑
> Chat with Deepseek R1, V3, and more on HF Hub 🐋
> We support Sambanova, TogetherAI, Replicate, and Fal.ai 💪

Best of all, we don't charge any markup on top of the provider 🫰 Have you tried it out yet? HF Pro accounts get $2 of free usage for the provider inference.

upvoted an article 2 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

3 days ago

• 460