Tom Aarsen

tomaarsen

https://linkedin.com/in/tomaarsen

AI & ML interests

NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification

Recent Activity

new activity about 4 hours ago

jxm/cde-small-v2:Clean up README slightly

new activity about 6 hours ago

Alibaba-NLP/gte-modernbert-base:Entering on MTEB

new activity about 6 hours ago

Alibaba-NLP/gte-modernbert-base:NaN values when input is longer than context window?

Articles

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22, 2024

• 70

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

• 73

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Dec 6, 2023

• 6

🕳️ Attention Sinks in LLMs for endless fluency

Oct 9, 2023

• 7

Organizations

tomaarsen's activity

upvoted 2 articles about 18 hours ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

1 day ago

• 19

Article

State of open video generation models in Diffusers

4 days ago

• 23

upvoted 2 articles 1 day ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

3 days ago

• 460

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

2 days ago

• 13

upvoted a paper 2 days ago

SPLADE-v3: New baselines for SPLADE

Paper • 2403.06789 • Published Mar 11, 2024 • 2

upvoted an article 2 days ago

Article

Welcome to Inference Providers on the Hub 🔥

3 days ago

• 171

upvoted 2 collections 7 days ago

Logical GLiNER

Collection

3 items • Updated Dec 27, 2024 • 2

Language Detection

Collection

StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText • 2 items • Updated 4 days ago • 3

upvoted 3 articles 8 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

8 days ago

• 95

Article

Mastering Long Contexts in LLMs with KVPress

8 days ago

• 58

Article

Hugging Face and FriendliAI partner to supercharge model deployment on the Hub

9 days ago

• 29

upvoted an article 10 days ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

10 days ago

• 28

upvoted a paper 10 days ago

CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval

Paper • 2411.12644 • Published Nov 19, 2024 • 3

upvoted an article 14 days ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

15 days ago

• 37

upvoted an article 15 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

16 days ago

• 129

upvoted an article 17 days ago

Article

Python Is All You Need? Introducing Dria-Agent-α

20 days ago

• 22

upvoted a paper 18 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 59

upvoted 2 papers 23 days ago

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8, 2024 • 17

Fietje: An open, efficient LLM for Dutch

Paper • 2412.15450 • Published Dec 19, 2024 • 4

upvoted an article 24 days ago

Article

Announcing NVIDIA Cosmos World Foundation Models

24 days ago

• 23

Articles

Train 400x faster Static Embedding Models with Sentence Transformers

Finally, a Replacement for BERT: Introducing ModernBERT

Welcome Gemma 2 - Google's new open LLM

Training and Finetuning Embedding Models with Sentence Transformers v3

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon
