Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
18
93
Jia-Ying Lin
linekin
Follow
21world's profile picture
1 follower
ยท
3 following
AI & ML interests
None yet
Recent Activity
liked
a model
17 days ago
AdaptLLM/Adapt-MLLM-to-Domains
reacted
to
m-ric
's
post
with ๐
29 days ago
After 6 years, BERT, the workhorse of encoder models, finally gets a replacement: ๐ช๐ฒ๐น๐ฐ๐ผ๐บ๐ฒ ๐ ๐ผ๐ฑ๐ฒ๐ฟ๐ป๐๐๐ฅ๐ง! ๐ค We talk a lot about โจGenerative AIโจ, meaning "Decoder version of the Transformers architecture", but this is only one of the ways to build LLMs: encoder models, that turn a sentence in a vector, are maybe even more widely used in industry than generative models. The workhorse for this category has been BERT since its release in 2018 (that's prehistory for LLMs). It's not a fancy 100B parameters supermodel (just a few hundred millions), but it's an excellent workhorse, kind of a Honda Civic for LLMs. Many applications use BERT-family models - the top models in this category cumulate millions of downloads on the Hub. โก๏ธ Now a collaboration between Answer.AI and LightOn just introduced BERT's replacement: ModernBERT. ๐ง๐;๐๐ฅ: ๐๏ธ Architecture changes: โ First, standard modernizations: - Rotary positional embeddings (RoPE) - Replace GeLU with GeGLU, - Use Flash Attention 2 โจ The team also introduced innovative techniques like alternating attention instead of full attention, and sequence packing to get rid of padding overhead. ๐ฅ As a result, the model tops the game of encoder models: It beats previous standard DeBERTaV3 for 1/5th the memory footprint, and runs 4x faster! Read the blog post ๐ https://huggingface.co/blog/modernbert
reacted
to
m-ric
's
post
with โค๏ธ
29 days ago
Since I published it on GitHub a few days ago, Hugging Face's new agentic library ๐๐บ๐ผ๐น๐ฎ๐ด๐ฒ๐ป๐๐ has gathered nearly 4k stars ๐คฏ โก๏ธ But we are just getting started on agents: so we are hiring an ML Engineer to join me and double down on this effort! The plan is to build GUI agents: agents that can act on your computer with mouse & keyboard, like Claude Computer Use. We will make it work better, and fully open. โจ Sounds like something you'd like to do? Apply here ๐ https://apply.workable.com/huggingface/j/AF1D4E3FEB/
View all activity
Organizations
None yet
linekin
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
17 days ago
AdaptLLM/Adapt-MLLM-to-Domains
Updated
Dec 14, 2024
โข
10
liked
2 models
29 days ago
NousResearch/Hermes-3-Llama-3.2-3B-GGUF
Updated
Dec 18, 2024
โข
27.3k
โข
38
NousResearch/Hermes-3-Llama-3.1-8B
Text Generation
โข
Updated
Sep 8, 2024
โข
75.5k
โข
โข
289
liked
a model
30 days ago
microsoft/phi-4
Text Generation
โข
Updated
3 days ago
โข
506k
โข
1.69k
liked
a model
about 1 month ago
deepseek-ai/DeepSeek-V3
Text Generation
โข
Updated
14 days ago
โข
1.11M
โข
โข
3.25k
liked
a dataset
about 2 months ago
taide/taide-bench
Viewer
โข
Updated
Apr 12, 2024
โข
500
โข
92
โข
14
liked
a model
about 2 months ago
google/gemma-2-9b
Text Generation
โข
Updated
Aug 7, 2024
โข
82.8k
โข
638
liked
a model
2 months ago
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
โข
Updated
Dec 2, 2024
โข
97.6k
โข
371
liked
a model
3 months ago
vikhyatk/moondream2
Image-Text-to-Text
โข
Updated
29 days ago
โข
153k
โข
1.02k
liked
a dataset
3 months ago
aigrant/awesome-taiwan-knowledge
Preview
โข
Updated
16 days ago
โข
89
โข
16
liked
a model
3 months ago
thenlper/gte-large
Sentence Similarity
โข
Updated
Nov 15, 2024
โข
571k
โข
264
liked
2 models
4 months ago
neulab/Pangea-7B
Updated
Oct 24, 2024
โข
16.3k
โข
124
facebook/bart-large-mnli
Zero-Shot Classification
โข
Updated
Sep 5, 2023
โข
3.1M
โข
โข
1.29k
liked
5 models
5 months ago
meetkai/functionary-small-v3.2
Updated
Sep 25, 2024
โข
4.29k
โข
33
meetkai/functionary-medium-v3.1
Updated
Sep 25, 2024
โข
181
โข
56
Team-ACE/ToolACE-8B
Updated
Oct 22, 2024
โข
13.8k
โข
46
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
โข
Updated
1 day ago
โข
1.76M
โข
1.11k
poloclub/UniTable
Updated
Apr 2, 2024
โข
23
liked
a dataset
5 months ago
gyr66/privacy_detection
Viewer
โข
Updated
Oct 17, 2023
โข
2.52k
โข
43
โข
3
liked
a model
5 months ago
Alibaba-NLP/gte-multilingual-base
Sentence Similarity
โข
Updated
29 days ago
โข
829k
โข
185
Load more