sardinelab
/

SparseModernBERT-alpha1.5

Model card Files Files and versions

mtreviso commited on Jun 27

Commit

eb62600

·

verified ·

1 Parent(s): 6439868

Update README.md

Files changed (1) hide show

README.md +3 -4

README.md CHANGED Viewed

@@ -7,12 +7,11 @@ datasets:
 # SparseModernBERT α=1.5 Model Card
-Models from AdaSplash. Check the original codebase [here](https://github.com/deep-spin/SparseModernBERT).
 ## Model Overview
-SparseModernBERT-α1.5 is a masked language model based on [ModernBERT](https://github.com/AnswerDotAI/ModernBERT) that replaces the standard softmax attention with an adaptive sparse attention mechanism (AdaSplash) using Triton. The sparsity parameter α = 1.5 yields moderately sparse attention patterns, improving efficiency while maintaining performance.
 **Key features:**

 # SparseModernBERT α=1.5 Model Card
 ## Model Overview
+SparseModernBERT-alpha1.5 is a masked language model based on [ModernBERT](https://github.com/AnswerDotAI/ModernBERT) that replaces the standard softmax attention with an adaptive sparse attention mechanism (AdaSplash) using Triton.
+The sparsity parameter α = 1.5 yields moderately sparse attention patterns, improving efficiency while maintaining performance.
 **Key features:**