Commit cd63253 by zhongwang · verified · 1 parent: 2aa2aa8

Update README.md

Files changed (1)
  1. README.md +6 -0
README.md CHANGED
@@ -1,5 +1,11 @@
  ---
  license: bsd
+ tags:
+ - biology
+ - genomics
+ - metagenomics
+ - DNA
+ - microbiome
  ---
 
  This is the base model of GenomeOcean-4B. It is trained with Causal Language Modeling (CLM) and uses a BPE tokenizer with 4096 tokens. It supports a maximum sequence length of 10240 tokens (~50kbp).
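
The README line relates two numbers: 10240 tokens covering roughly 50 kbp, which implies an average of about 4.9 bp per BPE token. The sketch below uses that ratio to estimate how many tokens a sequence might need and to split a long sequence into windows that should fit the context. The helper names `estimate_tokens` and `split_for_context` are hypothetical, and the flat bp-per-token ratio is an assumption for illustration; actual token counts depend on the real tokenizer and on the sequence itself.

```python
import math

MAX_TOKENS = 10240   # maximum sequence length stated in the README
APPROX_BP = 50_000   # "~50kbp" from the README (an approximation)

# Assumed average compression ratio, about 4.88 bp per token.
BP_PER_TOKEN = APPROX_BP / MAX_TOKENS


def estimate_tokens(num_bp: int) -> int:
    """Rough token estimate for a DNA sequence of num_bp base pairs."""
    return math.ceil(num_bp / BP_PER_TOKEN)


def split_for_context(seq: str, max_bp: int = APPROX_BP) -> list[str]:
    """Split seq into consecutive windows of at most max_bp bases.

    This is a plain, non-overlapping split; it does not call the tokenizer,
    so a window may still exceed MAX_TOKENS if it compresses poorly.
    """
    return [seq[i:i + max_bp] for i in range(0, len(seq), max_bp)]
```

In practice one would tokenize each window with the model's own tokenizer and check its length against the 10240-token limit rather than rely on the average ratio.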