genbio-ai/AIDO.RNA-650M

DianLiI committed on Nov 27, 2024

Commit cd5f23a · verified · 1 Parent(s): aa383a0

Update README.md

Files changed (1)

README.md +47 -1

@@ -1,3 +1,49 @@
 ## AIDO.RNA 650M
-AIDO.RNA 650M is an RNA foundation model trained on 42 million non-coding RNA sequences at single-nucleotide resolution.
+AIDO.RNA 650M is an RNA foundation model trained on 42 million non-coding RNA sequences at single-nucleotide resolution.
+## How to Use
+### Build any downstream models from this backbone
+#### Embedding
+```python
+from genbio_finetune.tasks import Embed
+model = Embed.from_config({"model.backbone": "rnafm_650m_cds"}).eval()
+collated_batch = model.collate({"sequences": ["ACGT", "AGCT"]})
+embedding = model(collated_batch)
+print(embedding.shape)
+print(embedding)
+```
+#### Sequence Level Classification
+```python
+import torch
+from genbio_finetune.tasks import SequenceClassification
+model = SequenceClassification.from_config({"model.backbone": "rnafm_650m_cds", "model.n_classes": 2}).eval()
+collated_batch = model.collate({"sequences": ["ACGT", "AGCT"]})
+logits = model(collated_batch)
+print(logits)
+print(torch.argmax(logits, dim=-1))
+```
+#### Token Level Classification
+```python
+import torch
+from genbio_finetune.tasks import TokenClassification
+model = TokenClassification.from_config({"model.backbone": "rnafm_650m_cds", "model.n_classes": 3}).eval()
+collated_batch = model.collate({"sequences": ["ACGT", "AGCT"]})
+logits = model(collated_batch)
+print(logits)
+print(torch.argmax(logits, dim=-1))
+```
+#### Regression
+```python
+from genbio_finetune.tasks import SequenceRegression
+model = SequenceRegression.from_config({"model.backbone": "rnafm_650m_cds"}).eval()
+collated_batch = model.collate({"sequences": ["ACGT", "AGCT"]})
+logits = model(collated_batch)
+print(logits)
+```
+#### Or use our one-liner CLI to finetune or evaluate any of the above!
+```
+gbft fit --model SequenceClassification --model.backbone rnafm_650m_cds --data SequenceClassification --data.path <hf_or_local_path_to_your_dataset>
+gbft test --model SequenceClassification --model.backbone rnafm_650m_cds --data SequenceClassification --data.path <hf_or_local_path_to_your_dataset>
+```
+For more information, visit: [Model Generator](https://github.com/genbio-ai/modelgenerator)
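
A reading aid for the classification snippets added in this commit: both end with `torch.argmax(logits, dim=-1)`, which picks the index of the largest logit along the last (class) axis. The sketch below reproduces that step in plain Python on made-up numbers, so it runs without `genbio_finetune` or a model download. The logit values, the `argmax_last_dim` helper, and the assumed output shapes (`[batch, n_classes]` for sequence-level, `[batch, tokens, n_classes]` for token-level) are illustrative assumptions, not outputs or APIs of the library.

```python
def argmax_last_dim(values):
    """Index of the largest entry along the innermost axis of a nested list.

    Hypothetical helper for illustration; it mirrors what
    torch.argmax(x, dim=-1) computes on a tensor of the same shape.
    """
    if not isinstance(values[0], list):
        # Innermost axis reached: return the position of the maximum value.
        return max(range(len(values)), key=values.__getitem__)
    # Otherwise recurse into each sub-list, preserving the outer structure.
    return [argmax_last_dim(v) for v in values]


# Made-up sequence-level logits: 2 sequences x 2 classes (assumed shape).
sequence_logits = [[0.2, 1.3], [0.9, -0.4]]
sequence_labels = argmax_last_dim(sequence_logits)
print(sequence_labels)  # [1, 0]

# Made-up token-level logits: 2 sequences x 4 tokens x 3 classes (assumed shape).
token_logits = [
    [[0.1, 0.5, -1.0], [2.0, 0.0, 0.3], [0.0, 0.1, 0.9], [1.1, 1.0, 0.2]],
    [[-0.3, -0.2, -0.1], [0.4, 0.6, 0.5], [3.0, 1.0, 2.0], [0.0, 0.7, 0.1]],
]
token_labels = argmax_last_dim(token_logits)
print(token_labels)  # [[1, 0, 2, 0], [2, 1, 0, 1]]
```

With sequence-level logits the result is one class index per input sequence; with token-level logits it is one class index per token. Before finetuning, the task head is presumably untrained, so predictions from the README snippets should not be read as meaningful labels.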