Upload 2 files
- README.md +38 -48
- requirements.txt +0 -0
README.md
CHANGED
@@ -6,86 +6,76 @@ colorTo: blue
 sdk: gradio
 sdk_version: "3.50.2"
 app_file: app.py
-pinned:
+pinned: true
 ---
 
+# 🧬 GeneForgeLang: Symbolic-to-Sequence & Cross-Modality Biomolecular Design Toolkit
 
-
+**GeneForgeLang** is a symbolic, generative language that allows scientists to design and interpret DNA, RNA, and protein sequences with unified syntax and AI support.
 
-
+This toolkit enables:
+- Generation of realistic proteins from symbolic design
+- Translation of symbolic phrases across DNA ↔ RNA ↔ Protein
+- Structured, human-readable and AI-trainable syntax
+- Semantic equivalence across molecular layers
 
 ---
 
 ## 🚀 Features
 
-| Module
-
-| 🧠 Phrase → Protein
-
-
-| 🧬
-
-| 📊 Analyze Protein | Visualize amino acid composition as bar plot |
-| 📚 Symbolic Language | GeneForgeLang syntax allows structured protein definitions |
+| Module | Description |
+|----------------------------|-------------|
+| 🧠 Phrase → Protein | Generate realistic protein sequences from symbolic phrases |
+| 🔁 Transcode Across Molecules | Translate GeneForgeLang phrases between DNA, RNA, and Protein |
+| 📚 Universal Grammar | One structure to rule them all: motifs, domains, PTMs, splicing |
+| 🧬 Compact Notation | Prefixes, accents, and structural markers for efficiency |
+| 🧠 AI-Ready Output | Compatible with transformer-based models like ProtGPT2 |
 
 ---
 
-## 🧪 Example
+## 🧪 Example Input Phrases
 
-###
+### DNA → RNA
 
 ```
-
+~d:Prom[TATA]-Exon1-Intr1-Exon2
+↓
+:r:Cap5'-Ex1-Ex2-UTR3'
 ```
 
-###
-- Seed: `MKKK`
-- Generated protein: realistic sequence (via ProtGPT2)
-- Properties: length, charge, MW
-- Description: *“This protein contains a kinase domain, a nuclear localization signal, and lysine acetylation at a specific position.”*
-- Export: `.fasta` format
-- Graph: bar plot of amino acid composition
-
----
-
-## ▶️ How to Use
-
-1. Clone this repo
-2. Install dependencies:
+### RNA → Protein
 
 ```
-
+:r:Ex1-Ex2
+↓
+^p:Dom(Kin)-Mot(NLS)
 ```
 
-
-
-```
-python app.py
-```
+---
 
-
+## ▶️ How to Use
 
-
-
-
+1. Launch this app on Hugging Face or locally
+2. Choose your input phrase and source/target modality
+3. Get your translated output or generated protein
 
 ---
 
-## 📁
+## 📁 Files
 
-| File
-
-| `app.py`
-| `semillas.json`
-| `
-| `requirements.txt`
-| `README.md`
+| File | Description |
+|-----------------------|-------------|
+| `app.py` | Full Gradio app (all tabs) |
+| `semillas.json` | Seed dictionary |
+| `transcoder.py` | Script for DNA/RNA/protein conversion |
+| `requirements.txt` | Python dependencies |
+| `README.md` | This file |
 
 ---
 
 ## 🧠 Developed by
 
 **Fundación de Neurociencias**
-
+MIT License
 
 Join us in shaping symbolic bio-AI.
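Note on the example phrases: the three phrases in the README (`~d:Prom[TATA]-Exon1-Intr1-Exon2`, `:r:Cap5'-Ex1-Ex2-UTR3'`, `^p:Dom(Kin)-Mot(NLS)`) appear to share a "marker, modality letter, colon, dash-separated components" shape. The sketch below shows how such a phrase might be split and how the DNA → RNA example could be reproduced. Everything in it is an assumption inferred from those examples alone: `parse_phrase`, `dna_to_rna`, the `Phrase` class, and the rewrite rule are hypothetical and are not taken from `transcoder.py`, which this commit view does not show.

```python
import re
from dataclasses import dataclass
from typing import List

# Assumption: a phrase is "<marker><modality>:<components>", where the marker is
# one of "~", ":" or "^" and the modality is "d" (DNA), "r" (RNA) or "p" (protein).
# This grammar is inferred from the README examples only.
PHRASE_RE = re.compile(r"^(?P<marker>[~:^])(?P<modality>[drp]):(?P<body>.+)$")


@dataclass
class Phrase:
    marker: str
    modality: str
    components: List[str]


def parse_phrase(text: str) -> Phrase:
    """Split a phrase into marker, modality letter, and '-'-separated components."""
    match = PHRASE_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a recognised phrase: {text!r}")
    return Phrase(match["marker"], match["modality"], match["body"].split("-"))


def dna_to_rna(phrase: Phrase) -> str:
    """Toy DNA -> RNA rule (hypothetical): keep exons, drop promoter and introns,
    then add a 5' cap and a 3' UTR, mirroring the single README example."""
    if phrase.modality != "d":
        raise ValueError("expected a DNA phrase")
    exons = [c.replace("Exon", "Ex", 1) for c in phrase.components if c.startswith("Exon")]
    return ":r:" + "-".join(["Cap5'", *exons, "UTR3'"])
```

Under these assumptions, `dna_to_rna(parse_phrase("~d:Prom[TATA]-Exon1-Intr1-Exon2"))` returns `:r:Cap5'-Ex1-Ex2-UTR3'`, which matches the DNA → RNA example in the README. The real grammar may include further markers (the README mentions prefixes, accents, and structural markers) that this sketch does not handle.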
requirements.txt
CHANGED
Binary files a/requirements.txt and b/requirements.txt differ