ManMenGon committed on
Commit 6ef5528 · verified · 1 Parent(s): 55868b6

Upload 2 files

Files changed (2)
  1. README.md +38 -48
  2. requirements.txt +0 -0
README.md CHANGED
@@ -6,86 +6,76 @@ colorTo: blue
6
  sdk: gradio
7
  sdk_version: "3.50.2"
8
  app_file: app.py
9
- pinned: false
10
  ---
11
 
 
12
 
13
- # 🧬 GeneForgeLang: Symbolic-to-Sequence Protein Design Toolkit
14
 
15
- GeneForgeLang is a symbolic language and toolset for generative biology. It connects high-level biological design intentions to low-level amino acid sequences via AI, rules, and natural language.
 
 
 
 
16
 
17
  ---
18
 
19
  ## 🚀 Features
20
 
21
- | Module | Description |
22
- |--------------------------|-------------|
23
- | 🧠 Phrase → Protein | Generate realistic protein sequences from symbolic phrases |
24
- | 🧪 Protein → Phrase | Infer functional motifs from amino acid sequences |
25
- | 📖 Phrase → Description | Translate symbolic design into scientific English |
26
- | 🧬 Mutate Protein | Generate variants of proteins from the same symbolic seed |
27
- | 📦 Export to FASTA | Download generated proteins for downstream use |
28
- | 📊 Analyze Protein | Visualize amino acid composition as bar plot |
29
- | 📚 Symbolic Language | GeneForgeLang syntax allows structured protein definitions |
30
 
31
  ---
32
 
33
- ## 🧪 Example
34
 
35
- ### Input Phrase
36
 
37
  ```
38
- ^p:Dom(Kin)-Mot(NLS)*AcK@147=Localize(Nucleus)
 
 
39
  ```
40
 
41
- ### Output
42
- - Seed: `MKKK`
43
- - Generated protein: realistic sequence (via ProtGPT2)
44
- - Properties: length, charge, MW
45
- - Description: *“This protein contains a kinase domain, a nuclear localization signal, and lysine acetylation at a specific position.”*
46
- - Export: `.fasta` format
47
- - Graph: bar plot of amino acid composition
48
-
49
- ---
50
-
51
- ## ▶️ How to Use
52
-
53
- 1. Clone this repo
54
- 2. Install dependencies:
55
 
56
  ```
57
- pip install -r requirements.txt
 
 
58
  ```
59
 
60
- 3. Launch the interface:
61
-
62
- ```
63
- python app.py
64
- ```
65
 
66
- 4. Navigate to:
67
 
68
- ```
69
- http://127.0.0.1:7860
70
- ```
71
 
72
  ---
73
 
74
- ## 📁 Repository Structure
75
 
76
- | File | Description |
77
- |----------------------------|---------------------------------------------|
78
- | `app.py` | Main UI app with all functionality |
79
- | `semillas.json` | Phrase-to-seed dictionary |
80
- | `translate_to_geneforgelang.py` | Reverse translator |
81
- | `requirements.txt` | Python dependencies |
82
- | `README.md` | This file |
83
 
84
  ---
85
 
86
  ## 🧠 Developed by
87
 
88
  **Fundación de Neurociencias**
89
- Licensed under the MIT License
90
 
91
  Join us in shaping symbolic bio-AI.
 
6
  sdk: gradio
7
  sdk_version: "3.50.2"
8
  app_file: app.py
9
+ pinned: true
10
  ---
11
 
12
+ # 🧬 GeneForgeLang: Symbolic-to-Sequence & Cross-Modality Biomolecular Design Toolkit
13
 
14
+ **GeneForgeLang** is a symbolic, generative language that lets scientists design and interpret DNA, RNA, and protein sequences with a unified syntax and AI support.
15
 
16
+ This toolkit enables:
17
+ - Generation of realistic proteins from symbolic design
18
+ - Translation of symbolic phrases across DNA ↔ RNA ↔ Protein
19
+ - Structured, human-readable and AI-trainable syntax
20
+ - Semantic equivalence across molecular layers
21
 
22
  ---
23
 
24
  ## 🚀 Features
25
 
26
+ | Module | Description |
27
+ |----------------------------|-------------|
28
+ | 🧠 Phrase → Protein | Generate realistic protein sequences from symbolic phrases |
29
+ | 🔁 Transcode Across Molecules | Translate GeneForgeLang phrases between DNA, RNA, and Protein |
30
+ | 📚 Universal Grammar | One structure to rule them all: motifs, domains, PTMs, splicing |
31
+ | 🧬 Compact Notation | Prefixes, accents, and structural markers for efficiency |
32
+ | 🧠 AI-Ready Output | Compatible with transformer-based models like ProtGPT2 |
 
 
33
 
34
  ---
35
 
36
+ ## 🧪 Example Input Phrases
37
 
38
+ ### DNA → RNA
39
 
40
  ```
41
+ ~d:Prom[TATA]-Exon1-Intr1-Exon2
42
+
43
+ :r:Cap5'-Ex1-Ex2-UTR3'
44
  ```
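The DNA → RNA example above can be read as a splicing step: the promoter and introns are dropped, exons are abbreviated, and cap and UTR markers are added. The sketch below reproduces only that one example. The function name `dna_to_rna_phrase` and its rules are assumptions made for illustration, not the actual logic of `transcoder.py`, which this diff does not show.

```python
import re


def dna_to_rna_phrase(dna_phrase: str) -> str:
    """Hypothetical transcoder: build a spliced RNA phrase from a DNA phrase.

    Assumed rules (inferred from the README example only):
      - DNA phrases start with '~d:' and separate elements with '-'
      - 'Exon<n>' becomes 'Ex<n>'; every other element (promoter, introns) is dropped
      - the RNA phrase starts with ':r:' and is wrapped in Cap5' ... UTR3'
    """
    prefix = "~d:"
    if not dna_phrase.startswith(prefix):
        raise ValueError("expected a DNA phrase starting with '~d:'")

    exons = []
    for element in dna_phrase[len(prefix):].split("-"):
        match = re.fullmatch(r"Exon(\d+)", element)
        if match:
            exons.append(f"Ex{match.group(1)}")

    return ":r:" + "-".join(["Cap5'", *exons, "UTR3'"])


print(dna_to_rna_phrase("~d:Prom[TATA]-Exon1-Intr1-Exon2"))
# prints: :r:Cap5'-Ex1-Ex2-UTR3'
```

Splitting on `-` is enough here because none of the example elements contain a hyphen; a fuller grammar would need a real tokenizer.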
45
 
46
+ ### RNA → Protein
 
 
 
 
 
 
 
 
 
 
 
 
 
47
 
48
  ```
49
+ :r:Ex1-Ex2
50
+
51
+ ^p:Dom(Kin)-Mot(NLS)
52
  ```
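To make the protein notation above concrete, here is a minimal parsing sketch that splits a phrase such as `^p:Dom(Kin)-Mot(NLS)` into (kind, value) pairs. The helper `parse_protein_phrase` and its regular expression are hypothetical; they cover only the `Kind(Value)` elements shown in this example and not modifiers like `*AcK@147` or `=Localize(Nucleus)` from the earlier README.

```python
import re
from typing import List, Tuple

# Assumed element shape: an alphabetic kind followed by a parenthesised value,
# e.g. 'Dom(Kin)' or 'Mot(NLS)'.
ELEMENT = re.compile(r"(?P<kind>[A-Za-z]+)\((?P<value>[^()]*)\)")


def parse_protein_phrase(phrase: str) -> List[Tuple[str, str]]:
    """Hypothetical parser for '^p:'-prefixed phrases with '-'-separated elements."""
    prefix = "^p:"
    if not phrase.startswith(prefix):
        raise ValueError("expected a protein phrase starting with '^p:'")

    parsed = []
    for element in phrase[len(prefix):].split("-"):
        match = ELEMENT.fullmatch(element)
        if match is None:
            raise ValueError(f"unrecognized element: {element!r}")
        parsed.append((match.group("kind"), match.group("value")))
    return parsed


print(parse_protein_phrase("^p:Dom(Kin)-Mot(NLS)"))
# prints: [('Dom', 'Kin'), ('Mot', 'NLS')]
```

Rejecting unknown elements with an error, rather than skipping them, keeps unsupported syntax visible instead of silently changing the design.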
53
 
54
+ ---
 
 
 
 
55
 
56
+ ## ▶️ How to Use
57
 
58
+ 1. Launch this app on Hugging Face or locally
59
+ 2. Choose your input phrase and source/target modality
60
+ 3. Get your translated output or generated protein
61
 
62
  ---
63
 
64
+ ## 📁 Files
65
 
66
+ | File | Description |
67
+ |-----------------------|-------------|
68
+ | `app.py` | Full Gradio app (all tabs) |
69
+ | `semillas.json` | Seed dictionary |
70
+ | `transcoder.py` | Script for DNA/RNA/protein conversion |
71
+ | `requirements.txt` | Python dependencies |
72
+ | `README.md` | This file |
73
 
74
  ---
75
 
76
  ## 🧠 Developed by
77
 
78
  **Fundación de Neurociencias**
79
+ MIT License
80
 
81
  Join us in shaping symbolic bio-AI.
requirements.txt CHANGED
Binary files a/requirements.txt and b/requirements.txt differ