Spaces:
Sleeping
Sleeping
A newer version of the Gradio SDK is available:
5.44.1
metadata
title: GeneForgeLang
emoji: π§¬
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 3.50.2
app_file: app.py
pinned: true
𧬠GeneForgeLang: Symbolic-to-Sequence & Cross-Modality Biomolecular Design Toolkit
GeneForgeLang is a symbolic and generative language for cross-modality biomolecular design.
It enables unified AI-powered workflows to design, interpret and translate DNA, RNA, and protein sequences using a compact, human-readable grammar.
This project provides:
- A symbolic language spanning all biological layers (genomic, transcriptomic, proteomic)
- Realistic sequence generation via AI models like ProtGPT2
- Scientific interpretation of symbolic phrases in natural language
- Cross-modality transcoders (e.g., DNA β RNA β Protein and vice versa)
- An interactive Gradio-based UI for easy use and integration
π Key Features
Module | Description |
---|---|
π§ Phrase β Sequence | Generate DNA, RNA, or protein from symbolic design |
π Transcode Phrases | Translate GeneForgeLang phrases across modalities |
π Phrase β Description | Generate scientific English descriptions of symbolic inputs |
π Sequence β Phrase | Infer functional phrases from real sequences |
𧬠Mutate Sequence (WIP) | Generate variants for symbolic seeds (under development) |
π¦ Export to FASTA (WIP) | Save generated sequences to .fasta (to be implemented) |
π Analyze Sequence (WIP) | Visualize amino acid composition or base content |
π§ͺ Example Input Phrases
~d:Prom[TATA]-Exon1-Intr1-Exon2
β
:r:Cap5'-Ex1-Ex2-UTR3'
β
^p:Dom(Kin)-Mot(NLS)*AcK@147
βΆοΈ How to Use Locally
- Clone this repo:
git clone https://github.com/Fundacion-de-Neurociencias/GeneForgeLang.git
cd GeneForgeLang
- Install dependencies:
pip install -r requirements.txt
- Launch the interface:
python app.py
- Navigate to:
http://127.0.0.1:7860
π File Structure
File | Description |
---|---|
app.py |
Full Gradio app (4 tabs) |
semillas.json |
Phrase-to-seed dictionary |
generate_from_phrase.py |
Symbolic-to-sequence generator |
describe_phrase.py |
Phrase interpreter to scientific English |
translate_to_geneforgelang.py |
Sequence-to-symbolic phrase translation |
transcoder.py |
Modality switcher (DNA β RNA β Protein) |
requirements.txt |
Python dependencies |
README.md |
This file |
π§ Developed by
FundaciΓ³n de Neurociencias
Licensed under the MIT License
Join us in shaping the future of symbolic bio-AI. Contributions welcome!