Text-to-Image
Transformers
English
Lod34 commited on
Commit
7f8f057
·
verified ·
1 Parent(s): 9391509

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -27
README.md CHANGED
@@ -1,23 +1,8 @@
1
- ---
2
- license: mit
3
- datasets:
4
- - pawkanarek/spraix_1024
5
- language:
6
- - en
7
- base_model:
8
- - google-bert/bert-base-uncased
9
- pipeline_tag: text-to-image
10
- tags:
11
- - animation
12
- - character
13
- - sprite
14
- - game
15
- ---
16
- # Animator2D
17
 
18
- Animator2D is an artificial intelligence model designed to generate pixel-art sprite animations based on textual descriptions. The model uses a BERT-based text encoder to extract textual features and a convolutional generative network to create animated sprites.
19
 
20
- ## Model Description
21
 
22
  - **Name:** Animator2D
23
  - **Input:**
@@ -25,18 +10,33 @@ Animator2D is an artificial intelligence model designed to generate pixel-art sp
25
  - Number of animation frames
26
  - Character action
27
  - Viewing direction
28
- - **Output:** Animated sprite in image format
 
 
 
 
29
 
30
- ## Dataset
31
 
32
- The model was trained using the [spraix\_1024](https://huggingface.co/datasets/pawkanarek/spraix_1024) dataset, which contains animated sprites with detailed textual descriptions.
33
 
34
- ## Future Goals
 
 
 
 
 
 
35
 
36
- This is only the first version of the model. In the future, we aim to improve it with the following updates:
37
 
38
- - **Expand output formats:** Currently, the model generates a single frame sheet. We plan to implement the ability to export output in multiple formats, including folders containing separate images, animated GIFs, and videos.
39
- - **Optimize frame management:** The current frame count is manually defined, but we aim to improve control by introducing a more intuitive system that considers factors such as FPS and the actual animation duration.
40
- - **Enhance the model:** The current model is still in an early stage. Future updates will focus on making sprite generation more precise and consistent by improving architecture and training data quality.
41
- - **Customization of sprite dimensions:** We will implement an input that allows specifying the character's height in pixels. This will enable adaptation of the generated sprite's graphical style, ensuring greater flexibility and customization possibilities (e.g., Pokémon style vs. Metal Slug style).
 
 
 
 
42
 
 
 
1
+ # 🎨 Animator2D
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
+ Animator2D is an AI-powered model designed to generate pixel-art sprite animations from textual descriptions. This model leverages a BERT-based text encoder to extract textual features and a convolutional generative network to create animated sprites. The goal is to provide game developers and artists with a tool that can bring character concepts to life with minimal effort.
4
 
5
+ ## 🛠️ Model Overview
6
 
7
  - **Name:** Animator2D
8
  - **Input:**
 
10
  - Number of animation frames
11
  - Character action
12
  - Viewing direction
13
+ - **Output:** Animated sprite sheet in image format
14
+
15
+ ## 📦 Dataset
16
+
17
+ The model was trained using the [spraix\_1024](https://huggingface.co/datasets/pawkanarek/spraix_1024) dataset, which contains animated sprites with detailed textual descriptions. This dataset serves as a foundation for training the model to generate high-quality, relevant sprites based on textual inputs.
18
 
19
+ ## 🚀 Model Versions
20
 
21
+ Over time, several iterations of Animator2D have been developed, each improving on the previous version with different training strategies and hyperparameters. Below is a chronological overview of the versions created so far:
22
 
23
+ | Model Version | Description |
24
+ |----------------------|-------------|
25
+ | **Animator2D-v1** | The first full version developed in this project, utilizing a structured training approach with BERT for text encoding and a convolutional generator for sprite creation. |
26
+ | **Animator2D-mini-10e** | A simplified version trained with only 10 epochs, batch size of 8, learning rate of 1e-4, and image size of 64x64. |
27
+ | **Animator2D-mini-100e** | An extension of the mini-10e version, trained for 100 epochs for improved performance. |
28
+ | **Animator2D-mini-250e** | A more refined version with 250 epochs, batch size increased to 16, learning rate of 2e-4, and image resolution of 128x128. |
29
+ | **Animator2D-v2 (In Development)** | A new version being built from scratch with an entirely redesigned training process, aiming for better animation quality and efficiency. |
30
 
31
+ ## 🔮 Future Goals
32
 
33
+ This is just the first iteration of Animator2D. Future updates will focus on refining and expanding its capabilities:
34
+
35
+ - **Multiple Output Formats**: Currently, the model generates a single sprite sheet. Future updates will enable exporting animations in various formats, including folders with individual frames, GIFs, and videos.
36
+ - **Frame Input Optimization**: The number of frames is currently manually defined. Improvements will include a more intuitive system that considers FPS and actual animation duration.
37
+ - **Model Refinement**: The current model is in an early stage. Future improvements will enhance sprite generation consistency and quality by optimizing the architecture and training dataset.
38
+ - **Sprite Size Customization**: A new input will allow users to specify the character height in pixels, dynamically adjusting the sprite’s artistic style. This will ensure greater flexibility, allowing for different art styles (e.g., Pokémon vs. Metal Slug aesthetics).
39
+
40
+ ---
41
 
42
+ Animator2D is an exciting step toward AI-assisted sprite animation generation, and future versions will continue to push the boundaries of what’s possible in pixel-art automation! 🚀🎮