Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -27,9 +27,153 @@ Detail Graph: https://github.com/AaronCWacker/Yggdrasil/blob/main/Mermaid/Multim
|
|
27 |
|
28 |
5. 
|
29 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
30 |
|
31 |
|
32 |
-
# AI Multimodal Model
|
33 |
|
34 |
Zero GPU and GPU Spaces of note:
|
35 |
|
|
|
27 |
|
28 |
5. 
|
29 |
|
30 |
+
# Architecture to AI Space
|
31 |
+
|
32 |
+
# ๐ค AI Types and Capabilities Classification
|
33 |
+
|
34 |
+
## 1. Foundation Models & Orchestrators ๐ฏ
|
35 |
+
|
36 |
+
### Large Language Models (LLMs)
|
37 |
+
- ๐ง **General Purpose**
|
38 |
+
- GPT-4 Omni
|
39 |
+
- Claude 3.5 Sonnet
|
40 |
+
- ๐ **Domain-Specific**
|
41 |
+
- Medical knowledge models
|
42 |
+
|
43 |
+
**_Mapped Spaces:_**
|
44 |
+
- `GPT-4o-omni-text-audio-image-video`
|
45 |
+
- `AnthropicClaude3.5Sonnet-ACW`
|
46 |
+
|
47 |
+
## 2. Content Generation ๐จ
|
48 |
+
|
49 |
+
### 2.1 Image Generation
|
50 |
+
#### Static Images
|
51 |
+
- ๐ผ๏ธ **Text-to-Image**
|
52 |
+
- Stable Diffusion 3
|
53 |
+
- DALL-E 3
|
54 |
+
- Midjourney
|
55 |
+
- ๐ **Image-to-Image**
|
56 |
+
- ControlNet
|
57 |
+
- Stable Cascade
|
58 |
+
|
59 |
+
**_Mapped Spaces:_**
|
60 |
+
- `OpenDalleV1.1-GPU-Demo`
|
61 |
+
- `stable-cascade`
|
62 |
+
- `stable-diffusion-3-medium`
|
63 |
+
- `playgroundai/playground-v2.5`
|
64 |
+
|
65 |
+
### 2.2 Video Generation
|
66 |
+
#### Dynamic Content
|
67 |
+
- ๐ฅ **Text-to-Video**
|
68 |
+
- Stable Video Diffusion
|
69 |
+
- ๐ฌ **Image-to-Video**
|
70 |
+
- Video Generation models
|
71 |
+
|
72 |
+
**_Mapped Spaces:_**
|
73 |
+
- `stable-video-diffusion`
|
74 |
+
- `AI-MovieMaker-Comedy`
|
75 |
+
|
76 |
+
### 2.3 Audio Generation
|
77 |
+
#### Sound Creation
|
78 |
+
- ๐ต **Music Generation**
|
79 |
+
- MusicGen
|
80 |
+
- MIDI Composer
|
81 |
+
- ๐ฃ๏ธ **Text-to-Speech**
|
82 |
+
- Edge TTS
|
83 |
+
- ๐๏ธ **Voice Cloning**
|
84 |
+
- Voice Clone tools
|
85 |
+
|
86 |
+
**_Mapped Spaces:_**
|
87 |
+
- `midi-composer`
|
88 |
+
- `AudioFileGenerationWithSDAudio`
|
89 |
+
- `MusicGenStreamFacebook`
|
90 |
+
- `EZ-Voice-Clone-From-Long-Text`
|
91 |
+
|
92 |
+
## 3. Analysis & Understanding ๐
|
93 |
+
|
94 |
+
### 3.1 Text Analysis
|
95 |
+
#### Document Processing
|
96 |
+
- ๐ **RAG Systems**
|
97 |
+
- Arxiv RAG
|
98 |
+
- ๐ **Knowledge Extraction**
|
99 |
+
|
100 |
+
**_Mapped Spaces:_**
|
101 |
+
- `Arxiv-Paper-Search-And-QA-RAG-Pattern`
|
102 |
+
- `AIKnowledgeTreeBuilder`
|
103 |
+
|
104 |
+
### 3.2 Speech Processing
|
105 |
+
#### Audio Analysis
|
106 |
+
- ๐ค **Speech Recognition**
|
107 |
+
- Real-Time ASR
|
108 |
+
- ๐ข **Speech Understanding**
|
109 |
+
|
110 |
+
**_Mapped Spaces:_**
|
111 |
+
- `RealTimeAsyncASR`
|
112 |
+
- `whisper-web`
|
113 |
+
|
114 |
+
## 4. Visualization & Presentation ๐
|
115 |
+
|
116 |
+
### 4.1 Data Visualization
|
117 |
+
#### Chart Generation
|
118 |
+
- ๐ **AutoML Plotting**
|
119 |
+
- ๐บ๏ธ **Mapping Tools**
|
120 |
+
|
121 |
+
**_Mapped Spaces:_**
|
122 |
+
- `AutoMLUsingStreamlit-Plotly`
|
123 |
+
- `DynamicMapCreator2`
|
124 |
+
|
125 |
+
### 4.2 3D Visualization
|
126 |
+
#### 3D Content
|
127 |
+
- ๐ **Mesh Generation**
|
128 |
+
- InstantMesh
|
129 |
+
- ๐ฎ **Interactive 3D**
|
130 |
+
|
131 |
+
**_Mapped Spaces:_**
|
132 |
+
- `InstantMesh`
|
133 |
+
|
134 |
+
## 5. Integration & Composition ๐
|
135 |
+
|
136 |
+
### 5.1 Media Composition
|
137 |
+
#### Multi-modal Integration
|
138 |
+
- ๐ฌ **Video Composition**
|
139 |
+
- ๐ **Audio Mixing**
|
140 |
+
|
141 |
+
**_Mapped Spaces:_**
|
142 |
+
- `Jasperai/Flux.1-dev-Controlnet-Upscaler`
|
143 |
+
|
144 |
+
### 5.2 Interactive Systems
|
145 |
+
#### User Interaction
|
146 |
+
- ๐ฅ **Chat Interfaces**
|
147 |
+
- ๐ค **Multiplayer Systems**
|
148 |
+
|
149 |
+
**_Mapped Spaces:_**
|
150 |
+
- `anychat`
|
151 |
+
- `Multiplayer-Eval-App-Upvote`
|
152 |
+
- `SocketIO-Multiplayer-DrawingGame`
|
153 |
+
|
154 |
+
## 6. Specialized Applications ๐ฅ
|
155 |
+
|
156 |
+
### 6.1 Healthcare Specific
|
157 |
+
#### Medical Tools
|
158 |
+
- ๐ **Training Material Generation**
|
159 |
+
- ๐จโโ๏ธ **Patient Communication**
|
160 |
+
|
161 |
+
**_Mapped Spaces:_**
|
162 |
+
- `USMLE-Story-Voting-Evals`
|
163 |
+
- `PDF-text-to-speech-Per-Page`
|
164 |
+
|
165 |
+
### 6.2 Research Tools
|
166 |
+
#### Scientific Applications
|
167 |
+
- ๐ **Literature Analysis**
|
168 |
+
- ๐ฌ **Data Processing**
|
169 |
+
|
170 |
+
**_Mapped Spaces:_**
|
171 |
+
- `Models-Datasets-Spaces-Search-Hub`
|
172 |
+
- `RLHF.Reinforce.Learn.With.Human.Feedback`
|
173 |
+
|
174 |
|
175 |
|
176 |
+
# AI Pipeline Architecture Models for Multimodal Model Space Workflows
|
177 |
|
178 |
Zero GPU and GPU Spaces of note:
|
179 |
|