Commit 73256f4 · 1 Parent(s): 5daf891
Update README.md

README.md CHANGED
@@ -7,4 +7,178 @@ sdk: static
 pinned: false
 ---
 
-
+# <p align="center">Deep Learning 101</p>
+
+## <p align="center">The top private AI Meetup in Taiwan, launched in 2016</p>
+
+<p align="center">
+<img src="./images/DeepLearning101.JPG" width="50%" />
+
+<p align="center">
+<a href="https://www.facebook.com/groups/525579498272187/">Taiwan AI Group (台灣人工智慧社團)</a>
+</p>
+
+<p align="center">
+http://DeepLearning101.TWMAN.ORG
+<p align="center">
+https://huggingface.co/DeepLearning101
+<p align="center">
+https://www.youtube.com/@DeepLearning101
+
+##
+
+### [Speech Processing](https://github.com/Deep-Learning-101/Speech-Processing-Paper): **[Pitfalls We Hit in Speech Processing](https://blog.twman.org/2021/04/ASR.html)**: [analyzing and recognizing interviews and conversations](https://www.twman.org/AI/ASR).
+
+<details open>
+<summary><strong>Speech Processing</strong></summary>
+
+<details open>
+<summary>Speech Recognition</summary>
+
+- [Chinese speech recognition](https://www.twman.org/AI/ASR)
+- [Whisper](https://github.com/Deep-Learning-101/Speech-Processing-Paper/blob/main/Whisper.md)
+- (2023-07-26@TonTon) WhisperX has been deployed on the Hugging Face Hub
+- [WeNet](https://github.com/Deep-Learning-101/Speech-Processing-Paper/blob/main/WeNet.md)
+- [FunASR](https://github.com/Deep-Learning-101/Speech-Processing-Paper/blob/main/FunASR.md)
+
+</details>
+
+<details open>
+<summary>Speaker Recognition</summary>
+
+- [Chinese speaker (voiceprint) recognition](https://www.twman.org/AI/ASR/SpeakerRecognition)
+- [WeSpeaker](https://github.com/Deep-Learning-101/Speech-Processing-Paper/blob/main/WeSpeaker.md)
+- [SincNet](https://github.com/Deep-Learning-101/Speech-Processing-Paper/blob/main/SincNet.md)
+
+</details>
+
+<details close>
+<summary>Speech Enhancement</summary>
+
+- [Chinese speech enhancement (denoising)](https://www.twman.org/AI/ASR/SpeechEnhancement)
+- [Denoiser](https://github.com/Deep-Learning-101/Speech-Processing-Paper/blob/main/Denoiser.md)
+
+</details>
+
+<details close>
+<summary>Speech Separation</summary>
+
+- [Chinese speaker separation (segmentation)](https://www.twman.org/AI/ASR/SpeechSeparation)
+- [Mossformer](https://github.com/Deep-Learning-101/Speech-Processing-Paper/blob/main/Mossformer.md)
+- [TOLD@FASR](https://github.com/alibaba-damo-academy/FunASR/tree/main/egs/callhome/TOLD)
+- [TOLD: a speaker diarization framework that can model overlapping speech](https://zhuanlan.zhihu.com/p/650346578)
+
+</details>
+
+<details close>
+<summary>Speech Synthesis</summary>
+
+- [Tsinghua University's LightGrad-TTS, with a streaming implementation](https://zhuanlan.zhihu.com/p/656012430): https://github.com/thuhcsi/LightGrad
+- [Mobvoi MeetVoice: making synthesized voices hard to tell from real ones](https://zhuanlan.zhihu.com/p/92903377)
+- [VALL-E: Microsoft's new speech synthesis model can replicate anyone's voice within 3 seconds](https://zhuanlan.zhihu.com/p/598473227)
+- [BLSTM-RNN, Deep Voice, Tacotron… have you mastered them all? A roundup of essential classic speech synthesis models (Part 1)](https://new.qq.com/rain/a/20221204A02GIT00)
+- [Tacotron2, GST, Glow-TTS, Flow-TTS… have you mastered them all? A roundup of essential classic speech synthesis models (Part 2)](https://cloud.tencent.com/developer/article/2250062)
+- Bark: https://github.com/suno-ai/bark
+- [The strongest text-to-speech tool: Bark, a detailed tutorial on local installation, cloud deployment, and online demo](https://zhuanlan.zhihu.com/p/630900585)
+- [Optimizing the text-to-speech model Bark with Transformers](https://zhuanlan.zhihu.com/p/651951136)
+
+</details>
+</details>
+
+##
+
+### [Natural Language Processing (NLP)](https://github.com/Deep-Learning-101/Natural-Language-Processing-Paper): **[Pitfalls We Hit in Natural Language Processing](https://blog.twman.org/2021/04/NLP.html)**: [analyzing and extracting information from documents](https://www.twman.org/AI/NLP).
+
+#### [Large Language Models (LLMs): want one?](https://blog.twman.org/2023/04/GPT.html)
+#### [A unified information extraction framework based on instruction tuning with machine reading comprehension: extracting and analyzing physician's orders from medical diagnosis certificates](https://blog.twman.org/2023/07/HugIE.html)
+
+<details open>
+<summary><strong>Natural Language Processing</strong></summary>
+
+<details open>
+<summary>Large Language Model</summary>
+
+- [LangChain](https://github.com/Deep-Learning-101/Natural-Language-Processing-Paper#langchain)
+- [Retrieval Augmented Generation](https://github.com/Deep-Learning-101/Natural-Language-Processing-Paper#rag)
+- [LLM Models](https://github.com/Deep-Learning-101/Natural-Language-Processing-Paper#llm-%E6%A8%A1%E5%9E%8B%E4%BB%8B%E7%B4%B9)
+
+</details>
+
+<details open>
+<summary>Information/Event Extraction</summary>
+
+- [HugNLP](https://github.com/Deep-Learning-101/Natural-Language-Processing-Paper/blob/main/HugNLP.md)
+- [DeepKE](https://github.com/Deep-Learning-101/Natural-Language-Processing-Paper/blob/main/DeepKE.md)
+- [ERNIE-Layout](https://github.com/Deep-Learning-101/Natural-Language-Processing-Paper/blob/main/ERNIE-Layout.md)
+
+</details>
+
+<details close>
+<summary>Machine Reading Comprehension</summary>
+</details>
+
+<details close>
+<summary>Named Entity Recognition</summary>
+</details>
+
+<details close>
+<summary>Correction</summary>
+</details>
+
+<details close>
+<summary>Classification</summary>
+</details>
+
+<details close>
+<summary>Similarity</summary>
+</details>
+
+</details>
+
+
+
+
+##
+
+### [Computer Vision](https://www.twman.org/AI/CV): [analyzing and detecting objects or scenes in images](https://github.com/Deep-Learning-101/Computer-Vision-Paper).
+
+#### [Fine-tuning on medical diagnosis certificates and receipts with PaddleOCR's PPOCRLabel](https://blog.twman.org/2023/07/wsl.html)
+
+
+<details open>
+<summary><strong>Image Processing:</strong></summary>
+
+<details close>
+<summary>Optical Character Recognition</summary>
+
+- PaddleOCR
+
+</details>
+
+<details open>
+<summary>Document Layout Analysis</summary>
+
+- [arXiv-2020_LayoutLM](https://github.com/Deep-Learning-101/Computer-Vision-Paper/blob/main/LayoutLM.md)
+- [arXiv-2021_LayoutLMv2](https://github.com/Deep-Learning-101/Computer-Vision-Paper/blob/main/LayoutLMv2.md)
+- arXiv-2021_LayoutXLM
+- arXiv-2022_LayoutLMv3
+
+</details>
+
+<details close>
+<summary>Document Understanding</summary>
+</details>
+
+<details close>
+<summary>Object Detection</summary>
+</details>
+
+<details close>
+<summary>Handwriting Recognition</summary>
+</details>
+
+<details close>
+<summary>Face Recognition</summary>
+</details>
+
+</details>