Changelog
v2.6.1 (2024-09-09)
- Google colabã§ãtorchã®ããŒãžã§ã³ç±æ¥ã§ãšã©ãŒãçºçããäžå ·åã®ä¿®æ£ïŒãã¶ãïŒ
- WebUIããã®ã¹ã¿ã€ã«äœæã§ã®ããµããã©ã«ãã«ããã¹ã¿ã€ã«åãã§ãšã©ãŒãçºçããŠããç¹ã®ä¿®æ£
v2.6.0 (2024-06-16)
æ°æ©èœ
ã¢ãã«ã®ããŒãžæã«ãä»ãŸã§ã® new = (1 - weight) * A + weight * B
ã®ä»ã«ã次ãè¿œå
new = A + weight * (B - C)
: å·®åããŒãžnew = a * A + b * B + c * C
: å éåããŒãžnew = A + weight * B
: ãã«ã¢ãã«ã®ããŒãž
å·®åããŒãžã¯ãäŸãã°BããCãšåã話è
ã ãã©åããŠããã¢ãã«ããšãããšãB - C
ãåããã¯ãã«çãªãã®ã ãšæããã®ã§ããããAã«è¶³ãããšã§ãAã®è©±è
ãåããŠãããããªé³å£°ãçæã§ããããã«ãªããŸãã
ãŸããå éåã§new = A - B
ãäœã£ãŠãããããã«ã¢ãã«ããŒãžã§å¥ã®ã¢ãã«ã«è¶³ãã°ãå®è³ªå·®åããŒãžãå®çŸã§ããŸãããŸãè¬ã«new = -A
ãnew = 41 * A
çã®ã¢ãã«ãäœãããšãã§ããŸãã
ãããã®ããŒãžã®æŽ»çšæ³ã«ã€ããŠã¯åèªããããèããŠå®éšããŠã¿ãŠãé¢çœã䜿ãæ¹ãããã°ãã²å ±æããŠãã ããã
åãã«ã€ããŠå®éšçã«äœã£ããã«ã¢ãã«ããã¡ãã«çœ®ããŠããŸããããããã«ã¢ãã«ããŒãžã§äœ¿ãããšã§ãä»»æã®ã¢ãã«ãåãã¢ãã«ã«ããçšåºŠã¯å€æã§ããŸãã
æ¹å
- ã¹ã¿ã€ã«ãã¯ãã«ã®ããŒãžéšåã®UIã®æ¹å
- WebUIã®
App.bat
ã®èµ·åãå°ãéãã®ã§ãããããã®æ©èœãåå²ããDataset.bat
,Inference.bat
,Merge.bat
,StyleVectors.bat
,Train.bat
ãè¿œå (ä»ãŸã§ã®App.bat
ããããŸã§éã䜿ããŸã)
v2.5.1 (2024-06-14)
ã©ã€ã»ã³ã¹ãšã®ã³ã³ããªã¯ããããå©çšèŠçŽãéçºé£ããã®ãé¡ããšããã©ã«ãã¢ãã«ã®å©çšèŠçŽã«å€æŽããŸããã
v2.5.0 (2024-06-02)
ãã®ããŒãžã§ã³ããå©çšèŠçŽãè¿œå ãããŸããããå©çšã®éã¯å¿ ããèªã¿ãã ããã
æ°æ©èœç
- ããã©ã«ãã¢ãã«ã« ãã¿ããã®å£°çŽ æå·¥æ¿ ã®ãã¿ããæ§ãå
¬éããŠããã³ãŒãã¹ãšã©ã€ãé
ä¿¡é³å£°ãå©çšããŠåŠç¿ããå°æ¥é³ã¢ããšãã¿ããã¢ãã«ãè¿œå ïŒãã¿ããæ§ã«ã¯äºåã«é£çµ¡ããŠèš±è«ŸãåŸãŠããŸãïŒ
- ã¢ããã®å Žåã¯
Initialize.bat
ãããã«ã¯ãªãã¯ããã°ã¢ãã«ãããŠã³ããŒãã§ããŸãïŒæåã§ããŠã³ããŒãããŠmodel_assets
ãã©ã«ãã«å ¥ããããšãå¯èœïŒ
- ã¢ããã®å Žåã¯
- åŠç¿æã«é³å£°ããŒã¿ãã¹ã¿ã€ã«ããšã«ãã©ã«ãåãããŠããããšã§ããã®ãã©ã«ãããšã®ã¹ã¿ã€ã«ãåŠç¿æã«èªåçã«äœæããããã«
inputs
ããã¹ã©ã€ã¹ããŠäœ¿ãå Žåã¯inputs
çŽäžã«äœãããã¹ã¿ã€ã«ã ããµããã©ã«ããäœãããã«é³å£°ãã¡ã€ã«ãé 眮Data/ã¢ãã«å/raw
ãã䜿ãå Žåãraw
çŽäžã«åæ§ã«é 眮- ãµããã©ã«ãã®åæ°ã0ãŸãã¯1ã®å Žåã¯ãä»ãŸã§éãã®Neutralã¹ã¿ã€ã«ã®ã¿ãäœæãããŸã
- batãã¡ã€ã«ã§ã®ã€ã³ã¹ããŒã«ã®å€§å¹ ãªé«éåïŒPythonã®ã©ã€ãã©ãªã€ã³ã¹ããŒã«ã«uvã䜿çšïŒ
- åŠç¿æã«ãã«ã¹ã¿ã ããããµã³ãã©ãŒãç¡å¹åããªãã·ã§ã³ãè¿œå ãããã«ãããé·ãé³å£°ãã¡ã€ã«ãåŠç¿ã«äœ¿ãããããã«ãªããŸããã䜿çšVRAMãããªãå¢ãããåŠç¿ãäžå®å®ã«ãªãå¯èœæ§ããããŸãã
- ãããã質åãè¿œå
- è±èªã®é³å£°åæã®é床åäžïŒgordon0414ããã«ããPRã§ããããããšãããããŸãïŒïŒ
- ãšãã£ã¿ãŒã®åçš®æ©èœæ¹åïŒå€ããkamexyæ§ã«ãããšãã£ã¿ãŒãªããžããªãžã®ãã«ãªã¯çŸ€ã§ããããããšãããããŸãïŒïŒ
- éžæããè¡ã®äžã«æ°èŠã®è¡ãäœæã§ããããã«
- Mac䜿çšæã«æ¥æ¬èªå€æã®ãšã³ã¿ãŒã§é³å£°åæãèµ°ããã°ã®ä¿®æ£
- ããŒã¹ãæã«æ¹è¡ãå«ãŸãªãå Žåã¯éåžžã®ããŒã¹ãã®æ¯ãèãã«ãªãããã«ä¿®æ£
ãã®ä»ã®æ¹å
- äžã®ã¹ã¿ã€ã«èªåäœææ©èœãæ¢åã¢ãã«ã§ã䜿ãããããªæ©èœè¿œå ãå ·äœçã«ã¯ãã¹ã¿ã€ã«äœæã¿ãã«ãŠããã©ã«ãåããããé³å£°ãã¡ã€ã«ã®ãã£ã¬ã¯ããªãä»»æã«æå®ãããã®ãã©ã«ãåãã䜿ã£ãŠæ¢åã®ã¢ãã«ã®ã¹ã¿ã€ã«ã®äœæãå¯èœã«
- é³å£°æžãèµ·ããã«kotoba-whisperãè¿œå
- é³å£°æžãèµ·ããæã«Hugging Faceã®Whisperã¢ãã«ã䜿ãéã«ãæžãèµ·ãããé 次ä¿åããããã«æ¹å
- é³å£°æžãèµ·ããã®ããã©ã«ããfaster-whiperããHugging Faceã®Whisperã¢ãã«ãžå€æŽ
- ïŒã©ã€ãã©ãªãšããŠã®ã¿ïŒäŸåé¢ä¿ã®è»œéåãé³å£°åææã«èªã¿äžãããã¹ãã®èªã¿ãè¡šãé³çŽ åãæå®ããæ©èœãè¿œå + æ§ã ãªæ¹å (tsukumijimaããã«ãããã«ãªã¯ã§ããããããšãããããŸãïŒ)
å éšå€æŽ
- ãããŸã§path管çã«
configs/paths.yml
ã䜿ã£ãŠããããconfigs/default_paths.yml
ã«ãªããŒã ããconfigs/paths.yml
ã¯gitã®ç®¡ç察象å€ã«å€æŽ
ãã°ä¿®æ£
- Gradioã®ã¢ããããŒãã«ãããã¢ãã«éžææãã¹ã¿ã€ã«ã®DBSCANäœææçã«
TypeError: Type is not JSON serializable: WindowsPath
ã®ãããªãšã©ãŒãåºãåé¡ãä¿®æ£ - TensorboardãWebUIããç«ã¡äžããéã«ãšã©ãŒãåºãåé¡ã®ä¿®æ£ (#129)
v2.4.1 (2024-03-16)
batãã¡ã€ã«ã§ã®ã€ã³ã¹ããŒã«ã»ã¢ããããŒãæ¹æ³ã®å€æŽïŒãã以å€ã®å€æŽã¯ãããŸããïŒ
è«žäºæ ã«ãããã€ã³ã¹ããŒã«ã»ã¢ããããŒãã®batãã¡ã€ã«ãå€æŽããŸããïŒGitã䜿ããªãã®ã§ããŒãžã§ã³ã¢ããæã®ã¢ããããŒãã®å¯Ÿå¿ãå°é£ã ã£ããããGitããªãç°å¢ã®å Žåã¯PortableGitãããŠã³ããŒãããŠäœ¿ãããã«ïŒã
䌎ã£ãŠããããŸã§Windowsã§batãã¡ã€ã«ãããã«ã¯ãªãã¯ããŠã€ã³ã¹ããŒã«ããŠããæ¹ã¯åã€ã³ã¹ããŒã«ãå¿ é ãšãªããŸãã倧å€ç³ãèš³ãããŸããã
ã€ã³ã¹ããŒã«æé
ïŒã€ã³ã¹ããŒã«ã®æµãã¯å€ãããŸããããbatãã¡ã€ã«ã¯å€ãã£ãŠããã®ã§ãæ°ããzipãå¿ ãããŠã³ããŒãããŠãã ããïŒ
- sbv2.zipãããŠã³ããŒããã解åããŠãã ããã
- ã°ã©ããããæ¹ã¯ã
Install-Style-Bert-VITS2.bat
ãããã«ã¯ãªãã¯ããŸãã - ã°ã©ãããªãæ¹ã¯ã
Install-Style-Bert-VITS2-CPU.bat
ãããã«ã¯ãªãã¯ããŸããCPUçã§ã¯åŠç¿ã¯ã§ããŸããããé³å£°åæãšããŒãžã¯å¯èœã§ãã
ã¢ããããŒãæé
以åã®ããŒãžã§ã³ããã®ã¢ããããŒã
ä»ãŸã§ã®ç°å¢ãå šãŠåé€ããŠæ°ããã€ã³ã¹ããŒã«ããå¿ èŠããããŸãã 移è¡æ¹æ³ïŒ
- éèŠãªããŒã¿ãå
¥ã£ãŠããå¯èœæ§ã®ãã
Data
ãã©ã«ããšmodel_assets
ãã©ã«ããããã¯ã¢ãã - äžã®ã€ã³ã¹ããŒã«æé ãããæ°ããå Žæã«Style-Bert-VITS2ãã€ã³ã¹ããŒã«
- ã€ã³ã¹ããŒã«ãçµäºããããããã¯ã¢ãããã
Data
ãã©ã«ããšmodel_assets
ãã©ã«ããæ°ããStyle-Bert-VITS2
ãã©ã«ãã«ã³ã㌠- ãããŸã§ã€ã³ã¹ããŒã«ãããŠãããã©ã«ãïŒbatãã¡ã€ã«ãã¡å«ãïŒã¯åé€ããŠãæ§ããŸãã
ä»åŸã®ã¢ããããŒã
ä»åŸã¯ãæ°ããã€ã³ã¹ããŒã«ãããäžã®Update-Style-Bert-VITS2.bat
ãããã«ã¯ãªãã¯ããŠãã ãããä»ãŸã§ã®Update-Style-Bert-VITS2.bat
çã®ãã¡ã€ã«ã¯äœ¿ããŸããã
v2.4.0 (2024-03-15)
倧èŠæš¡ãªãã¡ã¯ã¿ãªã³ã°ã»æ¥æ¬èªåŠçã®ã¯ãŒã«ãŒåãšæ©èœè¿œå çãããŒã¿ã»ããäœãã»åŠç¿ã»é³å£°åæã»ããŒãžã»ã¹ã¿ã€ã«WebUIã¯å
šãŠapp.py
(App.bat
) ãžçµ±äžãããŸããã®ã§ã泚æãã ããã
ã¢ããããŒãæé
- 2.3æªæºïŒèŸæžã»ãšãã£ã¿ãŒè¿œå åïŒããã®ã¢ããããŒãã®å Žåã¯ãUpdate-to-Dict-Editor.batãããŠã³ããŒããã
Style-Bert-VITS2
ãã©ã«ããããå ŽæïŒã€ã³ã¹ããŒã«batãã¡ã€ã«ãšãããã£ããšããïŒã«ãããŠããã«ã¯ãªãã¯ããŠãã ããã - ãã以å€ã®å Žåã¯ãåçŽã«ä»ãŸã§ã®
Update-Style-Bert-VITS2.bat
ã§ã¢ããããŒãã§ããŸãã - ãã ãã¢ããããŒãã«ããå€ãã®ãã¡ã€ã«ã移åãããäžèŠã«ãªã£ããããã®ã§ãããããåé€ãããå Žåã¯Clean.batã
Update-Style-Bert-VITS2.bat
ãšåãå Žæã«ä¿åããŠå®è¡ããŠãã ããã
å éšæ¹å
- tsukumijimaããã«ãã倧èŠæš¡ãªãã¡ã¯ã¿ãªã³ã°ã®ãã«ãªã¯ ã«ãã£ãŠãå éšã³ãŒããéåžžã«æŽçããå¯èªæ§ãé«ãŸãã©ã€ãã©ãªåãããããtsukumijimaãã 倧å€ãªäœæ¥ãæ¬åœã«ããããšãããããŸãïŒ
- ã©ã€ãã©ãªãšããŠ
pip install style-bert-vits2
ã«ããããã«ã€ã³ã¹ããŒã«ã§ããé³å£°åæéšåã®æ©èœã䜿ããŸãïŒäœ¿çšäŸã¯/library.ipynbãåç §ããŠãã ããïŒ - ãã®ä»ãã®ãã«ãªã¯ã«åæ©ã¥ããããå€ãã®ã³ãŒãã®ãªãã¡ã¯ã¿ãªã³ã°ã»åã¢ãããŒã·ã§ã³ã®è¿œå çãè¡ã£ã
- æ¥æ¬èªåŠçã®pyopenjtalkããœã±ããéä¿¡ãçšããŠå¥ããã»ã¹åããè€æ°åæã«åŠç¿ãé³å£°åæãç«ã¡äžããŠãèŸæžã®ç«¶åãšã©ãŒãèµ·ããªãããã«ãkale4eat ããã«ããPR ã§ããããããšãããããŸãïŒ
ãã°ä¿®æ£
- äžèšã«ãããéããé³å£°åæãšåŠç¿ååŠçãªã©ãæ¥æ¬èªåŠçãæ±ããã®ã2ã€ä»¥äžèµ·åããããšãããšãšã©ãŒãçºçããä»æ§ã®è§£æ±ºããŠãŒã¶ãŒèŸæžã¯è¿œå ããã°åžžã«ã©ãããã§ãé©å¿ãããŸãã
raw
ãã©ã«ãã®çŽäžã§ãªããµããã©ã«ãå ã«é³å£°ãã¡ã€ã«ãããå Žåã«ãwavs
ãã©ã«ãã§ããã®æ§é ãä¿ãããŠããŸããæžãèµ·ãããã¡ã€ã«ãšã®æŽåæ§ãåããªããªãæåãä¿®æ£ããåžžã«wav
ãã©ã«ãçŽäžãžwav
ãã¡ã€ã«ãä¿åããããã«å€æŽ- ã¹ã©ã€ã¹æã«å
ãã¡ã€ã«åã«ããªãªã
.
ãå«ãŸãããšãã¹ã©ã€ã¹åŸã®ãã¡ã€ã«åããããããªããã°ã®ä¿®æ£
æ©èœæ¹åã»è¿œå
- åçš®WebUIãäžã€
app.py
App.bat
ã«çµ±äž - ãã®ä»ä»¥äžã®å€æŽãã軜埮ãªUIã»èª¬ææã®æ¹åç
ããŒã¿ã»ããäœæ
- ã¹ã©ã€ã¹åŠçã®é«éåïŒãã«ãã¹ã¬ããã«ããã倧éã«ã¹ã©ã€ã¹å
ãã¡ã€ã«ãã¡ã€ã«ãããå Žåã«é«éã«ãªããŸãïŒããŸãã¹ã©ã€ã¹å
ã®ãã¡ã€ã«ã
wav
以å€ã®mp3
ãogg
ãªã©ã®åœ¢åŒã«ãå¯Ÿå¿ - ã¹ã©ã€ã¹åŠçæã«ããã¡ã€ã«åã«ã¹ã©ã€ã¹ãããéå§çµäºåºéãå«ãããªãã·ã§ã³ãè¿œå ïŒaka7774 ããã«ããPRã§ããããããšãããããŸãïŒïŒ
- æžãèµ·ããã®é«éåããŸãHugging Faceã®Whisperã¢ãã«ã䜿ããªãã·ã§ã³ãè¿œå ãããããµã€ãºãäžããããšã§VRAMãé£ã代ããã«é床ãå€§å¹ ã«åäžããŸãã
åŠç¿
- åŠç¿å
ã®é³å£°ãã¡ã€ã«ïŒ
Data/ã¢ãã«å/raw
ã«ããããã€ïŒããwav
以å€ã®mp3
ãogg
ãªã©ã®åœ¢åŒã«ã察å¿ïŒååŠç段éã§èªåçã«wav
ãã¡ã€ã«ã«å€æãããŸãïŒïŒãã ãå€ããã1ãã¡ã€ã«2-12ç§çšåºŠã®ç¯å²ã®é·ããæãŸããïŒ
é³å£°åæ
- é³å£°åææã«ãçæé³å£°ã®é³ã®é«ãïŒé³é«ïŒãšææã®å¹
ã調æŽã§ããããã«ïŒãã ãé³è³ªãå°ãå£åããïŒã
App.bat
ãEditor.bat
ã®ã©ã¡ãããã§ã䜿ããŸãã Editor.bat
ã®è€æ°è©±è ã¢ãã«ã§ã®è©±è æå®ãå¯èœã«Editor.bat
ã§ãæ¹è¡ãå«ãæååãããŒã¹ããããšèªåçã«æ¬ãå¢ããããã«ããŸããââãããŒã§æ¬ãè¿œå ã»è¡ãæ¥ã§ããããã«ïŒãšãã£ã¿ãŒåŽã§ä»¥åã«æ¢ã«ã¢ããããŠããŸããïŒEditor.bat
ã§ã¢ãã«äžèŠ§ã®ãªããŒããã¡ãã¥ãŒã«è¿œå
API
server_fastapi.py
ã®å®è¡æã«å šãŠã®ã¢ãã«ãã¡ã€ã«ãèªã¿èŸŒãããšããæåãä¿®æ£ãé³å£°åæããªã¯ãšã¹ããããŠåããŠãã®ã¢ãã«ãèªã¿èŸŒãããã«å€æŽïŒAPIã䜿ããªãé³å£°åæã®ãšããšåãæåïŒserver_fastapi.py
ã®é³å£°åæãšã³ããã€ã³ã/voice
ã«ã€ããŠãGETã¡ãœããã«å ããŠPOSTã¡ãœãããè¿œå ãGETã¡ãœããã§ã¯å€ãã®å¶çŽããããããªã®ã§POSTã䜿ãããšãæšå¥šãããŸãã
CLI
preprocess_text.py
ã§ãæžãèµ·ãããã¡ã€ã«ã§ã®é³å£°ãã¡ã€ã«åãèªåçã«æ£ããData/ã¢ãã«å/wavs/
ãžæžãæãã--correct_path
ãªãã·ã§ã³ã®è¿œå ïŒWebUIã§ã¯ä»ãŸã§ããã®æåã§ããïŒ- ãã®ä»äžè¿°ã®ããŒã¿ã»ããäœæã®æ©èœè¿œå ã«äŒŽãCLIã®ãªãã·ã§ã³ã®è¿œå ïŒè©³ããã¯CLI.mdãåç §ïŒ
v2.3.1 (2024-02-27)
ãã°ä¿®æ£
- colabã®åŠç¿çšããŒãããã¯ãåããªãã£ãã®ãä¿®æ£
App.bat
ãserver_fastapi.py
ã§ã¯èªããªãæåã§ãŸã ãšã©ãŒãçºçããããã«ãªã£ãŠããã®ã§ãæšè«æã¯å¿ ãèªããªãæåãç¡èŠããŠåŒ·åŒã«èªãããã«æåãå€æŽ
æ¹å
- èªã¿ãååŸã§ããªãå Žåã«ãããã¹ãååŠçå®äºæã«ãšã©ãŒã§äžæããä»ãŸã§ã®æåã«å ããŠããèªã¿ååŸå€±æãã¡ã€ã«ãåŠç¿ã«äœ¿ããã«é²ããããããã¯ãèªããªãæåãç¡èŠããŠèªãã§ãã¡ã€ã«ãåŠç¿ã«äœ¿ãé²ããããšãããªãã·ã§ã³ãè¿œå ã
- ããŒãžæ¹æ³ã«ç·åœ¢è£éã®ä»ã«çé¢ç·åœ¢è£å®ãè¿œå ïŒ@frodo821 ããã«ããPRã§ããããããšãããããŸãïŒïŒ
- ãããã€çš
.dockerignore
ãæŽæ°
ã¢ããããŒãæé
- 2.3æªæºããã®ã¢ããããŒãã®å Žåã¯ãUpdate-to-Dict-Editor.batãããŠã³ããŒããã
Style-Bert-VITS2
ãã©ã«ããããå ŽæïŒã€ã³ã¹ããŒã«batãã¡ã€ã«ãšãããã£ããšããïŒã«ãããŠããã«ã¯ãªãã¯ããŠãã ããã - 2.3ããã®ã¢ããããŒãã®å Žåã¯ãåçŽã«ä»ãŸã§ã®
Update-Style-Bert-VITS2.bat
ã§ã¢ããããŒãã§ããŸãã
v2.3 (2024-02-26)
倧ããªå€æŽ
倧ããå€æŽãããã€ããããããã¢ããããŒãã¯ãŸãå°çšã®æé ãå¿ èŠã§ããäžèšã®æ瀺ã«ãããã£ãŠãã ããã
ãŠãŒã¶ãŒèŸæžæ©èœ
ãããããèŸæžã«åºæåè©ãè¿œå ããããšãã§ãããããåŠç¿æã»é³å£°åææã®èªã¿ååŸéšåã«é©å¿ãããŸããèŸæžã®è¿œå ã»ç·šéã¯æ¬¡ã®ãšãã£ã¿çµç±ã§è¡ã£ãŠãã ããããŸãã¯ãææã¡ã®OpenJTalkã®csv圢åŒã®èŸæžãããå Žåã¯ãdict_data/default.csv
ãã¡ã€ã«ãçŽæ¥äžæžããè¿œå ããŠãå¯èœã§ãã
䜿ããããªèŸæžïŒã©ã€ã»ã³ã¹çã¯åèªã確èªãã ããïŒïŒä»ã«è¯ãã®ããã£ããæããŠäžããïŒïŒ
èŸæžæ©èœéšåã®å®è£ ã¯ãäžã®READMEã«ããéããVOICEVOX Editor ã®ãã®ã䜿ã£ãŠããããã®éšåã®ã³ãŒãã©ã€ã»ã³ã¹ã¯LGPL-3.0ã§ãã
é³å£°åæå°çšãšãã£ã¿
ð€ ãªã³ã©ã€ã³ãã¢ã¯ãã¡ããã
é³å£°åæå°çšãšãã£ã¿ãè¿œå ãä»ãŸã§ã®WebUIã§ã§ããæ©èœã®ã»ãã次ã®ãããªæ©èœã䜿ããŸãïŒã€ãŸãæ¢åã®æ¥æ¬èªé³å£°åæãœãããŠã§ã¢ã®ãšãã£ã¿ãç䌌ãŸããïŒïŒ
- ã»ãªãåäœã§ãã£ã©ãèšå®ãå€æŽããªããåçš¿ãäœãããããäžæ¬ã§çæããããåçš¿ãä¿åçãããèªã¿èŸŒãã ã
- GUIããåãããããã¢ã¯ã»ã³ã調æŽ
- ãŠãŒã¶ãŒèŸæžãžã®åèªè¿œå ãç·šé
Editor.bat
ãããã«ã¯ãªãã¯ãpython server_editor.py --inbrowser
ã§èµ·åããŸãããšãã£ã¿ãŒéšåã¯ãã¡ãã®å¥ãªããžããªã«ãªããŸããããã³ããšã³ãåå¿è
ãªã®ã§ãã«ãªã¯ãæ¹åæ¡çããåŸ
ã¡ããŠããŸãã
ãã°ä¿®æ£
- ç¹å®ã®ç¶æ³ã§èªã¿ãæ£ããååŸã§ãã
list index out of range
ãšãªããã°ã®ä¿®æ£ - ååŠçæã«ãæžãèµ·ãããã¡ã€ã«ã®ããè¡ã®åœ¢åŒãäžæ£ã ãšãæžãèµ·ãããã¡ã€ã«ã®ãã以éã®å 容ãæ¶ããŠããŸããã°ã®ä¿®æ£
- faster-whisperã1.0.0ã«ã¡ãžã£ãŒããŒãžã§ã³ã¢ããããïŒä»ã®ãšããïŒå€§å¹ ã«å£åããã®ã§ãããŒãžã§ã³ã0.10.1ãžåºå®
æ¹å
- ããã¹ãååŠçæã«ãèªã¿ã®ååŸã®å€±æçããã£ãå Žåã«ãåŠçãäžæããããšã©ãŒããããç®æã
text_error.log
ãã¡ã€ã«ãžä¿åããããã«å€æŽã - é³å£°åææã«ãèªããªãæåããã£ããšãã¯ãšã©ãŒãèµ·ãããããã®éšåãç¡èŠããŠèªã¿äžããããã«å€æŽïŒåŠç¿æ®µéã§ã¯ãšã©ãŒãåºããŸãïŒ
- ã³ãã³ãã©ã€ã³ã§ååŠçãåŠç¿ãç°¡åã«ã§ãããããååŠçãè¡ã
preprocess_all.py
ãè¿œå ïŒè©³ããã¯CLI.mdãåç §ïŒ - åŠç¿ã®éã«ãèªåçã«èªåã®hugging faceãªããžããªãžçµæãã¢ããããŒããããªãã·ã§ã³ãè¿œå ãã³ãã³ãã©ã€ã³åŒæ°ã§
--repo_id username/my_model
ã®ããã«æå®ããŠãã ããïŒè©³ããã¯CLI.mdãåç §ïŒãð€ã®ç¡å¶éã¹ãã¬ãŒãžã䜿ããã®ã§ã¯ã©ãŠãã§ã®åŠç¿ã«äŸ¿å©ã§ãã - åŠç¿æã«ãã³ãŒããŒéšåãåçµãããªãã·ã§ã³ã®è¿œå ãå質ãããããããäžãããããããŸããã
initialize.py
ã«åŒæ°--dataset_root
ãš--assets_root
ãè¿œå ããconfigs/paths.yml
ããã®æç¹ã§å€æŽã§ããããã«ãã
ãã®ä»
- paperspaceã§ã®åŠç¿ã®æåŒããè¿œå ãpaperspaceã§ã®imageã«äœ¿ããDockerfileãè¿œå
- CLIã§ã®åçš®åŠçã®å®è¡ã®ä»æ¹ãè¿œå
- Hugging Face spacesã§éã¹ãé³å£°åæãšãã£ã¿ããããã€ããããã®Dockerfileãè¿œå
ã¢ããããŒãæé
Update-to-Dict-Editor.batãããŠã³ããŒããã
Style-Bert-VITS2
ãã©ã«ããããå ŽæïŒã€ã³ã¹ããŒã«batãã¡ã€ã«ãšãããã£ããšããïŒã«ãããŠããã«ã¯ãªãã¯ããŠãã ãããæåã§ã®å Žåã¯ã以äžã®æé ã§å®è¡ããŠãã ããïŒ
git pull
venv\Scripts\activate
pip uninstall pyopenjtalk-prebuilt
pip install -U -r requirements.txt
# python initialize.py # ããã1.xç³»ããã®ã¢ããããŒãã®å Žåã¯å®è¡ããŠãã ãã
python server_editor.py --inbrowser
æ°èŠã€ã³ã¹ããŒã«æé
ãã®zipãããŠã³ããŒããã解åããŠãã ããã
ãå±éããInstall-Style-Bert-VITS2.bat
ãããã«ã¯ãªãã¯ããŠãã ããã
v2.2 (2024-02-09)
å€æŽã»æ©èœè¿œå
- bfloat16ãªãã·ã§ã³ã¯ãã¡ãªããããç¡ããããªã®ã§ãåžžã«ãªãã§åŠç¿ããããå€æŽ
- ããããµã€ãºã®ããã©ã«ãã4ãã2ã«å€æŽãåŠç¿ãé ãå Žåã¯ããããµã€ãºãäžããŠè©ŠããŠã¿ãŠãVRAMã«äœè£ãããã°äžããŠãã ãããJP-Extra䜿çšæã§ã®ããããµã€ãºããšã®VRAM䜿çšéç®å®ã¯ã1: 6GB, 2: 8GB, 3: 10GB, 4: 12GB ãããã®ããã§ãã
- åŠç¿ã®éã®æ€èšŒããŒã¿æ°ãããã©ã«ãã§0ã«å€æŽãããŸãæ€èšŒããŒã¿æ°ãåŠç¿çšWebUIã§æå®ã§ããããã«ãã
- Tensorboardã®ãã°ééãåŠç¿çšWebUIã§æå®ã§ããããã«ãã
- UIã®ããŒãã
common/constants.py
ã®GRADIO_THEME
ã§æå®ã§ããããã«ãã
ãã°ä¿®æ£
- JP-Extra䜿çšæã«ããããµã€ãºã1ã ãšåŠç¿äžã«ãšã©ãŒãçºçãããã°ãä¿®æ£
- ãããã«ã¡ã¯!?!?!?!?ãçãæå笊çã®èšå·ãé£ç¶ãããšåŠç¿ã»é³å£°åæã§ãšã©ãŒã«ãªããã°ãä¿®æ£
â
(em dash, U+2014) ãâ
(quotation dash, U+2015) çã®ããã·ã¥ããã€ãã³ã®åçš®å€çš®ããçš®é¡ã«ãã£ãŠ-
ïŒéåžžã®åè§ãã€ãã³ïŒã«æ£èŠåãããããããŠããªãã£ããããåŠçããå šãŠæ£èŠåããããã«ä¿®æ£
v2.1 (2024-02-07)
å€æŽ
- åŠç¿ã®éãããã©ã«ãã§ã¯bfloat16ãªãã·ã§ã³ã䜿ããªãããå€æŽïŒåŠç¿ãçºæ£ããã質ãäžããããšãããæš¡æ§ïŒ
- åŠç¿ã®éã®ã¡ã¢ãªäœ¿çšéãåæžããããšé 匵ã£ã
ãã°ä¿®æ£ãæ¹å
- åŠç¿WebUIããTensorboardã®ãã°ãèŠããããã«
- é³å£°åæïŒããã®APIïŒã«ãããŠãåæã«å¥ã®è©±è ãéžæããé³å£°åæããªã¯ãšã¹ããããå Žåã«çºçãããšã©ãŒãä¿®æ£
- ã¢ãã«ããŒãžæã«ããã®ã¬ã·ãã
recipe.json
ãã¡ã€ã«ãžä¿åããããã«å€æŽ - ãæ¹è¡ã§åããŠçæããããææ ãä¹ãæšã®æèšçã軜埮ãªèª¬ææã®æ¹å
- ã
ãŒãŒããã¯é¢çœã
ããããªãã»ã©ããŒãŒãŒããããããšãã
ãçãé·é³èšå·ã®åãæ¯é³ã§ãªãå Žåãé·é³èšå·ãŒ
ã§ãªãããã·ã¥â
ã®åéãã ãšæãããã®ã§ãããã·ã¥èšå·ãšããŠåŠçããããã«å€æŽ
v2.0.1 (2024-02-05)
軜埮ãªãã°ä¿®æ£ãæ¹å
- ã¹ã¿ã€ã«ãã¯ãã«ã«
NaN
ãå«ãŸããŠããå ŽåïŒäž»ã«é³å£°ãã¡ã€ã«ã極端ã«çãå Žåã«çºçïŒããããåŠç¿ãªã¹ãããé€å€ããããã«ä¿®æ£ - colabã«ããŒãžã®è¿œå
- åŠç¿æã®ããã°ã¬ã¹ããŒã®è¡šç€ºãããããã£ãã®ãä¿®æ£
- ããã©ã«ãã®jvnvã¢ãã«ãJP-Extraçã«ã¢ããããŒããæ°ããã¢ãã«ã䜿ãããæ¹ã¯æåã§ãã¡ãããããŠã³ããŒããããã
python initialize.py
ããããããã®batãã¡ã€ã«ãStyle-Bert-VITS2
ãã©ã«ããããå ŽæïŒã€ã³ã¹ããŒã«batãã¡ã€ã«ãšãããã£ããšããïŒã«ãããŠããã«ã¯ãªãã¯ããŠãã ããã
v2.0 (2024-02-03)
倧ããå€æŽ
ã¢ãã«æ§é ã« Bert-VITS2ã®æ¥æ¬èªç¹åã¢ãã« JP-Extra ãåã蟌ãã ãã®ã䜿ããããã«å€æŽãäºååŠç¿ã¢ãã«ãBert-VITS2 JP-Extraã®ãã®ãæ¹é ããŠStyle-Bert-VITS2ã§äœ¿ããããã«ããŸãã (ã¢ãã«æ§é ãèŠçŽããŠæ¥æ¬èªã§ã®åŠç¿ãããŠããã ãã @Stardust-minus æ§ã«æè¬ããŸã)
- ããã«ãããæ¥æ¬èªã®çºé³ãã¢ã¯ã»ã³ããææãèªç¶æ§ãåäžããåŸåããããŸã
- ã¹ã¿ã€ã«ãã¯ãã«ã䜿ã£ãã¹ã¿ã€ã«ã®æäœã¯å€ããã䜿ããŸã
- ãã ãJP-Extraã§ã¯è±èªãšäžåœèªã®é³å£°åæã¯ïŒçŸç¶ã¯ïŒã§ããŸãã
- æ§ã¢ãã«ãåŒãç¶ã䜿ãããšãã§ãããŸãæ§ã¢ãã«ã§åŠç¿ããããšãã§ããŸã
- ããã©ã«ãã®JVNVã¢ãã«ã¯çŸåšã¯æ§verã®ãŸãŸã§ã
æ¹å
Merge.bat
ã§ã声é³ããŒãžãããã现ããã声質ããšã声ã®é«ããã®ç¹ã§ããŒãžã§ããããã«ã
ãã°ä¿®æ£
- PyTorchã®ããŒãžã§ã³ã«ç±æ¥ãããã°ãä¿®æ£ïŒtorchã®ããŒãžã§ã³ã2.1.2ã«åºå®ïŒ
â
ïŒããã·ã¥ãé·é³èšå·ã§ã¯ãªãïŒã2é£ç¶ãããšåŠç¿ã»é³å£°åæã§ãšã©ãŒã«ãªããã°ãä¿®æ£- ãäžåãçããïŒæ¯é³ãã®ã¢ã¯ã»ã³ãã®ä»®åè¡šèšãããµãã³ãçã«ãªãããŸãå¶ã«ãšã©ãŒãçºçããåé¡ãä¿®æ£ïŒãããã®é³çŽ è¡šèšãå éšçã«ã¯ãNãã§çµ±äžïŒ
v1.3 (2024-01-09)
倧ããå€æŽ
- å
ã
ã®Bert-VITS2ã«ååšãããæ¥æ¬èªã®çºé³ã»ã¢ã¯ã»ã³ãåŠçéšåã®ãã°ãä¿®æ£ã»ãªãã¡ã¯ã¿ãªã³ã°
è»äž¡
ãã·ã£ãªãšãª
ãæã
ããªã¢ãª
ãèŠã€ãã
ãããã±ã«
çã«çºé³ã»åŠç¿ãããŠããããã®åèªä»¥éã®ã¢ã¯ã»ã³ãæ å ±ãå šãŠæ»ãã§ããç§ã¯ãããèŠã
ã®ã¢ã¯ã»ã³ããã¯âã¿ã·âã¯ããœâã¬âãªããâã«
ã ã£ãã®ãã¯âã¿ã·ã¯ããœâã¬ãªããâã«
ã«ä¿®æ£- åŠç¿ã»é³å£°åæã§ç¡èŠãããŠããã¢ã«ãã¡ãããã»ã®ãªã·ã£æåãç¡èŠããªãããã«å€æŽïŒåºæ¬ã¯ã¢ã«ãã¡ãããèªã¿ã ãã©ç°¡åãªåèªã¯èªããããããåŠç¿ã®éã¯å¿µã®ããã«ã¿ã«ãçã«ããã»ããããã§ãïŒ
- ä¿®æ£ã®åœ±é¿ã§ãååŠçæã«ïŒä»ãŸã§ç¡èŠãããŠããïŒèªããªã挢åçã§åŒã£ãããããã«ãªããŸããããã®å Žåã¯æžãèµ·ããã確èªããŠä¿®æ£ããããã«ããŠãã ããã
- ã¢ã¯ã»ã³ãã調æŽããŠé³å£°åæã§ããããã«ïŒå®å šã«å¶åŸ¡ã§ããããã§ã¯ãªããæ¹åãããå ŽåãããïŒã
ãããŸã§ã®ã¢ãã«ããããŸã§éã䜿ããã¢ã¯ã»ã³ããçºé³çãæ¹åãããå¯èœæ§ããããŸããæ°ããããŒãžã§ã³ã§åŠç¿ãçŽããšããè¯ããªãå¯èœæ§ããããŸãããåçã«è¯ããªããã¯åãããŸããã
æ¹å
Dataset.bat
ã®é³å£°ã¹ã©ã€ã¹ãšæžãèµ·ãããããã«ã¹ã¿ãã€ãºã§ããããã«ïŒã¹ã©ã€ã¹ã®ç§æ°èšå®ãæžãèµ·ããã®Whisperã¢ãã«æå®ãèšèªæå®çïŒStyle.bat
ã®ã¹ã¿ã€ã«åãã§ãã¹ã¿ã€ã«ããšã®ãµã³ãã«é³å£°ãæå®ããæ°ã ãè€æ°åçã§ããããã«ããŸãæ°ãã次å åæžæ¹æ³ïŒUMAPïŒãšæ°ããã¹ã¿ã€ã«åãã®æ¹æ³ïŒDBSCANïŒãè¿œå ïŒUMAPã®ã»ããããã¹ã¿ã€ã«ãåããããããããŸããïŒApp.bat
ã§ã®é³å£°åææã«è€æ°è©±è ã¢ãã«ã®å Žåã«è©±è ãæå®ã§ããããã«- colabã®ããŒãããã¯ã§ãé³å£°ãã¡ã€ã«ã®ã¿ããããŒã¿ã»ãããäœæãããªãã·ã§ã³éšåãè¿œå
- ã¯ã©ãŠãå®è¡çã®éã«ãã¹ã®æå®ããã¡ãã§ã§ããããã«ããã¹ã®èšå®ã
configs/paths.yml
ã«ãŸãšããïŒcolabã®ããŒãããã¯ãããã«äŒŽã£ãŠæŽæ°ïŒãããã©ã«ãã¯dataset_root: Data
ãšassets_root: model_assets
ãªã®ã§ãã¯ã©ãŠãçã§ããæ¹ã¯ãããå€æŽããŠãã ããã - ã©ã®ã¹ãããæ°ã®åºåããããã®ãäžã€ã®ãææšãšã㊠SpeechMOS ã䜿ãã¹ã¯ãªãããè¿œå ïŒ
python speech_mos.py -m <model_name>
ã¹ãããããšã®èªç¶æ§è©äŸ¡ã衚瀺ãããmos_results
ãã©ã«ãã®mos_{model_name}.csv
ãšmos_{model_name}.png
ã«çµæãä¿åããããèªã¿äžãããããæç« ãå€ãããã£ããäžã®ãã¡ã€ã«ãåŒã£ãŠåèªèª¿æŽããŠãã ããããããŸã§ã¢ã¯ã»ã³ããææ
è¡šçŸãææãå
šãèããªãåºæºã§ã®è©äŸ¡ã§ãç®å®ã®ã²ãšã€ãªã®ã§ãå®éã«èªã¿äžããããŠéžå¥ããã®ãäžçªã ãšæããŸãã
- åŠç¿æã®ãŠã©ãŒã ã¢ãããªãã·ã§ã³ãæ©èœããããã«ïŒ @kale4eat æ§ã«ããPRã§ããããããšãããããŸãïŒïŒãååŠçæã«çæããã
config.json
ã®train
ã®warmup_epochs
ãå€æŽããããšã§ããŠã©ãŒã ã¢ããã®ãšããã¯æ°ãå€æŽã§ããŸããããã©ã«ãã¯0
ã§ä»ãŸã§ãšåãåŠç¿çã®æåã§ãã
ãã®ä»
Dataset.bat
ã®é³å£°ã¹ã©ã€ã¹ã§ããŒãã©ã€ãºæ©èœãåé€ïŒåŠç¿ååŠçã§è¡ããããïŒTrain.bat
ã®é³éããŒãã©ã€ãºãšç¡é³åãè©°ããããã©ã«ãã§ãªãã«å€æŽ- åŠç¿æã®é²æãå šäœãšããã¯æ°ã§è¡šç€ºããåŠç¿å šäœã®é²æãèŠãããããã«( @RedRayz æ§ã«ããPRã§ããããããšãããããŸãïŒ)
- ãã®ä»ãã°ä¿®æ£çïŒ @tinjyuu æ§ã @darai0512 æ§ããããšãããããŸãïŒïŒ
config.json
ã«ã¹ã¿ã€ã«åã蟌ã¿éšåãåŠç¿ããªãfreeze_style
ãªãã·ã§ã³ãè¿œå ïŒããã©ã«ãã¯false
ïŒ
TIPS
- æ¥æ¬èªåŠç¿ã®å Žåã
config.json
ã®freeze_bert
ãšfreeze_en_bert
ãtrue
ã«ããŠãããšãè±èªãšäžåœèªã®çºè©±èœåãåŠç¿ã®éçšã§èœã¡ãªããããããŸããããããŸãæ¯èŒããŠããªã®ã§åãããŸããã
v1.2 (2023-12-31)
- ã°ã©ãããªããŠãŒã¶ãŒã§ã®é³å£°åæããµããŒãã
Install-Style-Bert-VITS2-CPU.bat
ã§ã€ã³ã¹ããŒã«ã - Google Colabã§ã®åŠç¿ããµããŒããããŒãããã¯ãè¿œå
- é³å£°åæã®APIãµãŒããŒãè¿œå ã
python server_fastapi.py
ã§èµ·åããŸããAPIä»æ§ã¯èµ·ååŸã«/docs
ã«ãŠç¢ºèªãã ãããïŒ @darai0512 æ§ã«ããPRã§ããããããšãããããŸãïŒïŒ - åŠç¿æã«èªåçã«ããã©ã«ãã¹ã¿ã€ã« Neutral ãçæããããã«ãç¹ã«ã¹ã¿ã€ã«æå®ãå¿ èŠã®ãªãæ¹ã¯ãåŠç¿ããããã®ãŸãŸé³å£°åæãè©ŠããŸãããããŸã§éãã¹ã¿ã€ã«ãèªåã§äœãããšãã§ããŸãã
- ããŒãžæ©èœã®æ°èŠè¿œå :
Merge.bat
,webui_merge.py
- ååŠçã®ãªãµã³ããªã³ã°æã«é³å£°ãã¡ã€ã«ã®éå§ã»çµäºéšåã®ç¡é³ãåé€ãããªãã·ã§ã³ãè¿œå ïŒããã©ã«ãã§ãªã³ïŒ
ã¹ã¿ã€ã«ããã¹ã (style text)
ãã¹ã¿ã€ã«æå®ãšçŽããããã£ãã®ã§ãã¢ã·ã¹ãããã¹ã (assist text)
ã«å€æŽ- ãã®ä»ã³ãŒãã®ãªãã¡ã¯ã¿ãªã³ã°
v1.1 (2023-12-29)
- TrainãšDatasetã®WebUIã®æ¹è¯ã»èª¿æŽïŒäžæ¬äºååŠçãã¿ã³çïŒ
- ååŠçã®ãªãµã³ããªã³ã°æã«é³éãæ£èŠåãããªãã·ã§ã³ãè¿œå ïŒããã©ã«ãã§ãªã³ïŒ
v1.0 (2023-12-27)
- åç