ð°ã§ããããDia 1.6B - è³éãŒãããçãŸããé©åœçé³å£°åæAI
é³å£°åæã¹ã¿ãŒãã¢ãããDiaãã®å°é
ããã«ã¡ã¯ïŒð°ã§ãã仿¥ã¯é³å£°åæã®äžçã«é©åœãèµ·ããã€ã€ããæ°èã¹ã¿ãŒãã¢ãããNari LabsããšåœŒããéçºããé³å£°åæAIãDiaãã«ã€ããŠç޹ä»ãããŽããïŒ
æè¿ãAIã«ããé³å£°åæïŒText-to-Speechãç¥ããŠTTSïŒæè¡ãæ¥éã«é²åããŠããŸãããDiaã¯ãã®äžã§ãç¹ã«æ³šç®ãã¹ãååšã§ãããã£ã2人ã®ããŒã ã§éçºããããã®ã¢ãã«ã¯ãElevenLabsãOpenAIãªã©ã®å€§äŒæ¥ãæäŸãããµãŒãã¹ãšè©ã䞊ã¹ãããããã¯äžéšã®æ©èœã§ã¯äžåãæ§èœãå®çŸããŠããŸãã
ãããå®å šç¡æã®ãªãŒãã³ãœãŒã¹ãšããŠå ¬éãããŠãããã ãŽããïŒããã¯ãããããšãªãã§ãïŒ
Nari Labsãšã¯ïŒè³éãŒãããçãŸããé©ç°ã®ã¢ãã«
Nari Labsã¯ãããã2äººã§æ§æãããå°ããªã¹ã¿ãŒãã¢ããã§ããå ±åéçºè ã®Toby Kimæ°ã«ããã°ã圌ãã¯ãæåããAIã®å°éå®¶ã§ã¯ãªãã£ãããšã®ããšã
éçºã®åååãšãªã£ãã®ã¯ãGoogleã®NotebookLMã®ããããã£ã¹ãæ©èœã«æåããããšã§ããããããå€ãã®å£°ã®ã³ã³ãããŒã«ãããèªç±ãªã¹ã¯ãªãããæãã ããšãã圌ãã¯ãæ¢åãµãŒãã¹ã§ã¯æºè¶³ã§ãããèªãçæ³ã®é³å£°ã¢ãã«ãäœãåºãéãéžãã ã®ã§ãã
é©ãã¹ãããšã«ããè³éãŒããã§éçºãããDiaã¯ãGoogleã®TPU Research CloudãéããŠTPUããããå©çšã§ããããšãæåã®éµãšãªããŸããããŸããHugging Faceã®ZeroGPUã¹ãã³ãµãŒã·ãããæŽ»çšããå€éšè³éãªãã§ãã®é©ç°çãªã·ã¹ãã ãæ§ç¯ããŠããŸãã
çŸåšããã®ã³ãŒããšã¢ãã«ãŠã§ã€ãã¯Hugging FaceãšGithubã§èª°ã§ãç¡æã§å©çšã§ããããã«ãªã£ãŠããŸãããªãŒãã³ãœãŒã¹ãšããŠå ¬éãããŠããããšã§ãé³å£°æè¡ã®æ°äž»åã«å€§ããè²¢ç®ããŠãããã ãŽããïŒ
Diaã®æè¡çç¹åŸŽïŒææ è±ããªå¯Ÿè©±çæãå¯èœ
Diaã®æå€§ã®é åã¯ãææ è¡šçŸã話è ã¿ã°ãéèšèªçãªé³å£°ãã¥ãŒãèªç¶ã«è¡šçŸã§ããç¹ã«ãããŸãã
è€æ°è©±è ã®å¯Ÿè©±çæ
[S1]ã[S2]ãªã©ã®ã¿ã°ã䜿ã£ãŠè©±è ãåºå¥ããããšã§ãè€æ°ã®äººç©ã«ããèªç¶ãªå¯Ÿè©±ãçæã§ããŸããåãã·ãŒãå€ã䜿ããšããããã®å£°ã¯å¯Ÿè©±å šäœãéããŠäžè²«æ§ãä¿ã¡ãŸãã
äŸãã°ïŒ
[S1] ããã«ã¡ã¯ã仿¥ã®å€©æ°ã¯ã©ãã§ããïŒ
[S2] ãšãŠãè¯ã倩æ°ã§ãããæ£æ©ã§ããããã§ããïŒ
[S1] ããã§ããïŒ(laughs) ã¡ããã©éåäžè¶³ã ã£ããã§ãã
ãã®ãããªã¹ã¯ãªããããããŸãã§å®éã®äŒè©±ã®ãããªé³å£°ãçæã§ãããã§ãïŒ
éèšèªã³ãã¥ãã±ãŒã·ã§ã³ã®åçŸ
ã(laughs)ããã(coughs)ããªã©ã®æç€ºãå ¥ãããšãå®éã®ç¬ã声ãå³ãèªç¶ã«åçŸã§ããŸããäžè¬çãªé³å£°åæã§ã¯é£ãããšããããããã®è¡šçŸããDiaã¯ãã¬ãŒã³ããã¹ãããã·ãŒã ã¬ã¹ã«çæå¯èœã§ãã
ä»ã®ã¢ãã«ã§ã¯ãhahaããšèšèãšããŠçºããã ããªã®ã«å¯ŸããDiaã¯å®éã®èªç¶ãªç¬ã声ãçæãããã ãŽããïŒ
é³å£°ã¯ããŒãã³ã°æ©èœ
ç¹å®ã®è©±è ã®å£°ã«çžãããªãæè»æ§ãæã¡åãããŠããŸãããŠãŒã¶ãŒã15ç§çšåºŠã®é³å£°ãµã³ãã«ãã¢ããããŒãããã°ããã®ç¹åŸŽïŒå£°è³ªã»æ»èã»ãã¬ã¹é³ãªã©ïŒãåæ ããé³å£°çæãå¯èœã§ãã
æ¥çæé«å³°ã®è£œåãšæ¯èŒããŠãåŒããåããªãå®å
Nari Labsã¯èªç€ŸãŠã§ããµã€ãã§ãDiaãšElevenLabs StudioãSesame CSM-1Bãšã®æ¯èŒãµã³ãã«ãå ¬éããŠããŸãããã®çµæã¯ãå€ãã®ç¹ã§Diaãåªäœæ§ã瀺ããã®ã§ããã
ç¹ã«éç«ã€ã®ã¯éèšèªè¡šçŸã®åŠçèœåã§ããã(laughs)ããšããã¿ã°ãã¹ã¯ãªããã«ããå Žåãæ¯èŒãããšïŒ
- DiaïŒå®éã®èªç¶ãªç¬ã声ãçæ
- ç«¶å補åïŒãhahaããšããèšèãçºããã ã
ããã«ææ 衚çŸã«ãããŠããDiaã¯å€§ããªåŒ·ã¿ãèŠããŠããŸããç·æ¥äºæ ãæããåçãªã·ãŒã³ã®ãã¹ãã§ã¯ãDiaã話è ã®ç·åŒµæãã¹ãã¬ã¹ã广çã«è¡šçŸããäžæ¹ãä»ã®ã¢ãã«ã§ã¯ææ ã®èµ·äŒãå¹³åŠã«ãªã£ãããäŒè©±ã®ããŒã¹ãäžèªç¶ã«ãªãåŸåãèŠãããŸããã
ãŸããè€éãªã©ããæè©ãªã©ã®ãã¹ãã§ããDiaã¯ãã³ããç¶æããæµæ¢ãªè¡šçŸãå®çŸãã鳿¥œçãªèŠçŽ ãå«ããé«åºŠãªé³å£°çæãå¯èœãªããšã瀺ããŠããŸãã
å®éã«Diaã䜿ã£ãŠã¿ãã
ããŒããŠã§ã¢èŠä»¶
Diaãåããã«ã¯ä»¥äžã®ç°å¢ãå¿ èŠã§ãïŒ
- PyTorch 2.0以äžãšCUDA 12.6ã§åäœ
- çŽ10GBã®VRAMïŒGPUã®ãããªã¡ã¢ãªïŒ
- NVIDIA A4000ãªã©ã®GPUã§ã®åŠçé床ã¯1ç§ãããçŽ40ããŒã¯ã³
æªæ¥çã«ã¯ãCPUãµããŒããšéååããŒãžã§ã³ã®æäŸãèšç»ãããŠãããããå¹ åºãç°å¢ã§ã®å©çšãå¯èœã«ãªãäºå®ã§ãã
Hugging Faceã§ã®ç°¡åãªè©Šãæ¹
ããŒããŠã§ã¢ã®æºåããªããŠããHugging Faceã®ãã¢ããŒãžããããã«è©Šãããšãã§ããŸããããã¹ããå ¥åãããçæããã¿ã³ãæŒãã ãã§éæ³ã®ãããªé³å£°ãçæãããŸãã
ããŒã«ã«ç°å¢ã§ã®å®è¡æ¹æ³
ããæ¬æ Œçã«äœ¿ãããå Žåã¯ãGitHubãããªããžããªãã¯ããŒã³ããŠãããŒã«ã«ç°å¢ã§å®è¡ã§ããŸãïŒ
git clone https://github.com/nari-labs/dia.git
cd dia
python -m venv .venv
source .venv/bin/activate
pip install uv
uv run app.py
Google Colabã§ãç¡æã§è©Šããã®ãå¬ããã§ããïŒ
!git clone https://github.com/nari-labs/dia.git
%cd dia
!python -m venv .venv
!source .venv/bin/activate
!pip install uv
!uv run app.py --share
ããžãã¹æŽ»çšãšä»åŸã®å¯èœæ§
Diaã¯æ§ã ãªåéã§ã®æŽ»çšãæåŸ ãããŠããŸãïŒ
ã³ã³ãã³ãå¶äœ
- ããããã£ã¹ãããã©ãã®èªåé³å£°å
- åç»ãã¬ãŒã·ã§ã³ã®å€å£°å
- ãªãŒãã£ãªããã¯ã®äœæ
ã²ãŒã éçº
- NPCã®åçäŒè©±çæ
- ã·ããªãªéç£ãšæ²¡å ¥æã®äž¡ç«
æ¯æŽæè¡
- 倱èªçæ£è åãäŒè©±è£å©
- å€èšèªã«ã¹ã¿ããŒãµããŒãã®ã声ã®ããŒã«ã©ã€ãºã
åçšå©çšãApache 2.0ã©ã€ã»ã³ã¹ã§èªç±ã«è¡ããŸãããã ããå人ã«ãªãããŸãã誀æ å ±æ¡æ£ãªã©ã®æªçšã¯æç¢ºã«çŠæ¢ãããŠããããšã«æ³šæãå¿ èŠã§ãã
çŸåšã®å¶çŽãšä»åŸã®å±æ
çŸæç¹ã§ã®Diaã«ã¯ãããã€ãã®å¶çŽããããŸãïŒ
-
è±èªã®ã¿ã®å¯Ÿå¿ïŒçŸåšã¯è±èªã®ã¿ããµããŒãããŠããŸãããå°æ¥çã«ã¯æ¥æ¬èªãå«ãå€èšèªå¯Ÿå¿ãäºå®ãããŠããŸãã
-
ããŒããŠã§ã¢èŠä»¶ã®é«ãïŒçŽ10GBã®VRAMãå¿ èŠãªãããäžè¬çãªPCã§ã¯åãããªãã±ãŒã¹ããããŸããéååã¢ãã«ãCPU察å¿ã®éçºã«ããããã®å¶çŽã¯ç·©åãããèŠèŸŒã¿ã§ãã
-
GPUäŸåïŒçŸåšã¯NVIDIA GPUã«äŸåããŠãããCPUã§ã®å®è¡ã¯ãŸã 察å¿ããŠããŸããã
Nari Labsã¯ãããã®å¶çŽãèªèããéååããŒãžã§ã³ã®æäŸãCPU察å¿ã®è¿œå ã«ããããŒããŠã§ã¢èŠä»¶ã®ç·©åãç®æããŠããŸãããŸããäžè¬ãŠãŒã¶ãŒåãã®ãDia Consumerãã®Î²çãå€ãŸã§ã«å ¬éäºå®ãšã®ããšã§ãã
ãŸãšã
Nari Labsã®ãDiaãã¯ãé³å£°åææè¡ã®åéã«æ°ããªå¯èœæ§ããããããŠããŸããããã2人ã®éçºè ãè³éãŒãããçã¿åºãããã®ãªãŒãã³ãœãŒã¹ã¢ãã«ã¯ã倧æäŒæ¥ã®è£œåãšæ¯èŒããŠãéè²ãªãããããäžéšã®é¢ã§ã¯åªããæ§èœãçºæ®ããŠããŸãã
é³å£°åæã®ç«¶äºè»žã¯ããæåâé³å£°ãã§ããªãŒãã³åãæ¥éã«é²ãã§ããŸããNari Labs Diaã¯ãå°èŠæš¡ããŒã ã§ããããã¯ã©ã¹ã®é³å£°äœéšãå®è£ ã§ããããšã蚌æããŸããã
ããªãã®ãµãŒãã¹ããããžã§ã¯ãã«ã声ããå¿ èŠã§ããã°ããŸãããŒã«ã«ã§Diaãåããããã®å¯èœæ§ãäœæããŠã¿ãŠã¯ãããã§ããããïŒ
ãªãŒãã³ãŠã§ã€ãã®é«è¡šçŸTTSãæ®åããã°ããé³å£°ã¯ã¯ã©ãŠãAPIã«å€æ³šããåžžèã ã£ãéçºãããŒããèªåæšè«ïŒãªã³ããã€ã¹åŠçãžäžæ°ã«ã·ããããå¯èœæ§ããããŸããä»åŸã®Diaã®çºå±ã«ç®ãé¢ããŸãããïŒ
åèãªã³ã¯ïŒ
Discussion