2017幎9æ6æ¥ãããŒã¿åæãšæ©æ¢°åŠç¿ã«é¢ããOpenDataScienceãªãŒãã³ã³ãŒã¹ã®2åç®ã®éå§ãéå§ãããŸãã ä»åã¯ãã©ã€ãã¬ã¯ãã£ãŒãéå¬ããããµã€ãã¯Mail.Ru Groupã®ã¢ã¹ã¯ã¯ãªãã£ã¹ã«ãªããŸãã
èŠããã«ãã³ãŒã¹ã¯HabrÃ©ïŒ ãããæåïŒã®äžé£ã®èšäºãè€è£œãããè³æïŒJupyterããŒãããã¯ããã¡ããã³ãŒã¹ã®githubãªããžã㪠ïŒã宿é¡ãKaggle Inclassã³ã³ãã¹ãããã¥ãŒããªã¢ã«ãåã
ã®ããŒã¿åæãããžã§ã¯ãã§æ§æãããŠããŸãã ããã§ã³ãŒã¹ã«ãµã€ã³ã¢ããã§ããŸãã ãã㧠ãOpenDataScienceã³ãã¥ããã£ã«åå ããŠãã ãããããã§ã¯ãã³ãŒã¹äžã«ãã¹ãŠã®éä¿¡ãè¡ãããŸãïŒSlack ODSã®ãã£ãã«#mlcourse_openïŒã ãããŠããã詳现ã«èšãã°ãããã¯ç«ã®äžã§ããªãã®ããã§ãã
èšäºã®æŠèŠ
ã³ãŒã¹ã®ç¹åŸŽã¯äœã§ãã
ãã®ã³ãŒã¹ã®ç®æšã¯ãæ¢åã®ç¥èããã°ããæŽæ°ããããã«åŠç¿ããããã®ãããã¯ãèŠã€ããããšã§ãã ã³ãŒã¹ã¯ããã®ãããã¯ã®æåã®ã³ãŒã¹ã«å®å
šã«é©åãããšã¯èããããŸããã ããŒã¿åæãšæ©æ¢°åŠç¿ã«é¢ããå
æ¬çãªã³ãŒã¹ãäœæããã¿ã¹ã¯ãèšå®ããŸããã§ããããçè«ãšå®è·µã®å®ç§ãªçµã¿åããã§ã³ãŒã¹ãäœæããããšèããŸããã ãããã£ãŠãã¢ã«ãŽãªãºã ã¯æ°åŠã䜿çšããŠååã«è©³çŽ°ã«èª¬æãããå®è·µçãªã¹ãã«ã¯å®¿é¡ã競æãåã
ã®ãããžã§ã¯ãã«ãã£ãŠãµããŒããããŠããŸãã
ãã®ç¹å®ã®ã³ãŒã¹ã®å€§ããªãã©ã¹ã¯ããã©ãŒã©ã ïŒOpenDataScienceã®Slackã³ãã¥ããã£ïŒã§ã®ã¢ã¯ãã£ãã©ã€ãã§ãã äžèšã§èšãã°ãOpenDataScienceã¯ãã·ã¢èªã話ãDataScientistsã®æ倧ã®ã³ãã¥ããã£ã§ããã Data Festã®éå¬ãå«ãå€ãã®ã¯ãŒã«ãªããšãè¡ã£ãŠããŸã ã åæã«ãã³ãã¥ããã£ã¯ç©æ¥µçã«Slackã«äœãã§ããŸããããã§ã¯ã誰ã§ãDS質åãžã®åçãèŠã€ãããããããžã§ã¯ãã®å¿ãåãããã人ã
ãååãèŠã€ããããä»äºãèŠã€ãããããããšãã§ããŸãã ãªãŒãã³ã³ãŒã¹çšã«å¥ã®ãã£ã³ãã«ãäœæãããŸããããã®ãã£ã³ãã«ã§ã¯ãæ°ãããããã¯ãç¿åŸããã®ã«åœ¹ç«ã€3ã400人ã®äººã
ãåãããšãå匷ããŸãã
çŽ æã®ãã¬ãŒã³ããŒã·ã§ã³åœ¢åŒãéžæããŠãHabréããã³JupyterããŒãããã¯ã«é¢ããèšäºãäœæããŸããã ããã§ãã©ã€ãè¬çŸ©ãšãã®ãããªãè¿œå ãããŸãã
ã³ãŒã¹ã®å¯Ÿè±¡è
ãšæºåæ¹æ³
åææ¡ä»¶ïŒå°é倧åŠã®2幎次ã¬ãã«ã§æ°åŠïŒç·åœ¢ä»£æ°ã解æ幟äœåŠãæ°åŠçåæã確çè«ãçµ±èšïŒãç¥ã£ãŠããå¿
èŠããããŸãã Pythonã§å°ãããã°ã©ãã³ã°ã§ããå¿
èŠããããŸãã
ååãªç¥èãã¹ãã«ããªãå Žåã¯ãã·ãªãŒãºã®æåã®èšäºã§ãæ°åŠãç¹°ãè¿ããPythonããã°ã©ãã³ã°ã¹ãã«ãæŽæ°ïŒãŸãã¯ååŸïŒããæ¹æ³ã«ã€ããŠèª¬æããŸãã
ã¯ããè±èªã®ç¥èã¯ãã¡ããããŠãŒã¢ã¢ã®ã»ã³ã¹ãå·ã€ããŸããã
ã³ãŒã¹ã«ã¯äœãå«ãŸããŠããŸããïŒ
èšäº
Habrã«è³ããèšäºã®åœ¢ã§è³æãæåºããŸããã ãã®ããããã€ã§ãçŽ æã®é©åãªéšåããã°ããç°¡åã«èŠã€ããããšãã§ããŸãã èšäºã¯æ¢ã«æºåãã§ããŠããã9æãã11æã«éšåçã«æŽæ°ãããåŸé
ããŒã¹ãã£ã³ã°ã«é¢ããå¥ã®èšäºãè¿œå ãããŸãã
ã·ãªãŒãºã®èšäºã®ãªã¹ãïŒ
- ãã³ãã䜿çšããäžæ¬¡ããŒã¿åæ
- Pythonã䜿çšããããžã¥ã¢ã«ããŒã¿åæ
- åé¡ã決å®æšãããã³æè¿åæ³
- ç·åœ¢åé¡ããã³ååž°ã¢ãã«
- æïŒãã®ã³ã°ãã©ã³ãã ãã©ã¬ã¹ã
- æšèã®äœæãšéžæã ã¯ãŒãããç»åãããã³ãžãªããŒã¿ã¿ã¹ã¯ã®ã¢ããªã±ãŒã·ã§ã³
- æåž«ãªãåŠç¿ïŒPCAãã¯ã©ã¹ã¿ãªã³ã°
- Vowpal Wabbitã«ããã®ã¬ãã€ãããŒã¹ã®ãã¬ãŒãã³ã°
- Pythonæç³»ååæ
- åŸé
ããŒã¹ã
è¬çŸ©
è¬çŸ©ã¯ã9æ6æ¥ãã11æ8æ¥ãŸã§ã®19:00ãã22.00ãŸã§ã®æ°Žææ¥ã«ãMail.Ru Groupã®ã¢ã¹ã¯ã¯äºåæã§éå¬ãããŸãã è¬çŸ©ã§ã¯ãèšäºã§èª¬æãããŠããã®ãšåãèšç»ã«åŸã£ãŠãçè«å
šäœãåæããŸãã ããããè¬åž«ã«ããã¿ã¹ã¯ã®ã©ã€ããã£ã¹ã«ãã·ã§ã³ãè¡ãããåè¬çŸ©ã®æåŸã®1æéã¯ç·Žç¿ã«å°å¿µããŸã-åŠçã¯ããŒã¿ãèªåã§åæãïŒãããçŽæ¥ã³ãŒããæžãïŒãè¬åž«ããããæäŒããŸãã çŸåšã®è©äŸ¡ã®ã³ãŒã¹ã®äžäœ30人ã®åå è
ãè¬çŸ©ã«åå ã§ããŸãã ã©ã³ãã³ã°ã¯ã宿é¡ã競æäŒãããŒã¿åæãããžã§ã¯ãã®åœ±é¿ãåããŸãã è¬çŸ©æŸéãéå¬ãããŸãã
è¬åž«ïŒ
- ãŠãŒãªã»ã«ã·ããã㌠ã Mail.Ru Groupã®ããã°ã©ããŒç 究è
ã§ãããHSEã³ã³ãã¥ãŒã¿ãŒãµã€ãšã³ã¹åŠéšã®äžçŽè¬åž«ã§ãããHSEã§ã®ããŒã¿åæã®å¹Žéæè²ããã°ã©ã ã®æåž«ã§ããããŸãã
- ã¢ã¬ã¯ã»ã€ã»ãã€ããã³ OpenDataScienceã³ãã¥ããã£ã®åµèšè
ã§ãããDigineticaã®æé«ããŒã¿è²¬ä»»è
ã§ããDM Labsã 以åã¯ãããã€ãã®åæéšéã®è²¬ä»»è
ã DataFestã®äž»å¬è
ã§ããOpenDataScienceã³ãã¥ããã£ã®ã€ããªãã®ãŒãªãŒããŒã
- ããããªãŒã»ã»ã«ã²ã€ãšã ã Zeptolabã®ããŒã¿ãµã€ãšã³ãã£ã¹ããã¢ã¹ã¯ã¯å·ç«å€§åŠã®æ°çãã¡ã€ãã³ã¹ã»ã³ã¿ãŒã®è¬åž«ã
ããã§ã³ãŒã¹èšäºã®ãã¹ãŠã®èè
ã«ã€ããŠèªãããšãã§ããŸã ã
宿é¡
10ã®ãããã¯ã«ã¯ãããã宿é¡ãä»ãã1é±éãäžããããŸãã ã¿ã¹ã¯ã¯JupyterããŒãããã¯ã®åœ¢åŒã§ãããããã«ã³ãŒããè¿œå ããããã«åºã¥ããŠGoogleã®åœ¢åŒã§æ£ããçããéžæããå¿
èŠããããŸãã 宿é¡ã¯ãã³ãŒã¹ã®åå è
ã®è©äŸ¡ã«åœ±é¿ãäžãå§ããããã«å¿ããŠã誰ãã©ã€ãã§è¬çŸ©ã«åå ã§ããããã«ãªãããæåã®ãã®ã§ãã
ã³ãŒã¹ãªããžããªã§ã 10ã®å®¿é¡ãšãœãªã¥ãŒã·ã§ã³ãèŠãããšãã§ããŸãã ã³ãŒã¹ã®æ°ããç«ã¡äžãã§ã¯ã宿é¡ã¯æ°ãããªããŸãã
ãã¥ãŒããªã¢ã«
ã³ãŒã¹äžã®åµé çãªã¿ã¹ã¯ã®1ã€ã¯ãããŒã¿åæãšæ©æ¢°åŠç¿ã®åéãããããã¯ãéžæããããã«é¢ãããã¥ãŒããªã¢ã«ãäœæããããšã§ãã ããã§ã®äŸãç¥ãããšãã§ããŸã ã çµéšã¯æåããããšãå€æããã³ãŒã¹åå è
èªèº«ããã³ãŒã¹ã§èæ
®ãããªãã£ããããã¯ã«é¢ããããã€ãã®éåžžã«å
å®ãªèšäºãæžããŸããã
Kaggle Inclassã³ã³ããã£ã·ã§ã³
ãã¡ãããã©ãã§ãããŒã¿ãåæããç·Žç¿ãããªããŠãã競äºã®äžã§ããã«äœããåŠã³ããã®æ¹æ³ãåŠã¶ããšãã§ããŸãã ããã«ãããŸããŸãªãã³ã®åœ¢ã§ã®åæ©ïŒã倧ããªãKaggleã§ã®ãéãšæ Œä»ãããããŠç§ãã¡ãç¥ã£ãŠããæ Œä»ãã®åœ¢ã§ã®æ Œä»ãïŒã¯ãããŒã¿åæ競äºäžã®æ°ããæ¹æ³ãšã¢ã«ãŽãªãºã ã®éåžžã«æŽ»çºãªç 究ã«è²¢ç®ããŸãã ã³ãŒã¹ã®æåã®éå§ã§ã¯ãéåžžã«èå³æ·±ãåé¡ã解決ããã2ã€ã®ã³ã³ãã¹ããæäŸãããŸããã
- ã€ã³ã¿ãŒãããã§ã®è¡åã«ããæ»æè
ã®èå¥ ã ããŸããŸãªãµã€ãã蚪åããŠãããŠãŒã¶ãŒã«é¢ããå®éã®ããŒã¿ãããã30å以å
ã«èšªåãããµã€ãã®ã·ãŒã±ã³ã¹ãããããã誰ãã¢ãªã¹ã§ãããä»ã®èª°ãã§ããããç解ããå¿
èŠããããŸããã
- Habréã«é¢ããèšäºã®äººæ°ã®äºæž¬ ã ãã®ã¿ã¹ã¯ã§ã¯ãããã¹ããæéãããã³Habréã®åºçã®å
åã«ãããšããã®èšäºã®äººæ°-ãæ°ã«å
¥ããžã®è¿œå ã®æ°ãäºæž¬ããå¿
èŠããããŸããã
åå¥ãããžã§ã¯ã
Vkontakteã®å
¬é ãæ人ç·æ§åãã®æ©æ¢°åŠç¿ã«é¢ããããŒã ãããã
ã³ãŒã¹ã¯2.5ãæéèšèšãããŠãããå€ãã®ã¢ã¯ãã£ããã£ãèšç»ãããŠããŸãã ãã ããæåž«ãææ¡ããèšç»ã«åŸã£ãŠãç¬èªã®ããŒã¿ã䜿çšããŠãæåããæåŸãŸã§ç¬èªã®ããŒã¿åæãããžã§ã¯ããå®äºããå¯èœæ§ãå¿
ãæ€èšããŠãã ããã ãããžã§ã¯ãã¯ååãšè©±ãåãããšãã§ããã³ãŒã¹ã®çµããã«ãããžã§ã¯ãã®ãã¢ã¬ãã¥ãŒæ€èšŒãæé
ãããŸãã
ãããžã§ã¯ãã®è©³çŽ°ã«ã€ããŠã¯åŸã»ã©èª¬æããŸãããä»ã®ãšããã¯ãããããžã§ã¯ãã®ããã«äœããäºæž¬ãããããã«ã©ã®ãããªããŒã¿ã䜿çšããããèãããããããŸããã ããããã¢ã€ãã¢ããªããã°ãåé¡ãããŸãããåæã®ããã«èå³æ·±ãã¿ã¹ã¯ãšããŒã¿ãã¢ããã€ã¹ããŸãããããã¯è€éãã®ç¹ã§ç°ãªãå ŽåããããŸãã
ã³ãŒã¹ã«ç»é²ããã«ã¯ã©ãããã°ããã§ããïŒ
ã³ãŒã¹ã«åå ããã«ã¯ã ãã®ã¢ã³ã±ãŒãã«èšå
¥ããOpenDataScience ã³ãã¥ããã£ã«åå ããŠãã ããïŒãOpenDataScienceã«ã€ããŠã©ããã£ãŠç¥ããŸãããïŒãã®ãmlcourse_openããšçããŠãã ããïŒã ã³ãŒã¹å
šäœã®éä¿¡ã®ã»ãšãã©ã¯ãïŒmlcourse_openãã£ãã«ã®Slack OpenDataScienceã§è¡ãããŸãã
ã³ãŒã¹ã®æåã®å®è¡ã¯ã©ãã§ããã
æåã®æã¡äžãã¯2017幎2æãã6æã«è¡ãããçŽ1,000人ããµã€ã³ã¢ããããæåã®å®¿é¡ã¯520人ãæåŸã¯150人ã§ããã ãã©ãŒã©ã ã§ã®ç掻ã¯æ¬æ Œçã§ãKaggleã³ã³ãã¹ãã§æ°ååã®å°å
ãäœãããã³ãŒã¹åå è
ã¯å€æ°ã®ãã¥ãŒããªã¢ã«ãäœæããŸããã ãŸããã¬ãã¥ãŒããå€æãããšããã¥ãŒã©ã«ãããã¯ãŒã¯ãKaggleã®ç«¶äºããŸãã¯æ©æ¢°åŠç¿ã®çè«ã«ããã«èžã¿èŸŒãããšãã§ããçŽ æŽãããçµéšãåŸãŸããã
ã³ãŒã¹ã®ããã100ã®ãã¡ã€ããªã¹ãã«ããŒãã¹ããããããã®ã¯ãMail.Ru Groupã®ã¢ã¹ã¯ã¯äºåæã®mitapã§ãããããã«ã¯ãçŸä»£ã®DSã«é¢é£ãããããã¯ã«é¢ãã3ã€ã®è¬çŸ©ããããŸããã
- Apache Sparkã«ããããã°ããŒã¿åŠçïŒVitaliy KhudobakhshovãOdnoklassnikiïŒã ãããªïŒ part1 ã part2 ;
- ãã¥ãŒã©ã«ãããã¯ãŒã¯ãšãã£ãŒãã©ãŒãã³ã°ã®åºç€ïŒAlex OzerinãReason8.aiïŒã ãã㪠;
- ææ
åæåé¡ã®è§£æ±ºã«ããããã£ãŒãã©ãŒãã³ã°ïŒVitaliy RadchenkoãCiklumïŒã ãã㪠ã
ããŒãã¹ïŒco-cs231nã³ãŒã¹
ãããŠæåŸã«ç§ãã¡ãåã¶æåŸã®ããšïŒ2017幎11æäžæ¬ãæ©æ¢°åŠç¿å
¥éã³ãŒã¹ã®å°å
¥çŽåŸãããSlack ODSã®#mlcourse_openãã£ãã«ã®åãå Žæã§ããã¥ãŒã©ã«ãããã¯ãŒã¯ã§æé«ã®ã³ãŒã¹ã®1ã€ã§ããã¹ã¿ã³ãã©ãŒãã³ãŒã¹ cs231nã®ãConvolutional Neural Networks forèŠèŠèªèãã
ãã®çŽ æŽãããèŠåŸãåŠã¶ã®ã«å¹žé-æ©æ¢°åŠç¿ïŒ ãããŠãããã«ããäºäººã®ä»²é-åæ©ä»ãã®ããã«ã
Andrew Ngã¯ããã£ãŒãã©ãŒãã³ã°å°éã®äžç°ãšããŠãAndrej Karpathyã«ã€ã³ã¿ãã¥ãŒããŸãã