Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Wow, this really took me by surprise. I thought the only input was (s_1...s_final, whowon) where s are statates during training and (s_current) during play, and the system would learn the game on its own. That's the way it worked with the Atari games anyway.


I expect the Atari games, if we're thinking of the same articles, had much less strategic depth than playing a Go champion.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: