Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yeah, I'm quite confused that there's no mention of SEARN or LOLS or similar imitation learning algorithms in the references of the Alpha Zero paper. The algorithm for learning looks severely derived from that 10 year old idea.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: