Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
HuggingFace/open-r1: open reproduction of DeepSeek-R1 (github.com/huggingface)
63 points by ianrahman on Jan 27, 2025 | hide | past | favorite | 1 comment


I really wonder how much the self-supervised data flywheel will be enough to reproduce this model or not.

There are also so many different tweaks that could be made to try to get these models to perform better, kudos to the team reproducing it all in the open




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: