Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

  > He’s training an LLM with a corpus of one-off policies for solving specific manipulation tasks, and claiming to get robust ad hoc policies from it for previously unsolved tasks.
It seems clear that many people do not understand that this is the key breakthrough: solving arbitrary tasks after learning previous, unrelated tasks.

In my opinion that really is a good definition of intelligence, and puts this technique at the forefront of machine intelligence.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: