Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

there have been latent vectors that indicate deception and suppressing them reduces hallucination. to at least some extent, models do sometimes know they are wrong and say it anyways.

e: and i’m downvoted because..?



Deception requires the deceiver to have a theory of mind; that's an advanced cognitive capability that you're ascribing to these things, which begs for some citation or other evidence.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: