
Tell us more about LoRA


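Instead of updating the full N×N weight matrix, they train two small matrices, N×d and d×N. Assuming d << N, that is a lightweight addition: 2·N·d parameters instead of N². The original matrix is kept unchanged, and the product of the two small matrices is simply added in parallel to it. The addition is initialised to start at or near zero, so training begins from the behaviour of the original model.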
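To make that concrete, here is a rough plain-Python sketch of the idea, not anyone's actual implementation. The sizes (N = 16, d = 2), the helper names (matmul, add, lora_forward), and the init scheme (small random values for the first factor, zeros for the second, which is how I understand the usual LoRA setup) are all my own assumptions for illustration.

```python
import random

def matmul(X, Y):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def add(X, Y):
    """Element-wise sum of two equally shaped nested lists."""
    return [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(X, Y)]

def lora_forward(x, W, A, B):
    """Frozen path x @ W plus the parallel low-rank path (x @ A) @ B."""
    return add(matmul(x, W), matmul(matmul(x, A), B))

random.seed(0)
N, d = 16, 2  # hypothetical sizes; in practice d << N

# Original N x N matrix: kept unchanged (not trained).
W = [[random.gauss(0, 1) for _ in range(N)] for _ in range(N)]
# The two trainable factors. Assumed init: A small random, B all zeros,
# so the added term A @ B is exactly zero before training.
A = [[random.gauss(0, 0.01) for _ in range(d)] for _ in range(N)]  # N x d
B = [[0.0] * N for _ in range(d)]                                  # d x N

x = [[random.gauss(0, 1) for _ in range(N)]]  # a single input row

base = matmul(x, W)             # output of the original layer
out = lora_forward(x, W, A, B)  # output with the low-rank addition

full_params = N * N          # parameters in the original matrix
lora_params = N * d + d * N  # parameters actually trained
```

With the second factor at zero, out matches base before any training, and only lora_params values (here 64 versus 256) would ever be updated.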



