
With LM Studio you can configure the context window freely. The max is 131072 tokens for gpt-oss-20b.


Yes, but if I set it above ~16K on my 32 GB laptop it just OOMs. Am I doing something wrong?


Try enabling flash attention and offloading all layers to the GPU.
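
For a rough sense of why memory climbs with context length, here's a back-of-the-envelope KV-cache estimate. It's only a sketch: the layer count, KV-head count, and head size below are made-up illustrative values, not gpt-oss-20b's actual config, and it assumes full attention in every layer with 16-bit cache entries. Models that use sliding-window attention in some layers, or a quantized KV cache, will need less. It also ignores the model weights and compute buffers, which come on top.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, bytes_per_value=2):
    """Rough KV-cache size: keys and values for every layer and every token."""
    # The leading 2 accounts for one key tensor plus one value tensor per layer.
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_value


# Hypothetical model shape for illustration only (NOT gpt-oss-20b's real config).
example = dict(n_layers=32, n_kv_heads=8, head_dim=128)

for ctx in (16_384, 131_072):
    gib = kv_cache_bytes(context_len=ctx, **example) / 2**30
    print(f"{ctx:>7} tokens -> {gib:.1f} GiB")
```

With these made-up numbers the cache grows linearly from 2 GiB at 16K to 16 GiB at 131072 tokens, so going from 16K to the max context is an 8x jump in cache memory alone.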



