Hacker News

Is the 15 MB basically embeddings from the video screenshots? What would it be able to recall if the screenshots themselves aren't saved?


I’m not sure if the above product does this, but you could use a multimodal model to extract descriptions of the screenshots and store those in a vector database with embeddings.
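To make that idea concrete, here is a rough sketch rather than how the product above actually works. Everything in it is an assumption for illustration: `describe_screenshot` is a hypothetical stand-in for a multimodal model call (it returns canned text), `embed` is a toy hashed bag-of-words function instead of a real embedding model, and `VectorStore` is a small in-memory list instead of a real vector database.

```python
import hashlib
import math
from dataclasses import dataclass, field

DIM = 256  # arbitrary vector size for this toy example


def describe_screenshot(path: str) -> str:
    """Hypothetical stand-in for a multimodal model; returns canned text."""
    canned = {
        "shot_001.png": "terminal window running a python unit test suite",
        "shot_002.png": "browser showing a flight booking page for tokyo",
        "shot_003.png": "spreadsheet with quarterly sales figures",
    }
    return canned[path]


def embed(text: str) -> list[float]:
    """Toy hashed bag-of-words embedding, L2-normalized. Not a real model."""
    vec = [0.0] * DIM
    for token in text.lower().split():
        # Hash each token into one of DIM buckets and count occurrences.
        idx = int(hashlib.md5(token.encode()).hexdigest(), 16) % DIM
        vec[idx] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


@dataclass
class VectorStore:
    """In-memory store of (source, description, vector) rows."""
    rows: list[tuple[str, str, list[float]]] = field(default_factory=list)

    def add(self, source: str, description: str) -> None:
        # Only the text and its vector are kept; the image is not stored.
        self.rows.append((source, description, embed(description)))

    def search(self, query: str, k: int = 1) -> list[tuple[float, str, str]]:
        # Vectors are unit length, so the dot product is cosine similarity.
        q = embed(query)
        scored = [
            (sum(a * b for a, b in zip(q, vec)), source, desc)
            for source, desc, vec in self.rows
        ]
        scored.sort(key=lambda row: row[0], reverse=True)
        return scored[:k]


if __name__ == "__main__":
    store = VectorStore()
    for path in ["shot_001.png", "shot_002.png", "shot_003.png"]:
        store.add(path, describe_screenshot(path))
    for score, source, desc in store.search("flight booking tokyo", k=1):
        print(f"{score:.2f} {source}: {desc}")
```

In a setup like this, only the description text and its vector are kept per screenshot, so a query can bring back what was on screen in words but not the original pixels.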



