Is the 15mb basically embeddings from the video screenshots? What would it recal... | Hacker News


pseudosavant on Feb 21, 2024 | on: The killer app of Gemini Pro 1.5 is using video as...

Is the 15 MB basically embeddings from the video screenshots? What would it recall if the screenshots aren't saved?

rlt on Feb 22, 2024

I’m not sure if the above product does this, but you could use a multimodal model to extract descriptions of the screenshots and store those in a vector database with embeddings.
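
A rough sketch of that idea, for illustration only. It assumes the descriptions have already been produced by some multimodal model (that step is not shown), and it swaps in a toy bag-of-words vector for a real embedding model and an in-memory list for a real vector database. `toy_embed` and `DescriptionStore` are made-up names, not part of any product or library. The point is just that only text and vectors are kept, so recall works without the screenshots themselves:

```python
import math
from collections import Counter
from dataclasses import dataclass, field


def toy_embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: sparse word-count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class DescriptionStore:
    """Hypothetical in-memory substitute for a vector database."""
    rows: list = field(default_factory=list)

    def add(self, timestamp_s: float, description: str) -> None:
        # Only the timestamp, the text, and its vector are stored;
        # the screenshot itself is never kept.
        self.rows.append((timestamp_s, description, toy_embed(description)))

    def search(self, query: str, k: int = 1) -> list:
        # Rank stored descriptions by similarity to the query text.
        q = toy_embed(query)
        scored = [(cosine(q, vec), ts, desc) for ts, desc, vec in self.rows]
        scored.sort(key=lambda row: row[0], reverse=True)
        return [(ts, desc) for _, ts, desc in scored[:k]]


# Descriptions here are hand-written placeholders for model output.
store = DescriptionStore()
store.add(12.0, "terminal window showing a failed pytest run")
store.add(95.5, "browser tab open on a flight booking page")
store.add(240.0, "spreadsheet with quarterly revenue figures")

print(store.search("flight booking"))
```

With a setup like this, what can be recalled is limited to whatever the descriptions captured; any detail the model did not write down is gone once the image is discarded.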
