Ask HN: I want to detect when arbitrary things come into a video stream

byoung2 · on Oct 4, 2013

I would imagine it would be pretty slow, but you could have a video stream write to a canvas element at regular intervals. You could analyze the pixels in the canvas to look for a pattern. This would be the hard part, since you'd have to write an engine to detect edges and create a hash of all detectable objects in the frame, then you'd have to match each hash to a known hash of the desired object. To get the known hash, you'd likely have to feed in many example images of your object and create hashes from these.

CookWithMe · on Oct 4, 2013

What you are looking for is an "object recognition" algorithm. These algorithms take one or multiple images of an object (e.g. your little statue) and then try to locate it in another image.

Generally, they are computationally expensive, so performing them on a video stream, especially inside the browser, will most likely not work, even for rather low resolutions.

I have some experience with the SIFT algorithm. Takes some time to grok it as the algorithm consists of several sub-steps.

You may want to look into OpenCV, they have a couple of algorithms implemented.

jlengrand · on Oct 4, 2013

Well, what you describe is extremely difficult to do, and people have been trying to do with for decades now without success.

It could work if you place yourself into similar conditions over all the videos. By having a good understanding (precise model) of what you want to recognize, you could make some pattern recognition / object detection to be able to find it in other videos. But the huge problem in this field is that the conditions keep changing. Light, luminosity, color, shape, distance. All of the makes pretty much every method of recognition of "it" completely useless.

There is one thing you can currently do though. It is target tracking. Without knowing the target before, and with minimal user interaction to follow it, it is currently possible to track it quite accurately.

Hope this helps.

xauronx · on Oct 4, 2013

I've seen people do this with a Kinect I think. Obviously that's more info than just a video stream but I don't know exactly what your limits on your solution are.

michaelmior · on Oct 4, 2013

Are you restricted to running in the browser?

eof · on Oct 4, 2013

not necessarily, but preferably

beachstartup · on Oct 4, 2013

i'm no expert, but i believe machine learning / classification algorithms can help you on this. you just need to train the algorithms on what you want to find.

'finding' objects means feeding the algorithms a series of 'features' that you extract from video and normalize into data structures - color, corners, edges, velocities, relative distances, etc, which are then fed into the algorithms to produce a classification or a probability.

analyze every Nth frame and try to classify found objects. if a known object is found, do something.