Below is a segment of a patch I am working on which is working towards gestural recognition, very simple stuff atm. However, I am just curious to know a bit more about tracking depth (z axis) with webcams.
Tutorial 25 states "By comparing an object's apparent size from one frame to the next, we can even make some crude guesses about its movement toward or away from the camera in the "z axis" (depth)." but is this the only method of doing so? If so, I am a bit disheartened by the "crude guesses", which suggests further inaccuracies.
I understand ms kinect produces much higher accuracy regarding depth and tracking in general but I am keen to promote my patch via webcam due to its accessibility.
Anyway, any info regarding depth would be appreciated