First, go through the tutorials; you'll be better at asking questions.
Second, be sure to spend some time looking at those objects commonly used for extracting data from a Jitter matrix. The list includes (but isn't limited to) the jit.3m object, learning how to use the jit.submatrix or jit.scissors objects to "slice" a matrix to grab portions of an image, becoming adept at the use of jit.spill (which outputs lists of data about an image that you'll have little trouble working with by virtue of learning how to break up and route lists from having done the tutorials) learning those messages that let you get cell or plane values from an image.
On the other side, It's a matter of learning how MSP objects work to produce audio, and the kinds of messages that they expect. Connect the stuff in paragraph 1 to the stuff in this paragraph and you're on your way.
Thank you very much for your quick reply, I started going through the tutorials and the are very helpful. I just wanted to know if video sonification is possible using this software, or whether I should find something else.
The short answer is "Yes, it's definitely possible." Lots and lots of Max users are engaged at this very moment in approaching this problem in a variety of different ways [since there is no single way of doing it, and everyone has their own way of approaching it].