First you must determine which features you wish to extract from the audio source: amplitude, pitch, spectral content? When you know this, then there are tools in MaxMSP that will allow you access to this data.
I've made this patch already where, pitch is vertical, amplitude thickness and a lot of other stuff, using xsense sensor to do opacity and so on, It was part processing, part max, but I've been redoing it in max solely. The best pitchtracker around according to me is sigmund~
if you want to further discuss this, you can contact me oflist at pieter.coussement at ugent dot be
(it's not that I don' want to share the patch, but it has some restrictions due to our research group's involvement
I'm afraid I'm not a Jitter expert (or a Max one, while we're at it!); to move smoothly between control values, use [float] --> [pack f 100.] --> [line 0.] --> smoothed values; in this examples the value 100 gives [line] its interpolation time in ms: