...why not just record the audio and pitch it up? That's what the studios have been doing for more than half a century. Do you need it to happen in realtime? If so, then a pitchshifter is in order - maxforlive.com has a few - both under "pitch shifter" and "pitchshifter", for some reason.
Real Mickey Mousing (it's actually called that) is a time-based effect, so you'll probably get more authentic results using the classic delay line method than you would with gizmo.
Have a look at http://msp.ucsd.edu/techniques/latest/book-html/node125.html which shows a PD version, which is pretty straightforward to translate into Max - just substitute a tapin-tapout pair for vd~ and a pong~ for wrap~,