Making a sample-based text-to-speech
Hey guys,
I was wondering if you might be able to give any pointers on starting out for building something in Max For Live centered around making a sample-based text-to-speech. I'm using it as a project to learn Max and see what different things it can do, though I'm a beginner and not sure what processes to look at.
Currently my idea for how the processing could work is as follows:
Text box with intended message put into it.
Separate each letter/space/symbol to run into a reader.
Delay inputs to for each symbol that is put through
Have different objects scanning for what letter it is and sending a signal to the corresponding sound file.
Once a sound file finishes, go back to step three to find what the next symbol in the queue is to play.
Probably explained it rather poorly but would love any pointers people may have. Kind of want to have fun snipping a bunch of different consonants and vowels together and seeing what mess I come out with.