- Unit selection synthesis
- Diphone synthesis
- Domain specific synthesis
It is a custom filter model based on the acoustic theory of speech production where the vocal tract transfer passes through the filter and in time morphed to create a waveform of artificial speech also called Rules based synthesis .The source proceeds as a sampling function for voiced speech, in much more simpler models transfer function of the linear filter modeling the vocal tract has only poles. This format has been used before in Sega and Atari. Video games, the lead source for this function are produced by the vocal cord and noise made by pressure variations across the constriction formed in the vocal tract. The resulting speech sounds “inanimate” or “robot-like”. No human speech recordings are involved at run time. Several larger undertakings have used formant synthesizers because the high degree of control they can provide not only with conveying questions and statements but a range of other multi-purpose functions. Formant synthesis is currently in use within the VAESS project.
• Source Filter Model
The source filter is the most common of all synthesis techniques. This theory states that the vocal tract can be used as linear filter. The vocal cord has to vibrate in order for this process to activate. The result sound which is produced must exit through the lips. All sounds can be later filtered, the different aspects of this theory are complicating and so left for professionals.
In a model made by a journalist it is explained that the source filter model is divided into 3 separate parts the source, the filter and lip radiation.