The underlying SAPI 5 technology permits the basic conversion of written text to spoken words. This is now being used to produce voice-over narration for PowerPoint slides. The result is a set of videos which are used for instructional purposes.
This site documents the current state of this effort.
Introduction Text-to-Speech (TTS) involves the conversion of written text to spoken text. TTS technology has been maturing from very robot-like speech to vocalizations that now approach human-quality voices. The education application is to replace human talent with synthesized voices. This will allow the efficient production and editing of learning modules that have narration. Some of the advantages of this approach include:
Examples | TTS Engines The software that does the conversion from a text file to a sound file is the TTS Engine. The TTS Engine must have a file option in order to be used in the production of narrated modules. The TTS Engines which are being tested and which appear to meet the minimum quality qualifications include:
Note that license restrictions prevent the use of these TTS Engines for materials that will be used in public or posted on the Internet. It may be that the voices, not the engines, are licensed. As a result, the use of these TTS Engines is currently limited to testing the technology. Licenses are available for some of the TTS Engines so that the voice products can be used for educational applications. It is recommended that you contact the vendors for more specific license information and pricing. | TTS Voices A wide variety of voices is available for use with TTS Engines. Voices are available with different characteristics (male vs. female) and nationalities (American vs. British English). A variety of languages have their own voices. The voices which are currently being used include: Ivona Several of the IVONA voices show great promise. These work in IVONA Reader and TextAloud with similar results.
SAPI 5 Enhancements The underlying Microsoft SAPI (Speech API) 5 technology has been implemented in the various TTS Engines to allow the insertion of control information (i.e., a markup language) which will improve the narration. The markup control which have been used in the modules are shown in the examples below. Silence The silence control lets you place pauses in the text. This is useful as the break between paragraphs is a fixed interval; this lets you extend this interval.
Emphasis This control code provides a way to change how strongly a word or phrase is spoken.
The options are strong, moderate, none and reduced. Say As This control code tells the TTS Engine how to pronounce special cases. For example,
The following set of attributes is from the http://developer.voicegenie.com website and list some of the different types which are available.
Sub The words in the alias string replace the text inside the sub brackets.
|