
The voices were not available on the original site at the time of writing this post thus I uploaded them to a GitHub repository. HTS stands for hidden-markov-model-based speech synthesis system and Nitech is the Nagoya Institute of Technology. The voice_cmu_us_slt_arctic_hts voice (used above) can be installed easily and isn’t bad but sayText ( 'hello' )įestival also has a C, C++ and a client-server API.
#Festival speech synthesis system install
Pip install git+ import festival festival. Use Python wrapper of festival’s C++ API: sudo apt install festival-dev Pipe text to festival’s tts mode from Python with subprocess: from subprocess import PIPE, Popen text = 'We love Raspberry.' process = Popen (, stdin = PIPE ) process. Pipe text into festival’s tts mode: echo hello | festival -tts Use festival’s batch mode to run file: festival -b hello.scm Use festival’s batch mode: festival -b '(voice_cmu_us_slt_arctic_hts)' \ '(SayText "The temperature is 22 degrees centigrade and there is a slight breeze from the west.")' Run commands from file: festival> (load "hello.scm" ) Use festival’s interactive mode: festival -iįestival> (SayText "Don't hate me, I'm just doing my job!" ) Install festival: sudo apt-get install festival festvox-us-slt-hts This post shall outline the various ways festival can be used and the steps required to achieve good results. While I found espeak to be the simplest system to use, festival produced the best results when used with the right voices. Spd-say, MBROLA, PicoTTS and the festvox project
