Text to speech
We can make the computer speak with Python.
Given a text string, it will speak the written words in the English language.
This process is called Text To Speech (TTS).
Related Course: Zero to Deep Learning with Python and Keras
Pytsx is a cross-platform text-to-speech wrapper.
It uses different speech engines based on your operating system:
sapi5 - SAPI5 on Windows XP, Windows Vista, and (untested) Windows 7
espeak - eSpeak on any distro / platform that can host the shared library (e.g., Ubuntu / Fedora Linux)
Install with pip (using pyenv, pipenv or virtualenv):
Then run the example code:
gTTS is a module and command line utility to save spoken text to mp3.
It uses the Google Text to Speech (TTS) API.
Listen to the voice sample below:
This module supports many languages and sounds very natural.
Install with the python package tool (pip):
from gtts import gTTS
If you want to test it on the command line use:
gtts-cli.py “Hello” -l ‘en’ -o hello.mp3
TTS in Pythonista for iOS:
To record sound:
To recognize it as text:
text = speech.recognize('audio.m4a', 'en') # sent to Apple servers
If you use Microsoft Windows 10, it has a speech engine included.
Install the module win32com, then you can use this code:
import win32com.client as wincl
IBM has created an tts API, which is free for a limited amount of queries.
You can listen to sample data on the Watson TTS page.
You can use the tts-watson module to interact.
After registrating on the IBM watson site (generate keys there),
we can write our code:
Alternatively you can use curl to directly fetch from the API.