0.2.0 • Published 3 years ago
@junaga/tts v0.2.0
A CLI to transform written text files (.txt
, .smd
and .ssml
) into naturally sounding, spoken human voice, audio files (.mp3
or .wav
). Using the partially free, Google Cloud Platform, Machine Learning, Text to Speech API.
Get a GCP service account
You need a GCP service account JSON key file as credential to authenticate against the API. A service account is a Google account, but for computers/containers, not humans.
Option one
Ask your local GCP admin. lol
Option two
Follow this guide, where you're going to:
- Create a GCP Project, with billing enabled
- Turn on the Text to Speech API
- Generated a service account that has project access (i.e. viewer)
Set the GOOGLE_APPLICATION_CREDENTIALS
environment variable to the absolute, not relative path of the key file. The variable is read by the @google-cloud/text-to-speech
package, the APIs' official nodejs client used in this tool.
Usage
sudo npm install --global @junaga/tts
# # Assuming the following:
# $ ls -1
# service-account.json
# text-smd-or-ssml-files/
# use the `$PWD` variable to save 5 seconds of your life
export GOOGLE_APPLICATION_CREDENTIALS=$PWD/service-account.json
# The default `--voice` is `germanSales`
tts --voice tikTok text-smd-or-ssml-files/ mp3-output/
# Check out `mp3-output/`
Run tts --help
for all options.