mimic3/opentts-abc
2022-03-30 17:08:56 -04:00
..
opentts_abc Add SSML to web server 2022-03-25 17:48:47 -04:00
.gitignore Move CLI into main package 2022-03-25 16:56:11 -04:00
.isort.cfg Initial commit 2022-03-16 17:22:15 -04:00
.projectile Initial commit 2022-03-16 17:22:15 -04:00
check.sh Initial commit 2022-03-16 17:22:15 -04:00
LICENSE Fix license 2022-03-25 14:24:15 -04:00
MANIFEST.in Fix license 2022-03-25 14:24:15 -04:00
mypy.ini Initial commit 2022-03-16 17:22:15 -04:00
pylintrc Clean up opentts_abc 2022-03-25 11:52:23 -04:00
README.md Cleanup in mimic3_tts 2022-03-25 13:20:28 -04:00
setup.cfg Initial commit 2022-03-16 17:22:15 -04:00
setup.py More clean up on opentts-abc 2022-03-25 14:28:33 -04:00

Open Text to Speech (TTS) Abstract Base Classes (ABC)

Base classes for open text to speech systems.

SSML

A subset of SSML is supported in SSMLSpeaker:

  • <speak> - wrap around SSML text
    • lang - set language for document
  • <s> - sentence (disables automatic sentence breaking)
    • lang - set language for sentence
  • <w> / <token> - word (disables automatic tokenization)
  • <voice name="..."> - set voice of inner text
    • voice - name of voice
  • <say-as interpret-as=""> - force interpretation of inner text
    • interpret-as - way to interpret text (implementation dependent)
    • format - way to format text (implementation dependent)
  • <break time=""> - Pause for given amount of time
    • time - seconds ("123s") or milliseconds ("123ms")
  • <mark name=""> - User-defined mark (written to --mark-file or part of TextToSpeechResult)
    • name - name of mark
  • <sub alias=""> - substitute alias for inner text
  • <phoneme ph="..." alphabet="ipa"> - supply phonemes for inner text
    • ph - phonemes for each word of inner text
    • alphabet - name of phoneme alphabet (usually "ipa")