Pricing

Free Plan

$0

  • 3,600 QSS/hr
  • Free STT (3 models)
  • Free TTS (2 voices)
  • Email Support

Standard Plan

$10/Month

  • 10,000 QSS/hr
  • Free STT (6 models)
  • Free TTS (Mimic and Coqui)
  • Live Chat Support

Enterprise

$100/Month

  • 100,000 QSS/hr
  • Transcription Grade STT
  • Tons of TTS Voices
  • Live Phone Support
  • Customizations

User Levels

The main distinction between user levels (or plans) is the hourly QSS limit. With default settings, a smart speaker, for example, could be on 24/7 and realize a 1:1 relationship between QSS and real time. Note that both STT and TTS contribute to hourly QSS consumption. If you need professional-grade STT and TTS, where both accuracy and speed matter, then the $10 monthly Standard plan will probably work best. Enterprise is only needed if you anticipate a lot of transcription-grade STT with very large wav input files. Note that the data share flag also affects QSS calculations.
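
As a rough illustration of the 1:1 relationship described above, the sketch below checks whether a device that processes audio for the full hour stays inside a plan's hourly limit. The function names are hypothetical, a multiplier of 1.0 is assumed for default settings, and the 2.0 multiplier is an invented example value rather than a published figure.

```python
SECONDS_PER_HOUR = 3600


def max_audio_seconds_per_hour(hourly_limit_qss, quality_multiplier=1.0):
    """Seconds of audio that fit in one hour's QSS limit.

    Assumes QSS = seconds * quality_multiplier, with 1.0 for default settings.
    """
    if quality_multiplier <= 0:
        raise ValueError("quality_multiplier must be positive")
    return hourly_limit_qss / quality_multiplier


def fits_continuous_use(hourly_limit_qss, quality_multiplier=1.0):
    """True if processing audio for the whole hour stays within the limit."""
    return max_audio_seconds_per_hour(hourly_limit_qss, quality_multiplier) >= SECONDS_PER_HOUR


print(fits_continuous_use(3_600))        # Free limit, default settings -> True
print(fits_continuous_use(3_600, 2.0))   # Free limit, hypothetical 2x model -> False
print(fits_continuous_use(10_000, 2.0))  # Standard limit, same model -> True
```

Under these assumptions, the Free limit of 3,600 QSS/hr covers exactly one hour of default-quality audio per hour, while a costlier model would need the Standard limit to run continuously.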

Free
  • 3,600 QSS/hr
  • Free STT & TTS
  • Email Support

$0

Standard
  • 10,000 QSS/hr
  • Free STT & TTS
  • Multiple TTS Voices
  • Live Chat

$10/Month

Enterprise
  • 20,000 QSS/hr
  • Transcription Quality STT
  • Multiple TTS Voices
  • Multiple STT Engines
  • Direct Customer Support
  • Custom Plans Available

$100/Month


Quality Speech Seconds (QSS)

Quality speech seconds (QSS) represent the amount of resource consumed by an operation. QSS is the product of the wav file length in seconds (at the default rate) and the quality multiplier, and it represents the cost of an operation in both bandwidth and CPU consumption. See models and tradeoffs for more detail on how QSS is calculated. The system is designed so that a normal voice assistant using the default STT and TTS models should easily be able to operate 24/7/365. When an account exceeds its hourly limit, it is simply cut off until it becomes eligible to consume again.
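
The calculation and cut-off behavior can be sketched on the client side as follows. This is only an illustration: `qss_cost` and `HourlyQssBudget` are hypothetical names, and the fixed one-hour window that resets all at once is an assumption, since the page does not say how an account becomes eligible again.

```python
import time


def qss_cost(duration_seconds, quality_multiplier=1.0):
    """QSS for one operation: wav length in seconds times the quality multiplier."""
    return duration_seconds * quality_multiplier


class HourlyQssBudget:
    """Tracks QSS use against an hourly limit (assumed fixed one-hour window)."""

    def __init__(self, hourly_limit_qss, clock=time.monotonic):
        self.hourly_limit_qss = hourly_limit_qss
        self._clock = clock
        self._window_start = clock()
        self._used = 0.0

    def try_consume(self, duration_seconds, quality_multiplier=1.0):
        """Record the operation and return True, or return False if it would exceed the limit."""
        now = self._clock()
        if now - self._window_start >= 3600:
            # Assumption: the whole budget becomes available again after an hour.
            self._window_start = now
            self._used = 0.0
        cost = qss_cost(duration_seconds, quality_multiplier)
        if self._used + cost > self.hourly_limit_qss:
            return False  # cut off until the window resets
        self._used += cost
        return True


budget = HourlyQssBudget(3_600)        # Free plan limit
print(budget.try_consume(10))          # 10 QSS at the default multiplier
print(budget.try_consume(2_000, 2.0))  # 4,000 QSS would exceed the limit
```

Tracking usage locally like this lets a client skip or downgrade a request before it is rejected, though the service's own accounting (including the data share flag) remains authoritative.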

  • Free: Suitable for most applications
  • Standard: Faster response times and/or better accuracy
  • Enterprise: Transcription and/or mission critical applications
