User Levels

The main distinction between user levels (or plans) is hourly limits. Using default settings a smart speaker, for example could be on 24/7 and realize a 1:1 relationship between QSS and real time. Note, both STT and TTS contribute to hourly QSS consumption. If you are going to require professional grade STT and TTS where both accuracy and performance are needed as fast as possible, then the $10 monthly package will probably work best. Enterprise is only needed if you anticipate a lot of transcription grade STT with very large wav input files. Note, the data share flag will also affect QSS calculations.

Free

3,600 QSS/hr
Free STT & TTS
Email Support

Get Now

Standard

10,000 QSS/hr
Free STT & TTS
Multiple TTS Voices
Live Chat

$10/Month

Get Now

Enterprise

20,000 QSS/hr
Transcription Quality STT
Multiple TTS Voices
Multiple STT Engines
Direct Customer Support
Custom Plans Available

$100/Month

Get Now

Quality Speech Seconds (QSS)

Quality speech seconds represents the amount of resource consumed by an operation. It is the product of wav file length in seconds (at the default rate) and the quality multiplier and represents the cost of an operation in both bandwidth and CPU consumption. See models and tradeoffs for more detail on how QSS is calculated. The system is designed such that a normal voice assistant using the default STT and TTS models should easily be able to operate 24/7/365. When an account exceeds its hourly limits it is simply cut-off until it becomes eligible to consume again.

Free
Suitable for most applications

Standard
Faster response times and/or better accuracy

$10/Month

Enterprise
Transcription and/or mission critical applications

$100/Month

Get Now

Pricing

Free Plan

Standard Plan

Enterprise

User Levels

Free

Standard

Enterprise

Quality Speech Seconds (QSS)