Pricing that scales with you
STARTER
$5 / month
- 4,000 seconds included each month
- 1 Rapid Voice Clone
- Voice Design
- Translate into 150+ Languages
- Audio Editing
CREATOR
$19 / month
- 15,000 seconds included
- 3 Rapid Voice Clones
- 1 Professional Voice Clone
- High Definition 48khz audio output
- Clone your Voice in 6 Languages
- Translate into 150+ Languages
- Audio Editing
PROFESSIONAL
$99 / month
- All Features in Creator
- 45,000 seconds included
- $0.002/sec after 45,000 seconds
- 20 Rapid Voice Clones
- 1 Professional Voice Clones
SCALE
$299 / month
- All Features in Professional
- 120,000 seconds included
- $0.0018/sec after 120,000 seconds
- 150 Rapid Voice Clones
- 3 Professional Voice Clones
BUSINESS
$699 / month
- All Features in Scale
- 360,000 seconds included each month
- $0.0015/sec after 360,000 seconds
- 500 Rapid Voice Clones
- 3 Professional Voice Clone
- Low latency WebSocket API
- Authorized partner program
ENTERPRISE
Contact Us
- All Features in Business
- Dedicated Support
- Enterprise SLA
- Deepfake Detection
- Real-Time Speech-to-Speech
- Dedicated nodes or On-Prem Support
* For languages available for each plan see our list
What’s included in each plan?
Frequently Asked Questions
Can the content I generate be used for commercial purposes?
All content generated in all tiers is available for commercial use.
What is the difference between Rapid Voice Clone and Professional Voice Clone?
Rapid Voice Clone and Professional Voice Clone are both state-of-the-art voice cloning technologies offered on our platform, designed to cater to different user needs and project scopes.
Rapid Voice Clone is all about speed and efficiency. It enables users to quickly create a custom voice clone using a small audio sample — as little as 10 seconds and up to 1 minute. The cloning process is swift, taking around a minute to complete. Currently, Rapid Voice Clone supports text-to-speech functionality, making it an excellent choice for projects that require fast turnaround times, like prototyping or content development where voice detail is secondary to speed.
Professional Voice Clone, on the other hand, is built for depth and nuance. It requires a longer audio sample, typically 10 minutes, and approximately an hour to create a voice clone. This clone captures the unique vocal characteristics of the original speaker, including their emotional nuances and expressiveness. Professional Voice Clone supports both text-to-speech and speech-to-speech functionalities and offers the ability to clone voices in various languages for Enterprise plan users. It is best suited for projects that demand high fidelity and detailed voice replication, such as professional-grade voiceovers, broadcasting, and customer engagement solutions where the quality of the voice clone is paramount.
In summary, the main differences lie in the time required to create the clone, the length of the audio sample needed, and the depth of voice replication and functionality. Your choice between Rapid and Professional Voice Clone should be guided by the specific requirements of your project, the level of detail needed, and the time frame for deployment.
How do I track my usage?
To track usage, please proceed to the Billing Portal and see Current Usage.
Can I cancel at any time?
You can cancel your subscription at any time through the Billing Portal. Note, your subscription will end at the end of your billing cycle and all amounts owed will be billable.
How do I change my subscription?
You can change your subscription by going to our Billing Portal and clicking on Manage Subscription.
What languages do I get access to for Localize?
There are over 148 languages included in all of our plans (see list).
How do I get access to faster streaming?
Enterprise customers get access to lower than 300ms time to first sound for streaming with greater consistency and concurrency. Please schedule a demo for more information.