Mars
MARS6
MARS6 is a frontier text-to-speech model by CAMB.AI with voice/prosody cloning capabilities in 10 languages. MARS6 must be licensed for commercial use, we can help!
Deploy MARS6
Example usage
This model requires at least four inputs:
text
: The input text that needs to be spokenaudio_ref
: An audio file containing the audio of a single personref_text
: What is spoken in audio_reflanguage
: The language code for the target language
The model will try to output an audio stream containing the speech in the reference audio’s style. The output is by default an HTTP1.1 chunked encoding response of an encoded audio file using an ADTS AAC stream, but can be configured to stream using flac format, or to not stream at all and return the entire response as a base64 encoded flac file.