Resemble AI launches tool to make AI voice clones in a minute
The key feature for making voice cloning convincing lies in accurately replicating the fundamental frequency (F0), or pitch, of the person being cloned. This can loosely be thought of as achieving the correct pitch of the target voice. A Boom investigative report uncovered the use of AI voice cloning technology to spread disinformation in the run-up to the Madhya Pradesh assembly elections last year. These customization options make Chatterbox a powerful tool for creating personalized user experiences. Whether you are developing branded content, interactive applications, or educational tools, the ability to tailor speech output ensures that your message resonates effectively with your audience. Hungry Jack’s, a fast food chain in Australia, just began trials of a new AI-driven voice assistant for its own drive-through orders.
- Mods are cropping up all over having used AI to create non-consensual NSFW deepfakes of people’s voices, not to mention one Persona voice actress being driven off Twitter after condemning a video that cloned her voice using AI.
- This conversion can be achieved through various methods, with modern systems often using techniques akin to those employed in image generation from text, fine-tuned for generating spectrograms.
- The Federal Trade Commission (FTC) issued a warning on March 20, titled “Scammers use AI to enhance their family emergency schemes,” revealing that scammers are now cloning people’s voices to make phone scams sound more convincing.
- “Initially, the field focused on TTS (text-to-speech) and speech-to-text (STT) technologies, which convert written text into spoken words and vice versa.
Today’s AI voices
Developing detection tools for identifying cloned voices or deepfakes is another avenue being explored. When using Resemble’s web platform, users can create a digital replica of their voice by uploading an audio sample or recording a series of sentences. The company has been offering this feature for a while, but the process took time. Users had to record around 25 sentences or upload at least three minutes of voice content to set up the system, which would then take another hour or so to provide a clone. At present, when trained on sufficient data, most voice cloning providers can accurately match the fundamental frequency of the target voice.
What are the key differences between a real and fake voice?
Otter.ai has also introduced a voice-activated AI version of its AI meeting assistant, letting users join and manage meetings without having to take notes. Now that we’re used to asking Siri to take a note or set a timer, asking your AI voice agent to start recording will be a cinch. The shift from passive transcription to an interactive agentic AI system is subtle, but it is here. Users can even give verbal commands mid-meeting, obviating the need for someone who understands the user interface of an app, for example, or web controls, all in real-time. Its existence is marred with tales of strangeness, horror, and even flat-out human rights encroachments. Mods are cropping up all over having used AI to create non-consensual NSFW deepfakes of people’s voices, not to mention one Persona voice actress being driven off Twitter after condemning a video that cloned her voice using AI.
Additionally, transparency is crucial when using AI-generated content, making sure that audiences are aware of its artificial nature. By adhering to legal and ethical standards, users can harness the benefits of Chatterbox while minimizing potential harm. From a legal perspective, while existing laws designed to protect privacy, prevent fraud, and regulate consent may apply to voice cloning, the rapid advancement of this technology is outpacing the current legal frameworks. For instance, issues like intellectual property rights of individual voices and the potential for defamation, copyright infringement, impersonation, or privacy violations are significant concerns. “In India, specific legal frameworks that regulate the use of AI in voice cloning are still evolving. The country’s approach to digital innovation and privacy is guided by broader IT regulations and privacy laws, but as of now, there are no specific laws that directly address AI voice cloning,” said Tandon.
- And then, all of that is converted into text that’s used either as input for additional systems or transcribed and displayed on screen.
- In addition to these core features, Chatterbox provides robust customization tools.
- After his son had a bad experience with one telemarketer, he made a series of bots to waste their time.
- Neither can match ChatGPT’s voice chat capabilities, especially when running ChatGPT Plus and GPT-4o.
- Nestled in a bunker, Belyaev soldiered on through air raid sirens as his hometown was invaded by Russian forces to deliver the voice of a much loved character using AI voice cloning software Respeecher.
But even OpenAI is wary about the potential misuse of the technology and says it will not release Voice Engine publicly, with it currently only being available to early testers. Australian franchises of KFC have also started testing out AI-driven voice ordering, though customers there have pushed back a bit, saying they prefer human interaction and their orders being misinterpreted by the AI window jockey. Game studio founder and voice AI advisor Mike Sorrenti is bullish on the tech. “Voice ai is an excellent thing. It can be used for translation and many other things and if a very natural interface for kids, and older adults especially those with mild disabilities such as arthritis,” he said in an email. You might think AI at work means typing prompts into ChatGPT or getting a slick summary from your inbox.