Future of AI voice re-synthesis — dxRevive and Adobe Podcast

Enhanced Media
4 min readOct 10, 2023

--

Photo by Tommy Lopez from Pexels.com

Artificial intelligence (AI) voice re-synthesis has burst into the world of sound design, editing, and mixing with astonishing impact. This technological breakthrough makes it possible to create replicas of human voices with astonishing accuracy, redefining the possibilities in the audio industry.

The human voice is a vital element in movies, television shows, video games, commercials, and many other entertainment media. Traditionally, voice actors have been essential to bringing characters and narratives to life. Nevertheless, AI voice cloning raises an undeniable issue: the possibility of many voice actors and related industry professionals losing their jobs due to this disruptive technology.

Let’s start talking about this with two key examples: Accentize’s dxRevive and Adobe Podcast — two AI tools whose work was once carried out by an entire team of people, and now, in short order, by a machine.

The first one is an outstanding dialog noise reduction plug-in that not only eliminates annoying noises in the background of recordings but employs meticulously designed AI algorithms to restore the original clarity and tone of the audio. This combination of precise noise reduction and improved sound quality is revolutionizing film and television post-production.

The implications of dxRevive are profound and encompass several key aspects. On the one hand, it ensures precise noise reduction while restoring every subtle nuance of the audio, delivering unprecedented clarity in the sound industry. Similarly, by working directly on the computer, without the need for audio processing in the cloud, the possibility of data loss is minimized, which is essential to ensure the security and integrity of the sound itself. The user-friendly interface and intuitive workflow make dxRevive accessible to audio professionals of all experience levels (including, of course, newcomers), thus democratizing access to studio quality.

Adobe Podcast is another interesting example. This app, powered by AI technology, has become an essential solution for elevating audio quality in podcast recordings, social media videos, and other multimedia content. It has a great ability to improve sound quality, even when using low-cost microphones. This means content creators don’t need to invest in expensive equipment to get professional studio audio. Furthermore, Adobe Podcast’s accessibility is one of its main strengths, as it can be used for free without the need to install additional software on your computer.

So, content quality is essential, and tools like these have become fundamental for content creators: from those producing tutorials to those looking to improve the quality of their podcasts. This application exemplifies how AI is democratizing sound design by offering high-quality, accessible solutions for everyone, and well, that sounds good…, but it’s kind of problematic at the same time.

Photo by cottonbro studio from Pexels.com

Leaving aside the issue of the sound industry professionals, the advancement of artificial intelligence (AI) in acting and entertainment also poses significant challenges for actors, especially voice actors. The Screen Actors Guild (SAG-AFTRA) has initiated an indefinite strike to protect actors from the threat of being replaced by AI, for example. Production companies have expressed interest in using AI to scan background performers’ faces and voices, for using their images and sound in projects without their consent or compensation, raising concerns about the exploitation of actors’ identities and talent. This situation is similar to that depicted in the “Joan Is Awful” episode of the “Black Mirror’’ series, where AI replaces actors in real life. SAG-AFTRA president Fran Drescher warns that AI poses a threat to all entertainment professionals and the industry as a whole, as the technology is also being used in automated audiobooks, synthesized voiceovers, and digital avatars. Concerns about AI intervention in acting are not limited to Hollywood, as other actors’ unions, such as Equity in the UK, also note the growing influence of technology in the entertainment industry.

Amid the challenges posed by the growing influence of artificial intelligence in acting and entertainment, a fundamental question arises: can actors, and voice actors in particular, find a lasting place in a technology-driven world? While AI has proven its ability to perform technical and automated tasks, creativity, empathy, and human uniqueness remain irreplaceable elements in acting and storytelling. Rather than seeing AI as a threat, perhaps we can view it as a tool that complements and empowers human talent. Human-machine collaboration can open up new creative and storytelling frontiers. As we move forward in this new digital world, let’s remember that the essence of acting lies in the authenticity and emotional connection that only humans can offer. In the same way, the human creativity and horizontal thinking of sound designers are irreplaceable aspects that machines will hardly replicate. So, how can we all adapt and embrace technology as an ally rather than a competitor? This question invites us to imagine a future where artificial intelligence and humanity work together to create even more powerful and moving entertainment experiences. Will it be possible? Let’s keep the faith.

If you like these and other topics, be sure to follow our blog. We like to bring you quality content and highlight cutting-edge topics in the world of sound design. We hope this post has been useful, and remember that if you need professional advice for your audiovisual projects, we are here: we are Enhanced Media Sound Studio, and our mission is to bring your work to excellence.

--

--

Enhanced Media

We tell stories through sound. We specialize in creating a complete audio post-production and sound design experience. https://enhanced.media/