Artificial Intelligence: OpenAI creates voice cloning software

Artificial intelligence
OpenAI creates voice cloning software

Concern about misuse: OpenAI’s software for cloning real voices will not be available for public use for the time being. photo

© Richard Drew/AP/dpa

A 15-second recording should be enough to enable the software to clone real voices. However, Voice Engine will not be available for public use for the time being – because the technology poses risks.

ChatGPT developer OpenAI is working on one Software for cloning human voices – but is holding it back for now due to concerns about misuse. “Voice Engine uses text input and a mere 15-second audio sample to produce natural-sounding speech that is very similar to the original speaker,” said the US company. OpenAI has been working on the program since 2022, among other things to refine software that converts text into speech.

However, it remains to be seen whether and when Voice Engine will be available to the general public. OpenAI said that such an application poses considerable risks, especially in an election year. Because of possible misuse, the group is therefore approaching the question of approval “carefully and knowledgeably”. OpenAI is hoping for a discussion about the opportunities and risks of the technology and wants to carry out further tests.

In January, fake calls fueled fears of manipulation using artificial intelligence in the race for the White House. In automated calls, a voice that sounded confusingly similar to that of US President Joe Biden called on Democrats in New Hampshire not to take part in the primaries. Such so-called robocalls are a common election campaign tool in the USA.

In its announcement, OpenAI also referred to the opportunities of the application. With a partner, OpenAI researched how synthetic voices can support learning to read. Voice Engine also has potential when it comes to translating videos and podcasts, for example, which could reach a wider audience. In any case, it is important that the cloned voice first reveals itself as being AI-generated.

dpa

source site-5