OpenAI Refining Voice Cloning with Voice Engine by olujay

<div class="text-justify">


![image.png](https://files.peakd.com/file/peakd-hive/olujay/23uEsxcsYVUH1WJ9Z4myKDV2XEhm7fcmYo2qfeJThSeNchuwykoYAV6AjQNEaAvYzRSo4.png)


The voice cloning industry is growing quickly, and faster still now that AI models are getting better at making synthetic voices sound realistic. OpenAI is now making its debut with Voice Engine, and it is keen to enter the industry responsibly.

OpenAI's Voice Engine lets users record a 15-second voice sample and generates a copy of that voice. Its applications span many areas of the voice industry, including audiobooks, podcasts, voiceovers, and virtual assistants. However, OpenAI hasn't announced when Voice Engine will be publicly available, as it is taking time to make the tool as safe as it can be.

![image.png](https://files.peakd.com/file/peakd-hive/olujay/23tGXzLTWncQhHJpuDvk91iWFedE6kPrKrBS3eYrhujRoqCJ7dnzSn4k54GrjCSstymAx.png)

Interestingly, Voice Engine's AI model has been around for a while now. It has been available as the "read aloud" feature in the AI chatbot ChatGPT, and even that was already impressive. Where its training data comes from, however, isn't so clear. OpenAI will only say that the model was trained on some public and licensed data.

https://i.imgur.com/isw7nMh.gif

Training data is crucial information for AI providers. Most of them keep it confidential because it is a competitive advantage. It is also a potential source of IP-related disputes, which further discourages them from saying much about it. OpenAI is already facing allegations that it violated IP law by training its models on copyrighted content without crediting or compensating the creators, so it would rather be discreet about its training data.

In practice, it is difficult to create useful AI without real-world samples, including copyrighted content, and so OpenAI argues that using such works to train models should be allowed as fair use.


![image.png](https://files.peakd.com/file/peakd-hive/olujay/Eo1vw2BqERz5aZiKuBey8ZgwYokSovFMwPGwFgxh2jLuuKzPW5WiuofjPDsfmNGRyaA.png)


Voice Engine isn't trained on user data, however. “We take a small audio sample and text and generate realistic speech that matches the original speaker,” said Jeff Harris, a member of the product staff at OpenAI. “The audio that’s used is dropped after the request is complete.” In other words, Voice Engine analyses the 15-second voice sample and the text to be read, then generates speech that matches the sample on the fly as the request is made.
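
To make that request flow a bit more concrete, here is a minimal Python sketch of a handler that takes a reference clip and some text, produces speech, and then drops the clip. Voice Engine has no public API, so everything here is hypothetical: the names `CloneRequest`, `handle_request`, and `fake_synthesize` are made up for illustration, and treating 15 seconds as an upper limit is an assumption based on the figure quoted above. It only shows the shape of the idea, not how OpenAI actually implements it.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Assumption: the article's 15-second figure, treated here as an upper bound.
MAX_SAMPLE_SECONDS = 15.0


@dataclass
class CloneRequest:
    """Hypothetical request: one reference clip plus the text to speak."""
    sample: Optional[bytes]   # raw reference audio
    sample_seconds: float     # duration of the reference clip
    text: str                 # text to be read in the cloned voice


def handle_request(req: CloneRequest,
                   synthesize: Callable[[bytes, str], bytes]) -> bytes:
    """Generate speech for one request, then drop the reference audio."""
    if req.sample is None:
        raise ValueError("reference sample is missing")
    if req.sample_seconds > MAX_SAMPLE_SECONDS:
        raise ValueError("reference sample is longer than 15 seconds")
    try:
        return synthesize(req.sample, req.text)
    finally:
        # Runs whether or not synthesis succeeded, so the request
        # object no longer holds the reference audio afterwards.
        req.sample = None


def fake_synthesize(sample: bytes, text: str) -> bytes:
    """Stand-in for a real model: simply returns the text as bytes."""
    return text.encode("utf-8")
```

Calling `handle_request(req, fake_synthesize)` returns the generated bytes and leaves `req.sample` set to `None`, which mirrors the "dropped after the request is complete" behaviour Harris describes.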

There are existing tools such as ElevenLabs, Replica Studios, Papercup, and Respeecher, but unlike many of them, Voice Engine offers no controls for adjusting the pitch, cadence, or tone of a voice. No fine-tuning knobs, either. You give it a 15-second sample, and it generates a voice for the request. One interesting thing it does, however, is carry the expressiveness of the sample over to the synthetic voice. That is, if you sound excited in the sample, the generated voice will sound excited too.

https://i.imgur.com/isw7nMh.gif


There are concerns about what will become of creators in the voice industry and how this tool will affect them, given that these models are good enough to replace many of them. Some platforms already use AI cloning models to create content. So that creators, voice actors, and the like can still benefit, these platforms ask them to sign over rights to their voices so that clients can use the synthetic versions.


Some AI providers try to find a balance amid the controversy over the ethical use of copyrighted works. Replica Studios has struck a deal with SAG-AFTRA (Screen Actors Guild - American Federation of Television and Radio Artists) to create and licence copies of the voices of the media artist union's members. ElevenLabs hosts a marketplace for synthetic voices that allows users to create a voice, verify it, and share it publicly. OpenAI is taking a different approach.



OpenAI will establish no such labour union deals or marketplaces, at least not in the near term, and requires only that users obtain “explicit consent” from the people whose voices are cloned, make “clear disclosures” indicating which voices are AI-generated, and agree not to use the voices of minors, deceased people, or political figures in their generations. [<sub><sub>Source</sub></sub>](https://techcrunch.com/2024/03/29/openai-custom-voice-engine-preview/)


![image.png](https://files.peakd.com/file/peakd-hive/olujay/23tRxGEJdTdbJzUMWUWM2txewWoKPmeKR6eAqEWxYqe6antaDCVkaaA5tJfij2RSTrMJc.png)


What we have seen with deepfakes in recent times and what's possible in the future with these AI models continue to raise concerns about the ethical and responsible use of AI. OpenAI is implementing some measures to prevent misuse of Voice Engine.

For now, Voice Engine is only going to be available to a very small number of people, around 10 developers to start. OpenAI is prioritising use cases that are “low risk” and “socially beneficial,” Harris says, like those in healthcare and accessibility, in addition to experimenting with “responsible” synthetic media.

Voice clones generated with Voice Engine are watermarked. The watermarks are inaudible identifiers embedded in each generation that let OpenAI tell whether a voice clone was created by Voice Engine and which developer generated it. There is no promise that they can't be worked around, but they are described as "tamper resistant," at least.
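
OpenAI hasn't said how its watermark works, so the sketch below is not their method. It is a toy illustration of the general idea of hiding an identifier inside audio, using a swapped-in technique: least-significant-bit (LSB) embedding in 16-bit PCM samples. The function names and the `dev-42` identifier are made up for the example. Unlike what OpenAI describes, an LSB mark is not tamper resistant at all, since re-encoding or adding a little noise wipes it out.

```python
from typing import List


def embed_watermark(samples: List[int], payload: bytes) -> List[int]:
    """Hide payload bits in the lowest bit of the first samples (toy LSB scheme)."""
    # Most-significant bit of each payload byte first.
    bits = [(byte >> shift) & 1 for byte in payload for shift in range(7, -1, -1)]
    if len(bits) > len(samples):
        raise ValueError("not enough samples to hold the payload")
    marked = list(samples)
    for i, bit in enumerate(bits):
        # Clear the lowest bit, then set it to the payload bit.
        # Each sample changes by at most 1 out of a 16-bit range.
        marked[i] = (marked[i] & ~1) | bit
    return marked


def extract_watermark(samples: List[int], n_bytes: int) -> bytes:
    """Read n_bytes back out of the lowest bit of the first samples."""
    if n_bytes * 8 > len(samples):
        raise ValueError("not enough samples to read the payload")
    out = bytearray()
    for start in range(0, n_bytes * 8, 8):
        byte = 0
        for sample in samples[start:start + 8]:
            byte = (byte << 1) | (sample & 1)
        out.append(byte)
    return bytes(out)
```

Embedding `b"dev-42"` into a list of samples and then calling `extract_watermark(marked, 6)` gives the identifier back, while no sample moves by more than one step, which is far too small to hear. A production watermark would need to survive compression and editing, which is where the hard engineering is.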

https://i.imgur.com/isw7nMh.gif

An example of Voice Engine's performance is how it used [this voice sample](https://cdn.vox-cdn.com/uploads/chorus_asset/file/25362466/age_of_learning_reference.mp3) to generate three audio clips. Generated [clip 1](https://cdn.vox-cdn.com/uploads/chorus_asset/file/25362465/age_of_learning_rainforest.mp3), [clip 2](https://cdn.vox-cdn.com/uploads/chorus_asset/file/25362464/age_of_learning_reading.mp3), and [clip 3](https://cdn.vox-cdn.com/uploads/chorus_asset/file/25362463/age_of_learning_chemistry.mp3). The difference between the original clip and the generated ones isn't apparent, and unsuspecting listeners are unlikely to be able to tell them apart.


![image.png](https://files.peakd.com/file/peakd-hive/olujay/23xAZzqD9UKfBQg25mPL9nNJZVW6ejwRnbjbtXnkiZ4X5cDPBr1Af6QFWodnXcn6rtfJR.png)


OpenAI states that there will be HD and non-HD voices, but a spokesperson at OpenAI also says that there really isn't a difference between the two. They are priced differently, however, with HD costing twice as much as non-HD.

Until OpenAI releases Voice Engine to the public, they are focusing more on safety issues as they develop the AI voice cloning model. “What’s going to keep pushing us forward in terms of the actual voice-matching technology is really going to depend on what we learn from the pilot, the safety issues that are uncovered, and the mitigations that we have in place,” Harris said. “We don’t want people to be confused between artificial voices and actual human voices.”

---

By the way, make earnings with your content on Hive via [InLeo](https://inleo.io/) while you truly own your account. If you're new, sign up in a few minutes by clicking [here](https://inleo.io/signup?referral=olujay)!

---

[<sub>References</sub>](https://techcrunch.com/2024/03/29/openai-custom-voice-engine-preview/)

Images [1](https://unsplash.com/photos/a-cell-phone-sitting-on-top-of-a-laptop-computer-7q-kE4SZzvQ), [2](https://unsplash.com/photos/man-standing-in-front-of-condenser-microphone-inside-recording-studio-HowWHYGqFF4), [3](https://unsplash.com/photos/black-microphone-on-black-microphone-stand-zn-xrjOKQmc), [4](https://unsplash.com/photos/a-computer-generated-image-of-a-ball-of-string-P5mCQ4KACbM)

---

<sub>Interested in more?</sub>

<sub><sub>[Meet the Humane AI Pin: Voice, Gesture, AI – No Screens Needed!](https://inleo.io/@olujay/meet-the-humane-ai-pin-voice-gesture-ai-no-screens-needed)</sub></sub>

<sub><sub>[The Link: Bridging Minds and Machines with Neuralink's Brain Chip](https://inleo.io/@olujay/the-link-bridging-minds-and-machines-with-neuralinks-brain-chip)</sub></sub>

<sub><sub>[AI-coustics: Revolutionizing Audio Clarity with Generative AI Technology](https://inleo.io/@olujay/aicoustics-revolutionizing-audio-clarity-with-generative-ai-technology)</sub></sub>
</div>

Posted Using [InLeo Alpha](https://inleo.io/@olujay/openai-refining-voice-cloning-with-voice-engine)
created: 2024-04-02 14:31:30
@emrysjobber ·
This is why Elon Musk raised a debate about creating an anti-AI committee that ensures AI is used in a manner that doesn't infringe on the rights of its users.

There is a popular scamming technique used abroad called voice pinching: you are sent a voice note that sounds identical to someone you know, but it is just another person playing pranks on you.

Voice pinching is not perfect because the tools used cannot perfectly imitate a human voice. Now imagine this kind of tool in the arsenal of those people. Digital conversations will no longer be safe because anyone with Voice Engine will be capable of imitating anyone's voice.
created: 2024-04-03 21:26:51
@olujay ·
I am not aware of the committee Elon Musk is putting together, but I know that many AI vendors keep developing their guardrails to keep the use and development of AI responsible.

Yes, man, there is now AI voice cloning. One time, someone used it to break into their own bank account. It's crazy the things people are doing with AI now.
created: 2024-04-07 16:29:30
@emrysjobber ·
Hmmm, it seems the advancement of technology, which is a product of human intelligence, will eventually be the end of humans 😔
created: 2024-04-11 09:08:57
@hive-naija ·
![1000140008.gif](https://files.peakd.com/file/peakd-hive/hive-naija/Eo6C9yoF9MPZkKVsL2PUMBtRKnwAt5x3zzyVDuqzeUwtUrSd9Buad4eFMCSpjMiGpBS.gif)
created: 2024-04-02 22:01:48
@olujay ·
Thank you!
created: 2024-04-03 11:54:18
@luchyl ·
What else will this AI not do? Maybe the next thing we'll hear is that AI can now record our breathing, lol.

Good thing I already have a nice voice, already insured, so I won't be needing it, hehehe.

#dreemerforlife 
created: 2024-04-03 10:22:00
@olujay ·
I think by the time that happens, it will not be news to us. 😁
created: 2024-04-07 16:00:21
@nkemakonam89 ·
Olujay has come again with his complex AI post🤦. The title alone was paining my brain, haha, and then I read further and kept meeting terminology from the AI world. What can I say, AI keeps waxing stronger, striving to do even the unimaginable, and I hope that all the advancement will be safe for us humans.
I came via #dreemport
#dreemerforlife
created: 2024-04-03 08:17:33
@uyobong ·
It's cool to see that AI is becoming more responsible. Placing priority on users' safety is a great step.
created: 2024-04-02 22:32:51
@olujay ·
Totally, man. It's a really good thing that AI vendors are careful and responsible with their products.
created: 2024-04-03 11:56:30