Stories
Slash Boxes
Comments

SoylentNews is people

posted by Fnord666 on Monday January 16 2023, @07:56AM   Printer-friendly
from the my-voice-is-no-longer-my-password dept.

Text-to-speech model can preserve speaker's emotional tone and acoustic environment:

On Thursday, Microsoft researchers announced a new text-to-speech AI model called VALL-E that can closely simulate a person's voice when given a three-second audio sample. Once it learns a specific voice, VALL-E can synthesize audio of that person saying anything—and do it in a way that attempts to preserve the speaker's emotional tone.

Its creators speculate that VALL-E could be used for high-quality text-to-speech applications, speech editing where a recording of a person could be edited and changed from a text transcript (making them say something they originally didn't), and audio content creation when combined with other generative AI models like GPT-3.


Original Submission

 
This discussion was created by Fnord666 (652) for logged-in users only, but now has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 2, Informative) by GloomMower on Monday January 16 2023, @01:25PM (1 child)

    by GloomMower (17961) on Monday January 16 2023, @01:25PM (#1287054)

    It would be nice if I can make voice-overs in my own voice without me saying them. Especially to make it sound less monotone and pronounce words correctly without having to do several takes.

    My voice:
    https://www.youtube.com/watch?v=zndy5BNjf0I [youtube.com]

    Later I used AWS Polly text to speech:
    https://www.youtube.com/watch?v=K7PMrOzxzj0 [youtube.com]

    I thought polly was better than me reading. But it would be nice if I could make a pristine sample of my voice and use text to speech.

    Starting Score:    1  point
    Moderation   +1  
       Informative=1, Total=1
    Extra 'Informative' Modifier   0  

    Total Score:   2  
  • (Score: 2) by inertnet on Monday January 16 2023, @02:16PM

    by inertnet (4071) on Monday January 16 2023, @02:16PM (#1287055) Journal

    I can see your point, not that your original voice-over is bad though. I usually dislike artificial voice-overs, but that one wasn't so bad.

    I can't object as long as it's used voluntary, but I would really have a problem with people stealing my voice and have me say things that I've never actually spoken.