Stories
Slash Boxes
Comments

SoylentNews is people

posted by Fnord666 on Monday January 16 2023, @07:56AM   Printer-friendly
from the my-voice-is-no-longer-my-password dept.

Text-to-speech model can preserve speaker's emotional tone and acoustic environment:

On Thursday, Microsoft researchers announced a new text-to-speech AI model called VALL-E that can closely simulate a person's voice when given a three-second audio sample. Once it learns a specific voice, VALL-E can synthesize audio of that person saying anything—and do it in a way that attempts to preserve the speaker's emotional tone.

Its creators speculate that VALL-E could be used for high-quality text-to-speech applications, speech editing where a recording of a person could be edited and changed from a text transcript (making them say something they originally didn't), and audio content creation when combined with other generative AI models like GPT-3.


Original Submission

 
This discussion was created by Fnord666 (652) for logged-in users only, but now has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 4, Informative) by Nuke on Monday January 16 2023, @09:53AM (1 child)

    by Nuke (3162) on Monday January 16 2023, @09:53AM (#1287044)

    "Hi, this is your bank manager, we were speaking in branch last week. You need to transfer $10,000 to a special account we have set up ...."
    or :
    "Hi, your IT tech support here. We have detected a virus on your Windows. As you can tell by my Oxford accent, I am not from India ..."

    Starting Score:    1  point
    Moderation   +2  
       Insightful=1, Informative=1, Total=2
    Extra 'Informative' Modifier   0  
    Karma-Bonus Modifier   +1  

    Total Score:   4  
  • (Score: 2) by DannyB on Monday January 16 2023, @05:16PM

    by DannyB (5839) Subscriber Badge on Monday January 16 2023, @05:16PM (#1287087) Journal

    Hi,
    This is the voice of a rich Nigerian prince who recently died. This voice is from beyond the grave. I spent my entire life trying to give away my substantial fortune by email, but nobody would return my emails!

    --
    The Centauri traded Earth jump gate technology in exchange for our superior hair mousse formulas.