Stories
Slash Boxes
Comments

SoylentNews is people

posted by Fnord666 on Friday April 03 2020, @08:24AM   Printer-friendly
from the fill-in-the-blanks dept.

Google's WaveNetEQ fills in speech gaps during Duo calls:

Google today detailed an AI system — WaveNetEQ — it recently deployed to Duo, its cross-platform voice and video chat app, that can realistically synthesize short snippets of speech to replace garbled audio caused by an unstable internet connection. It's fast enough to run on a smartphone while delivering state-of-the-art, natural-sounding audio quality, laying the groundwork for future chat apps optimized for bandwidth-constrained environments.

Here's how it sounds compared with Duo's old solution (the first is WaveNetEQ):

https://venturebeat.com/wp-content/uploads/2020/04/waveneteq_120_ms_2_63b829581a3291c144a030639139c199.wav
https://venturebeat.com/wp-content/uploads/2020/04/neteq_120_ms_2_8e86d7b2061dfb964b845ebefc1aebd9.wav

As Google explains, to ensure reliable real-time communication, it's necessary to deal with packets (i.e., formatted units of data) that are missing when the receiver needs them. (The company says that 99% of Duo calls need to deal with network issues, and that 10% of calls lose more than 8% of the total audio duration due to network issues.) If new audio isn't delivered continuously, audible glitches and gaps will occur, but repeating the same audio isn't ideal because it produces artifacts and reduces overall call quality.

Google's solution — WaveNetEQ — is what's called a packet loss containment module, which is responsible for creating data to fill in the gaps created by packet losses, excessive jitter, and other mishaps.

"I can['t] hear you now."


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 4, Interesting) by KritonK on Friday April 03 2020, @10:01AM (1 child)

    by KritonK (465) on Friday April 03 2020, @10:01AM (#978649)

    I'll k[glitch]l you tomorrow. Bye!

    Is that glitch an o or an i sound?

    Minor changes can substantially alter [youtube.com] the meaning of spoken text, so I'd rather ask the caller to repeat what they said, than have software automatically fill in the blanks, with possibly dubious results.

    Starting Score:    1  point
    Moderation   +2  
       Insightful=1, Interesting=1, Total=2
    Extra 'Interesting' Modifier   0  
    Karma-Bonus Modifier   +1  

    Total Score:   4  
  • (Score: -1, Redundant) by Anonymous Coward on Friday April 03 2020, @03:28PM

    by Anonymous Coward on Friday April 03 2020, @03:28PM (#978742)

    Phew - it's lucky you clarified that because I thought you said "Trump replaced White House pandemic-response team With Jared Kushner". That would've been hilarious!