Stories
Slash Boxes
Comments

SoylentNews is people

SoylentNews is powered by your submissions, so send in your scoop. Only 15 submissions in the queue.
posted by chromas on Friday March 01 2019, @03:03AM   Printer-friendly

Mozilla updates Common Voice dataset with 1,400 hours of speech across 18 languages

Mozilla wants to make it easier for startups, researchers, and hobbyists to build voice-enabled apps, services, and devices. Toward that end, it's today releasing the latest version of Common Voice, its open source collection of transcribed voice data that now comprises over 1,400 hours of voice samples from 42,000 contributors across 18 languages, including English, French, German, Dutch, Hakha-Chin, Esperanto, Farsi, Basque, Spanish, Mandarin Chinese, Welsh, and Kabyle.

It's one of the largest multi-language dataset of its kind, Mozilla claims — substantially larger than the Common Voice corpus it made publicly available eight months ago, which contained 500 hours (400,000 recordings) from 20,000 volunteers in English — and the corpus will soon grow larger still. The organization says that data collection efforts in 70 languages are actively underway via the Common Voice website and mobile apps.

Common Voice home page. Also at Engadget.

Previously: Mozilla's "Common Voice": Voice Recognition Without Google, Amazon, Baidu, Apple, Microsoft, etc.
Mozilla's Common Voice Collecting French, German, and Welsh Samples, Prepping 40 More Languages


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 2) by Runaway1956 on Friday March 01 2019, @03:13AM (5 children)

    by Runaway1956 (2926) Subscriber Badge on Friday March 01 2019, @03:13AM (#808540) Journal

    So, uhhhhh, nothing super special? Useful, maybe, but nothing super special.

    Don't we all love meaningless sales pitches?

    Starting Score:    1  point
    Karma-Bonus Modifier   +1  

    Total Score:   2  
  • (Score: 4, Informative) by takyon on Friday March 01 2019, @03:47AM

    by takyon (881) <takyonNO@SPAMsoylentnews.org> on Friday March 01 2019, @03:47AM (#808550) Journal
    --
    [SIG] 10/28/2017: Soylent Upgrade v14 [soylentnews.org]
  • (Score: 0) by Anonymous Coward on Friday March 01 2019, @04:31AM

    by Anonymous Coward on Friday March 01 2019, @04:31AM (#808570)

    one of . . . largest . . . of it's kind

    The good thing with aging related dementia . . . it's kind. For, in its kindness, the sufferer is blissful unaware of the affliction.

  • (Score: 0) by Anonymous Coward on Friday March 01 2019, @04:36AM

    by Anonymous Coward on Friday March 01 2019, @04:36AM (#808572)

    So, uhhhhh, nothing super special? Useful, maybe, but nothing super special.

    Do you feel like you are denied something super special, something that you totally deserve?

    Don't we all love meaningless sales pitches?

    'Useful' and 'meaningless' . . . I see you mastered the art of cognitive dissonance.

  • (Score: 5, Informative) by richtopia on Friday March 01 2019, @06:16AM (1 child)

    by richtopia (3160) on Friday March 01 2019, @06:16AM (#808591) Homepage Journal

    There are larger databases, but not publically available. Google Assistant, Siri, and Cortana's voice recognition system trained on similar datasets. Mozilla's motivation is to provide an open alternative, and even the reference dataset is a step forward to competing with these services.

    • (Score: 2) by DannyB on Friday March 01 2019, @04:03PM

      by DannyB (5839) Subscriber Badge on Friday March 01 2019, @04:03PM (#808751) Journal

      No thanks Mozilla. Such databases should be much larger, like those "not publicly available" ones, and for FREE. Because I'm entitled!

      And by the way . . .

      It is the government's and everyone else's duty to guarantee my happiness. Doesn't the constitution guarantee my happiness? I shouldn't have to "pursue" happiness. That would require some effort and sounds too much like something called "work". I've got it! I'll become a "Social Media Influencer"! That's it! I can get free stuff for doing no actual work except posting excessively long and content free YouTube videos! It's a great career path!

      --
      To transfer files: right-click on file, pick Copy. Unplug mouse, plug mouse into other computer. Right-click, paste.