Mozilla Expands Common Voice Database to 18 Languages, With More on the Way

posted by chromas on Friday March 01 2019, @03:03AM

Mozilla updates Common Voice dataset with 1,400 hours of speech across 18 languages

Mozilla wants to make it easier for startups, researchers, and hobbyists to build voice-enabled apps, services, and devices. Toward that end, it's today releasing the latest version of Common Voice, its open source collection of transcribed voice data that now comprises over 1,400 hours of voice samples from 42,000 contributors across 18 languages, including English, French, German, Dutch, Hakha-Chin, Esperanto, Farsi, Basque, Spanish, Mandarin Chinese, Welsh, and Kabyle.
It's one of the largest multi-language dataset of its kind, Mozilla claims — substantially larger than the Common Voice corpus it made publicly available eight months ago, which contained 500 hours (400,000 recordings) from 20,000 volunteers in English — and the corpus will soon grow larger still. The organization says that data collection efforts in 70 languages are actively underway via the Common Voice website and mobile apps.

Common Voice home page. Also at Engadget.

Previously: Mozilla's "Common Voice": Voice Recognition Without Google, Amazon, Baidu, Apple, Microsoft, etc.
Mozilla's Common Voice Collecting French, German, and Welsh Samples, Prepping 40 More Languages

Original Submission

This discussion has been archived. No new comments can be posted.

Mozilla Expands Common Voice Database to 18 Languages, With More on the Way | Log In/Create an Account | Top | 7 comments | Search Discussion

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.

SoylentNews

SoylentNews is people

Navigation

Sections

SoylentNews

Log In

Mozilla Expands Common Voice Database to 18 Languages, With More on the Way

Related Stories

one of . . . largest . . . of it's kindone of . . . largest . . . of it's kind (Score: 2) by Runaway1956 on Friday March 01 2019, @03:13AM (5 children)

Re:one of . . . largest . . . of it's kind(Score: 4, Informative) by takyon on Friday March 01 2019, @03:47AM

Re:one of . . . largest . . . of it's kind(Score: 0) by Anonymous Coward on Friday March 01 2019, @04:31AM

Re:one of . . . largest . . . of it's kind(Score: 0) by Anonymous Coward on Friday March 01 2019, @04:36AM

Re:one of . . . largest . . . of it's kindRe:one of . . . largest . . . of it's kind (Score: 5, Informative) by richtopia on Friday March 01 2019, @06:16AM (1 child)

Re:one of . . . largest . . . of it's kind(Score: 2) by DannyB on Friday March 01 2019, @04:03PM

I tried it(Score: 0) by Anonymous Coward on Friday March 01 2019, @10:57PM

SoylentNews

SoylentNews is people

Navigation

Sections

SoylentNews

Log In

Related Links

Mozilla Expands Common Voice Database to 18 Languages, With More on the Way

Related Stories

one of . . . largest . . . of it's kindone of . . . largest . . . of it's kind (Score: 2) by Runaway1956 on Friday March 01 2019, @03:13AM (5 children)

Re:one of . . . largest . . . of it's kind(Score: 4, Informative) by takyon on Friday March 01 2019, @03:47AM

Re:one of . . . largest . . . of it's kind(Score: 0) by Anonymous Coward on Friday March 01 2019, @04:31AM

Re:one of . . . largest . . . of it's kind(Score: 0) by Anonymous Coward on Friday March 01 2019, @04:36AM

Re:one of . . . largest . . . of it's kindRe:one of . . . largest . . . of it's kind (Score: 5, Informative) by richtopia on Friday March 01 2019, @06:16AM (1 child)

Re:one of . . . largest . . . of it's kind(Score: 2) by DannyB on Friday March 01 2019, @04:03PM

I tried it(Score: 0) by Anonymous Coward on Friday March 01 2019, @10:57PM