Stories
Slash Boxes
Comments

SoylentNews is people

posted by martyb on Thursday July 20 2017, @08:02AM   Printer-friendly
from the speak-up! dept.

Mozilla wants to crowdsource thousands of hours of voice recordings for an open source voice recognition engine:

The Mozilla Foundation launched "Common Voice," which is a crowdsourced initiative to build an open source data set for voice recognition applications.

Many technology companies believe that voice control will be embedded into most devices in the future. This is why Apple, Google, Amazon, Microsoft, Baidu, and others are all trying to put their own voice-controlled artificial intelligence assistants into as many devices as they can and as fast as they can, in order to gain market share before the competition.

The problem with this, according to Mozilla, is that voice controlled technologies could end up being dominated by proprietary technology and data sets, which aren't made available to startups and academics. As some large companies already benefit from billion-dollar revenues, it could later become too difficult for startups to catch up with the big players. Though[sic] Common Voice, Mozilla aims to democratize voice recognition technology.

You could use this to build (the easy part of) a personal assistant that either does not use the cloud, or does so on your terms.


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 0) by Anonymous Coward on Friday July 21 2017, @01:59AM (1 child)

    by Anonymous Coward on Friday July 21 2017, @01:59AM (#542141)

    Because English has about 12k different unique syllables compared with a language like Mandarin that's only got about 1600. If it can handle English, then chances are it can handle other languages with some adjustment. English is also an incredibly popular language with many speakers that have the time and money necessary to fund the project.

  • (Score: 1, Informative) by Anonymous Coward on Friday July 21 2017, @04:17AM

    by Anonymous Coward on Friday July 21 2017, @04:17AM (#542177)

    Your examples call to mind the fact that Chinese is a tonal language [lexington.ro] whilst English is not. The difference is a stumbling block for English speakers when learning Chinese.

    Last year, we had a story [soylentnews.org] about Baidu's effort at an engine that would recognize both English and Chinese.