Stories
Slash Boxes
Comments

SoylentNews is people

posted by martyb on Thursday September 29 2016, @09:53AM   Printer-friendly
from the Translate-"Jabberwocky" dept.

Google Translate will be upgraded using a "Neural Machine Translation" technique, starting with Chinese-English translation today:

Google has been working on a machine learning translation technique for years, and today is its official debut. The Google Neural Machine Translation [GNMT] system, deployed today for Chinese-English queries, is a step up in complexity from existing methods. Here's how things have evolved (in a nutshell). [...] GNMT is the latest and by far the most effective to successfully leverage machine learning in translation. It looks at the sentence as a whole, while keeping in mind, so to speak, the smaller pieces like words and phrases. It's much like the way we look at an image as a whole while being aware of individual pieces — and that's not a coincidence. Neural networks have been trained to identify images and objects in ways imitative of human perception, and there's more than a passing resemblance between finding the gestalt of an image and that of a sentence.

Interestingly, there's little in there actually specific to language: The system doesn't know the difference between the future perfect and future continuous, and it doesn't break up words based on their etymologies. It's all math and stats, no humanity. Reducing translation to a mechanical task is admirable, but in a way chilling — though admittedly, in this case, little but a mechanical translation is called for, and artifice and interpretation are superfluous.

The code runs on Google's homegrown TPUs. The Google Research Blog says that the technique will be applied to other language pairs in the coming months.

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation


Original Submission

Related Stories

Google Reveals Homegrown "TPU" For Machine Learning 20 comments

Google has lifted the lid off of an internal project to create custom application-specific integrated circuits (ASICs) for machine learning tasks. The result is what they are calling a "TPU":

[We] started a stealthy project at Google several years ago to see what we could accomplish with our own custom accelerators for machine learning applications. The result is called a Tensor Processing Unit (TPU), a custom ASIC we built specifically for machine learning — and tailored for TensorFlow. We've been running TPUs inside our data centers for more than a year, and have found them to deliver an order of magnitude better-optimized performance per watt for machine learning. This is roughly equivalent to fast-forwarding technology about seven years into the future (three generations of Moore's Law). [...] TPU is an example of how fast we turn research into practice — from first tested silicon, the team
had them up and running applications at speed in our data centers within 22 days.

The processors are already being used to improve search and Street View, and were used to power AlphaGo during its matches against Go champion Lee Sedol. More details can be found at Next Platform, Tom's Hardware, and AnandTech.


Original Submission

This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @10:24AM

    by Anonymous Coward on Thursday September 29 2016, @10:24AM (#407818)

    China produces everything and America consumes everything and Google just took the jerbs of the people who been translating Chinese labels into English labels for American idiot consumers!

    • (Score: 1) by Francis on Thursday September 29 2016, @01:34PM

      by Francis (5544) on Thursday September 29 2016, @01:34PM (#407885)

      Mostly Chinese people and if you're lucky there's been a foreigner checking it afterwards, but usually not.

      But, only an idiot uses Google translate for any sort of serious translation work. At best, it's a quick look. For best results, you have to use the word order for the foreign language in order to nudge the translation into something approximating correct. Chinese is a particular problem and one where the current system fails worse than probably any other language, which is probably why it's the first to go. It's basically unusable now, but if the new one works, that would give them a ton of data to use when figuring out what to worry about with the other ones.

  • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @11:19AM

    by Anonymous Coward on Thursday September 29 2016, @11:19AM (#407832)

    The system doesn't know the difference between the future perfect and future continuous

    I’m not sure I want to know that either…

    • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @11:25AM

      by Anonymous Coward on Thursday September 29 2016, @11:25AM (#407833)

      Apply now to be a White House intern and you will have two possible futures. You will give head in both futures, but will it be cock or clam?

      • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @11:54AM

        by Anonymous Coward on Thursday September 29 2016, @11:54AM (#407844)

        You forgot the Reptilian future.

        • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @12:25PM

          by Anonymous Coward on Thursday September 29 2016, @12:25PM (#407855)

          In this case the clam is a sub-set of Reptilian.

      • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @02:00PM

        by Anonymous Coward on Thursday September 29 2016, @02:00PM (#407901)

        How about machine translating this?

    • (Score: 1) by Francis on Thursday September 29 2016, @01:37PM

      by Francis (5544) on Thursday September 29 2016, @01:37PM (#407889)

      Unless you're an English teacher or learned English as a secondary language, you probably wouldn't know the difference explicitly, but you almost certainly know the difference implicitly.

      Perfect just means that the action has been completed and continuous means that it's still in progress. I'm not sure why we use the terms perfect and continuous when we use the terms perfect and imperfect for the same basic concept when dealing with other languages.

      • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @02:39PM

        by Anonymous Coward on Thursday September 29 2016, @02:39PM (#407923)

        Unless you're an English teacher or learned English as a secondary language, you probably wouldn't know the difference explicitly, but you almost certainly know the difference implicitly.

        Perfect just means that the action has been completed and continuous means that it's still in progress. I'm not sure why we use the terms perfect and continuous when we use the terms perfect and imperfect for the same basic concept when dealing with other languages.

        Okay, I follow the words, but I'm still struggling to understand. Could you please provide examples? From the use of the word 'action', I'm guessing this has something to do with verb tense. If so, it would help to see something like this:

        • Tomorrow, I will kick the ball.
        • Today is the day I kick the ball.
        • Yesterday, I kicked the ball.

        Similarly, "will lift", "lift", and "lifted."

        Do those encompass the concepts? If so, please identify the "perfect" and the "continuous" (and whatever the third one is) — and if not, then better examples would be much appreciated!

        • (Score: 2) by schad on Thursday September 29 2016, @05:22PM

          by schad (2398) on Thursday September 29 2016, @05:22PM (#408025)

          For the past ten minutes, I have been kicking the ball.

          • (Score: 2) by HiThere on Thursday September 29 2016, @06:44PM

            by HiThere (866) Subscriber Badge on Thursday September 29 2016, @06:44PM (#408060) Journal

            15 minutes from now I will have been kicking the ball.

            I hope that tomorrow I will look back on kicking the ball with pleasure.

            15 minutes from now I ought to have been kicking the ball.

            etc.

            --
            Javascript is what you use to allow unknown third parties to run software you have no idea about on your computer.
  • (Score: 3, Funny) by wonkey_monkey on Thursday September 29 2016, @11:57AM

    by wonkey_monkey (279) on Thursday September 29 2016, @11:57AM (#407846) Homepage

    It make very good change word.

    Thanks chief!

    --
    systemd is Roko's Basilisk
  • (Score: 2) by gringer on Thursday September 29 2016, @12:18PM

    by gringer (962) on Thursday September 29 2016, @12:18PM (#407852)

    I really like this translation service. I am looking forward to many messages and inspiring spam in the future.

    --
    Ask me about Sequencing DNA in front of Linus Torvalds [youtube.com]
    • (Score: 1) by Francis on Thursday September 29 2016, @01:44PM

      by Francis (5544) on Thursday September 29 2016, @01:44PM (#407893)

      I think there should be an equivalent test to the Turing test for automatic translators. Basically, when the translations are able to beat that of a human translation.

      That being said, I've been messing around in the new translator for the last couple minutes and the results seem to be much improved over the older version. Most of the things I'm typing in are correct, or at least in the ball park for what they should be. And I'm not having to do my customary Chinglish input to get something that's grammatically appropriate in Chinese.

      I'm sure that as more people give corrections to the engine that they'll be able to handle more.

      Admittedly, I'm just typing in relatively simple sentences that should have been right previously, but that's still a huge improvement. Chinese is notoriously difficult for machines to translate, especially simplified Chinese. There's a ton of characters that now do multiple things depending upon context that historically had different characters and the only way to know the difference is from context.

    • (Score: 2) by bob_super on Thursday September 29 2016, @05:32PM

      by bob_super (1357) on Thursday September 29 2016, @05:32PM (#408031)

      Dear Sir,
      I being only son of bullet behind head Chinese billionaire Shi YinPing, and need the assistance in transfer of the 253.12 MILLION JIAO AND 6 FEN.
      You get the 15% for your invaluable help.
      Please send details for the bank's the account so we may debute transfer.
      Love
      Shi Ske Bab.

  • (Score: 2) by KritonK on Thursday September 29 2016, @01:37PM

    by KritonK (465) on Thursday September 29 2016, @01:37PM (#407890)

    If i input

    我隻氣墊船裝滿晒鱔

    which I am assured [omniglot.com] means "my hovercraft is full of eels", I get:

    I only hovercraft filled with eel.

    It leaves a lot to be desired.

    • (Score: 3, Informative) by Francis on Thursday September 29 2016, @02:00PM

      by Francis (5544) on Thursday September 29 2016, @02:00PM (#407900)

      Looks like the original Chinese isn't right. The second character is wrong, it should be "我的氣墊船裝滿晒鱔" if you type that into the translator you get almost exactly the translation you would expect, and arguably completely correct.

      • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @02:56PM

        by Anonymous Coward on Thursday September 29 2016, @02:56PM (#407936)

        That's a problem with machine translations. If one character is off a machine destroys the entire sentence. A human, like yourself, can tell it was a minor mistake and can still correctly translate the intent. So the machines still need improvement.

        • (Score: 2) by jdavidb on Thursday September 29 2016, @08:15PM

          by jdavidb (5690) on Thursday September 29 2016, @08:15PM (#408101) Homepage Journal

          That's a problem with machine translations. If one character is off a machine destroys the entire sentence

          On the plus side, that sounds like a very effective hashing algorithm!

          --
          ⓋⒶ☮✝🕊 Secession is the right of all sentient beings
        • (Score: 1) by Francis on Thursday September 29 2016, @09:13PM

          by Francis (5544) on Thursday September 29 2016, @09:13PM (#408121)

          Chinese is particularly problematic because they haven't discovered spaces between words. As a result word segmentation issues abound.
          Most other languages have them. It's rather inefficient to have to mentally insert white space when reading and makes it hard to identify words.

          Making things worse, the language has a ton of particles that aren't words, but are required for grammatical reasons.

        • (Score: 2) by darkfeline on Friday September 30 2016, @03:29AM

          by darkfeline (1030) on Friday September 30 2016, @03:29AM (#408223) Homepage

          That's a false premise though. Not all human Chinese speakers would have caught that error, and many Chinese speakers are functionally illiterate.

          Comparing the best of human performance against the worst of computer performance is hardly fair, and is denial at best, of the frightening potential of machine learning.

          --
          Join the SDF Public Access UNIX System today!
    • (Score: 2) by moondrake on Thursday September 29 2016, @02:14PM

      by moondrake (2658) on Thursday September 29 2016, @02:14PM (#407905)

      Well..there is Chinese and there is Chinese. I am by no means fluent but would translate the sentence as google did, even although I end up with nonsense (as if the original made much sense...). Then I followed your link and saw you copied a Cantonese sentence... Often the writing has roughly the same meaning though (even although the talking is unintelligible to me, and probably to most native mandarin speakers). But it is not the same.

      Google does not seem to translate Cantonese. But it does translate the mandarin Chinese correctly.

  • (Score: 0) by Anonymous Coward on Thursday September 29 2016, @05:14PM

    by Anonymous Coward on Thursday September 29 2016, @05:14PM (#408022)

    How else would you explain that they can harness a Neural network but to assume that the Borg have finally arrived and they are Google?

    Sincerely,

    Somebody who's involved in actual neuroscience IT, not virtual computer "neural" crap.