Stories
Slash Boxes
Comments

SoylentNews is people

posted by martyb on Friday October 25 2019, @05:06PM   Printer-friendly
from the when-it-misunderstands-let-me-do-a-literal-search dept.

Google is currently rolling out a change to its core search algorithm that it says could change the rankings of results for as many as one in ten queries. It's based on cutting-edge natural language processing (NLP) techniques developed by Google researchers and applied to its search product over the course of the past 10 months.

In essence, Google is claiming that it is improving results by having a better understanding of how words relate to each other in a sentence. In one example Google discussed at a briefing with journalists yesterday, its search algorithm was able to parse the meaning of the following phrase: "Can you get medicine for someone pharmacy?"

The old Google search algorithm treated that sentence as a "bag of words," according to Pandu Nayak, Google fellow and VP of search. So it looked at the important words, medicine and pharmacy, and simply returned local results. The new algorithm was able to understand the context of the words "for someone" to realize it was a question about whether you could pick up somebody else's prescription — and it returned the right results.

The tweaked algorithm is based on BERT, which stands for "Bidirectional Encoder Representations from Transformers." Every word of that acronym is a term of art in NLP, but the gist is that instead of treating a sentence like a bag of words, BERT looks at all the words in the sentence as a whole. Doing so allows it to realize that the words "for someone" shouldn't be thrown away, but rather are essential to the meaning of the sentence.

https://www.theverge.com/2019/10/25/20931657/google-bert-search-context-algorithm-change-10-percent-langauge

I can't help wondering how this is handled in other languages. Chinese, for instance, where (I've read that) a word might have several meanings, but context defines the word.


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 2) by Runaway1956 on Friday October 25 2019, @05:49PM

    by Runaway1956 (2926) Subscriber Badge on Friday October 25 2019, @05:49PM (#911751) Journal

    Depending on the initial search string, and the results offered, that's not a "bad" search term. I've often started a search using some term that made sense to me, the results looked crazy, so I altered the search by a single word, or started over, or added a word. It's always been a guessing game, in which you try to guess which single word will be important to your search. Typing out a linguistically correct question is a huge waste of time, in most cases. The search engines teach you to use keywords, rather than full sentences.

    Add in the possibility that the search originates from a mobile device, and NO ONE bothers to type a proper sentence. Well, no one except you.

    Starting Score:    1  point
    Karma-Bonus Modifier   +1  

    Total Score:   2