Stories
Slash Boxes
Comments

SoylentNews is people

posted by martyb on Thursday March 05 2020, @01:10AM   Printer-friendly
from the seeing-is-mis-believing dept.

Arthur T Knackerbracket has found the following story:

Researchers at Soluble today said they worked with Verisign to thwart the registration of domain names that use homoglyphs – non-Latin characters that look just like letters of the Latin alphabet – to masquerade as legit domains.

[...] There have been a number of efforts over the years, most recently in 2017, we reckon, to rid the internet of homograph abuse once and for all.

In the most recent case, it was found that the Unicode Latin IPA Extension characters could and were being exploited to setup lookalike domains.

"Between 2017 and today, more than a dozen homograph domains have had active HTTPS certificates," noted Soluble researcher Matt Hamilton. "This included prominent financial, internet shopping, technology, and other Fortune 100 sites. There is no legitimate or non-fraudulent justification for this activity."

Normally, it would not be possible to register domains with mixed scripts, as Verisign put protections in place years ago. However, the researchers found that those protections did not extend to Unicode Latin IPA, meaning that prior to Verisign updating its filters after being tipped off by Soluble, the characters could be used to set up lookalike URLs.

[...] "While it is unlikely that you, the reader, were attacked with this technique," Hamilton notes, "it is likely that this technique was used in highly targeted social-engineering campaigns."

-- submitted from IRC

The most notable of these confusables are:

Latin:agl
IPA:ɑɡɩ

It is much easier to tell them apart when the confusables are shown adjacent to each other. In the spoiler below, only one of the entries is correct... how long does it take you to figure out which one it is?

  1. google.ɑpis
     
  2. ɡoogle.ɑpis
     
  3. ɡoogle.apis
     
  4. gooɡle.apis
     
  5. google.apis
     
  6. ɡooɡle.ɑpis
     
  7. ɡooɡle.apis
     
  8. gooɡle.ɑpis
     

Are you sure? This is the number of the correct entry:

Are you really sure?
Did you pick number 6?
That was wrong. It was number 5.

Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
(1)
  • (Score: 2, Interesting) by fustakrakich on Thursday March 05 2020, @01:15AM (12 children)

    by fustakrakich (6150) on Thursday March 05 2020, @01:15AM (#966772) Journal

    This problem would go away?

    --
    La politica e i criminali sono la stessa cosa..
    • (Score: 4, Funny) by NPC-131072 on Thursday March 05 2020, @01:19AM

      by NPC-131072 (7144) on Thursday March 05 2020, @01:19AM (#966773) Journal

      homophobe!

    • (Score: 2) by takyon on Thursday March 05 2020, @01:30AM (6 children)

      by takyon (881) <{takyon} {at} {soylentnews.org}> on Thursday March 05 2020, @01:30AM (#966775) Journal

      We need to put Klingon in Unicode.

      --
      [SIG] 10/28/2017: Soylent Upgrade v14 [soylentnews.org]
    • (Score: 1) by Ethanol-fueled on Thursday March 05 2020, @01:58AM

      by Ethanol-fueled (2792) on Thursday March 05 2020, @01:58AM (#966792) Homepage

      We will go away, you fucksuckers. Let's GO!

    • (Score: 2, Touché) by Anonymous Coward on Thursday March 05 2020, @05:10AM

      by Anonymous Coward on Thursday March 05 2020, @05:10AM (#966854)

      Only in the same sense as "killing all humans will prevent killing of humans". You don't have to kill Unicode to not allow anything outside of latin characters, numbers and a dash in URLs.

    • (Score: 4, Interesting) by maxwell demon on Thursday March 05 2020, @07:06AM (1 child)

      by maxwell demon (1608) Subscriber Badge on Thursday March 05 2020, @07:06AM (#966874) Journal

      It would be sufficient if punicode domains would be shown in their ASCII form.

      Indeed, it probably would be sufficient if punicode domains would be shown in a different colour. Or even better, show only those letters in different colour that are punicode-generated.

      For example, in punicode, "ɡoogle.apis" is encoded "xn--oogle-qmc.apis". Thus the domain would show (using bold instead of colour for obvious reasons) as "ɡoogle.apis". The difference being easy to spot this way.

      --
      The Tao of math: The numbers you can count are not the real numbers.
      • (Score: 2, Informative) by Anonymous Coward on Thursday March 05 2020, @07:59AM

        by Anonymous Coward on Thursday March 05 2020, @07:59AM (#966887)
        Which is exactly what Chrome and Firefox already do with the mixed-script domains given as examples. All of them except for the valid one are rendered as punycode.
  • (Score: 2, Funny) by Anonymous Coward on Thursday March 05 2020, @02:21AM (3 children)

    by Anonymous Coward on Thursday March 05 2020, @02:21AM (#966797)

    Send your donations to S0ylentnews.org... thanks for your generous support.

    • (Score: 3, Funny) by takyon on Thursday March 05 2020, @02:30AM

      by takyon (881) <{takyon} {at} {soylentnews.org}> on Thursday March 05 2020, @02:30AM (#966799) Journal

      hɑx

      --
      [SIG] 10/28/2017: Soylent Upgrade v14 [soylentnews.org]
    • (Score: 2, Funny) by Anonymous Coward on Thursday March 05 2020, @03:09AM

      by Anonymous Coward on Thursday March 05 2020, @03:09AM (#966810)
      Try soyɩentnews.orɡ instead.
    • (Score: 3, Funny) by maxwell demon on Thursday March 05 2020, @07:14AM

      by maxwell demon (1608) Subscriber Badge on Thursday March 05 2020, @07:14AM (#966879) Journal

      Use ꜱοylentNews.οrɡ instead.

      --
      The Tao of math: The numbers you can count are not the real numbers.
  • (Score: 3, Interesting) by stormwyrm on Thursday March 05 2020, @02:44AM

    by stormwyrm (717) on Thursday March 05 2020, @02:44AM (#966802) Journal

    Firefox has an algorithm [mozilla.org] for detecting and mitigating these kinds of IDN homograph attacks. Chrome seems to use a similar algorithm [chromium.org] as well. Both algorithms are based on the recommendations of UTS #39 [unicode.org]. These characters from the IPA Extension Block appear to be part of the classes of characters that UTS #39 defines as Restricted, namely:

    The Restricted characters are characters not in common use, and they can be blocked to further reduce the possibilities for visual confusion. They include the following:

    • characters not in modern use
    • characters only used in specialized fields, such as liturgical characters, phonetic letters, and mathematical letter-like symbols
    • characters in limited use by very small communities

    (emphasis added). So domains using them shouldn't display normally, coming out as punycode garbage. Indeed, Firefox 73.0.1 displays all of the domain names in the test, except for the proper one, using punycode, so #1 comes out as google.xn--pis-fsb, and #6 appears as xn--oole-z7bc.xn--pis-fsb. Chromium 79.0.3945.130 does same thing. The attack doesn't seem to work on these modern browsers at least.

    --
    Numquam ponenda est pluralitas sine necessitate.
  • (Score: 0) by Anonymous Coward on Thursday March 05 2020, @03:23AM (1 child)

    by Anonymous Coward on Thursday March 05 2020, @03:23AM (#966813)

    They gotta Яuin everything.

    • (Score: 2, Funny) by redneckmother on Thursday March 05 2020, @05:12AM

      by redneckmother (3597) on Thursday March 05 2020, @05:12AM (#966855)

      They gotta Яuin everything.

      At least they didn't rune it.

      --
      Mas cerveza por favor.
  • (Score: 3, Funny) by Rosco P. Coltrane on Thursday March 05 2020, @03:27AM

    by Rosco P. Coltrane (4757) on Thursday March 05 2020, @03:27AM (#966815)

    No gay mathematicians, no homographs. Simple!

  • (Score: 0) by Anonymous Coward on Thursday March 05 2020, @06:09AM (5 children)

    by Anonymous Coward on Thursday March 05 2020, @06:09AM (#966861)

    rest all look the same over here on ubuntu 16.04, chromium.
    so 3, 4, 5 and 7 all look the same.

    • (Score: 2) by maxwell demon on Thursday March 05 2020, @07:21AM (4 children)

      by maxwell demon (1608) Subscriber Badge on Thursday March 05 2020, @07:21AM (#966882) Journal

      On Mint/Waterfox (but with several additional fonts installed, I can't tell if those make a difference), the only of those letters that look the same are g and ɡ.

      --
      The Tao of math: The numbers you can count are not the real numbers.
      • (Score: 2) by zocalo on Thursday March 05 2020, @08:55AM (3 children)

        by zocalo (302) on Thursday March 05 2020, @08:55AM (#966896)
        Might be the font rendering, but the differences are more subtle at smaller font sizes as well so zooming in a little can make a huge difference, but I've always found these have rendered pretty consistently across browsers and OSs for some years now so I doubt it's Mint/Waterfox. Still, it might have been worth including the solution in another spoiler so you could look closer and see what you may have missed, but the only letters that have been changed are the first "g" (2, 3, 6, 7), the second "g" (4, 6, 7, 8) and the "a" (1, 2, 6, 8).
        --
        UNIX? They're not even circumcised! Savages!
        • (Score: 3, Informative) by maxwell demon on Thursday March 05 2020, @09:40AM (2 children)

          by maxwell demon (1608) Subscriber Badge on Thursday March 05 2020, @09:40AM (#966904) Journal

          On my system, l and ɩ look very different, indeed more different than l and i. In particular, ɩ on my system has the height of a common lowercase letter, while l has the height of an uppercase letter. Moreover, ɩ has an arc at the bottom, while l doesn't.

          So you'd have more chances to fool me with “googie” than to fool me with “googɩe”.

          Actually they look more similar in <tt> () but still, the different height very clearly distinguishes them.

          --
          The Tao of math: The numbers you can count are not the real numbers.
          • (Score: 0) by Anonymous Coward on Friday March 06 2020, @05:55AM (1 child)

            by Anonymous Coward on Friday March 06 2020, @05:55AM (#967304)

            in the summary and in your reply I see different ell-s as well. but in the variants, I see only the regular ell. Maybe it's the spoiler tag, no idea.

            • (Score: 2) by maxwell demon on Friday March 06 2020, @09:31AM

              by maxwell demon (1608) Subscriber Badge on Friday March 06 2020, @09:31AM (#967348) Journal

              Maybe it's because in the spoiler tags there are only normal "l" letters. I just checked by copy-pasting it into a hex converter, and indeed every "l" is represented by the byte 6c, that is, the character “U+006C LATIN SMALL LETTER L”.

              --
              The Tao of math: The numbers you can count are not the real numbers.
  • (Score: 3, Informative) by Anonymous Coward on Thursday March 05 2020, @06:31AM (1 child)

    by Anonymous Coward on Thursday March 05 2020, @06:31AM (#966865)
    Learn it, use it.

    Reject emoti-utf.

    They had their chance, they proved they didn't deserve it.
    • (Score: 2, Funny) by Anonymous Coward on Thursday March 05 2020, @06:41AM

      by Anonymous Coward on Thursday March 05 2020, @06:41AM (#966866)

      😂😂😂

  • (Score: 2) by J_Darnley on Thursday March 05 2020, @04:18PM

    by J_Darnley (5679) on Thursday March 05 2020, @04:18PM (#966964)

    None are valid because ".apis" is not a valid TLD.

    Also Unicode only needs 1 way to express a Latin lower case A. If it worked for CJK then it'll work for Latin.

  • (Score: 1) by Snort on Thursday March 05 2020, @06:53PM

    by Snort (5141) on Thursday March 05 2020, @06:53PM (#967036)

    on a Plam Pilot on eBay back in the day.

    People will always take advantage of human failings.

  • (Score: 0) by Anonymous Coward on Thursday March 05 2020, @09:39PM

    by Anonymous Coward on Thursday March 05 2020, @09:39PM (#967109)

    We do have a known defense. Unfortunately every time FF updates, it seems to revert to shipped default :(
    You would think that by now someone at FF would call a meeting and choose to ship it with the more secure setting.

    Windows solution, but applies to all FF on all platforms:
    https://www.tenforums.com/tutorials/104760-enable-disable-idn-punycode-firefox-address-bar-windows.html [tenforums.com]

(1)