Stories
Slash Boxes
Comments

SoylentNews is people

posted by mrpg on Saturday September 01 2018, @07:01AM   Printer-friendly
from the blame-humans-of-course dept.

New research has shown just how bad AI is at dealing with online trolls.

Such systems struggle to automatically flag nudity and violence, don’t understand text well enough to shoot down fake news and aren’t effective at detecting abusive comments from trolls hiding behind their keyboards.

A group of researchers from Aalto University and the University of Padua found this out when they tested seven state-of-the-art models used to detect hate speech. All of them failed to recognize foul language when subtle changes were made, according to a paper [PDF] on arXiv.

Adversarial examples can be created automatically by using algorithms to misspell certain words, swap characters for numbers or add random spaces between words or attach innocuous words such as ‘love’ in sentences.

The models failed to pick up on adversarial examples and successfully evaded detection. These tricks wouldn’t fool humans, but machine learning models are easily blindsighted. They can’t readily adapt to new information beyond what’s been spoonfed to them during the training process.


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 0) by Anonymous Coward on Sunday September 02 2018, @01:20AM (3 children)

    by Anonymous Coward on Sunday September 02 2018, @01:20AM (#729375)

    Dick niggers FTW - kept TMB true to his "free speech" promise.

  • (Score: 2) by The Mighty Buzzard on Sunday September 02 2018, @01:36AM (2 children)

    by The Mighty Buzzard (18) Subscriber Badge <themightybuzzard@proton.me> on Sunday September 02 2018, @01:36AM (#729379) Homepage Journal

    Free speech as defined on this site has never included spam, just for clarity's sake.

    --
    My rights don't end where your fear begins.
    • (Score: 0) by Anonymous Coward on Sunday September 02 2018, @02:46AM (1 child)

      by Anonymous Coward on Sunday September 02 2018, @02:46AM (#729401)

      Maybe, but the countermeasures you took hurt free speech - nobody could use "dick niggers" even in non-spammy expression of speech; in effect you banned some words.

      • (Score: 2) by The Mighty Buzzard on Sunday September 02 2018, @10:42AM

        by The Mighty Buzzard (18) Subscriber Badge <themightybuzzard@proton.me> on Sunday September 02 2018, @10:42AM (#729464) Homepage Journal

        We also couldn't talk about viagra for a long time but I didn't hear anyone complaining about that. Simple word filters around here are going to happen because they're quick and stop most spammy jackassery. They're also meant to be temporary, lasting only as long as necessary to get the spammer to fuck off to greener sites.

        Any time you're curious about what words are being filtered, ask any admin. It takes like three seconds to look up. Right now the list includes:

        --
        My rights don't end where your fear begins.