Stories
Slash Boxes
Comments

SoylentNews is people

posted by Fnord666 on Wednesday September 25 2019, @02:11AM   Printer-friendly
from the AI-knows-where-you-are-hiding dept.

Surprising behaviours emerge when Open AI created a Hide and Seek game
https://openai.com/blog/emergent-tool-use/
(Lots of good GIF animations in the article)

AI bots were tasked with being the best "hider" or "seeker" in a 3D virtual environment.

But what resulted was cunning, brute force and downright cheating.

From blocking doors to stop seekers, to using ramps to get over walls. From stealing the ramps to stop the previous behaviour to "block surfing", not to mention accelerating objects into walls to exhibit glitches.

This is not the first time AI has done what is asked of it with a few unintended results.

https://hackaday.com/2012/07/09/on-not-designing-circuits-with-evolutionary-algorithms/
https://hackaday.com/2018/11/12/how-to-evolve-a-radio/
https://hackaday.com/2018/11/11/the-naughty-ais-that-gamed-the-system/


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 3, Interesting) by The Mighty Buzzard on Wednesday September 25 2019, @02:59AM (10 children)

    by The Mighty Buzzard (18) Subscriber Badge <themightybuzzard@proton.me> on Wednesday September 25 2019, @02:59AM (#898376) Homepage Journal

    Breaking rules that you should have thought of but didn't is not cheating, it's ingenuity. Ask any tax attorney.

    --
    My rights don't end where your fear begins.
    Starting Score:    1  point
    Moderation   +1  
       Interesting=1, Total=1
    Extra 'Interesting' Modifier   0  
    Karma-Bonus Modifier   +1  

    Total Score:   3  
  • (Score: 2) by Mykl on Wednesday September 25 2019, @04:41AM

    by Mykl (1112) on Wednesday September 25 2019, @04:41AM (#898391)

    Just ask any speedrunner.

  • (Score: 0) by Anonymous Coward on Wednesday September 25 2019, @08:08AM

    by Anonymous Coward on Wednesday September 25 2019, @08:08AM (#898414)

    It is the same mentality as Jewish scholars in interpreting God's perfect laws: the perfect rules were written in a particular way to forbid particular actions but allow the rest. Therefore, any loopholes in the rules that allow an action must have been put there on purpose for us to use; otherwise, they would have been forbidden in the first place.

  • (Score: 3, Interesting) by FatPhil on Wednesday September 25 2019, @10:12AM (7 children)

    by FatPhil (863) <{pc-soylent} {at} {asdf.fi}> on Wednesday September 25 2019, @10:12AM (#898434) Homepage
    Wrong. Breaking rules is cheating.

    However, working out things that are not forbidden by the rules is not breaking rules, and not cheating. And that's what these agents were doing. They were not even told any rules apart from "if hunters find seekers, hunters win, otherwise they lose" which is enforced post-facto by fiat, so there were no rules that could be broken anyway.

    The youtube vid (as there are no GIFs in that article, grrr!) http://www.youtube.com/watch?v=kopoLzvh5jY , seems to over-inflate their claims for the agents' discovery of how to play the game. They must know more than just being able to move about, as otherwise they'd not be able to heft the scenery around (as they don't just shove it by barging into it, they pull it too - so they must have been preprogrammed with the concept of taking and releasing a grip on the object. Similarly, they were programmed with the capability to block-surf, as if all you to is move about, you'd drop off the side of the block, there must be some option for them to do this surfing thing as an alternative. OK, they weren't preprogrammed with any information on what these actions would be good for, but still, they were told they could do it right from the start, and therefore implicitly that they may do it, as they weren't told they may not do it even though they could do it.

    To model something closer to cheating, they should be told that they have the capability to do something, and then get negative reinforcement (an immediate loss) if the opposing side sees them do that thing. They will then only "cheat" when they think that they can get away with it. However, that's still not "cheating", as there are plenty of human games where you're allowed to do something, but lose if you're seen doing it. It would only be cheating if they are programmed to know that violating the rule will lose them the game, but at the end of the game they sometimes don't get punished for it (for instance, if the opponents don't see the infraction).
    --
    Great minds discuss ideas; average minds discuss events; small minds discuss people; the smallest discuss themselves
    • (Score: 2) by rob_on_earth on Wednesday September 25 2019, @12:07PM

      by rob_on_earth (5485) on Wednesday September 25 2019, @12:07PM (#898456) Homepage

      That is really scary.

      All those mini animations running in short loops, I figured they were GIF animations, cos, why would you use 31 full blown vimeo iframes?

      document.querySelectorAll("iframe").length

    • (Score: 2) by The Mighty Buzzard on Wednesday September 25 2019, @12:49PM (2 children)

      by The Mighty Buzzard (18) Subscriber Badge <themightybuzzard@proton.me> on Wednesday September 25 2019, @12:49PM (#898465) Homepage Journal

      Reread. "rules that you should have thought of but didn't" means they are not rules, just your assumptions.

      --
      My rights don't end where your fear begins.
    • (Score: 2) by Freeman on Wednesday September 25 2019, @06:36PM

      by Freeman (732) on Wednesday September 25 2019, @06:36PM (#898674) Journal

      Exploiting bugs can still get you banned from online games, because it's cheating. Just because, a rule wasn't made to specifically cover this one instance of cheating, doesn't mean it's not cheating.

      --
      Joshua 1:9 "Be strong and of a good courage; be not afraid, neither be thou dismayed: for the Lord thy God is with thee"
    • (Score: 3, Interesting) by toddestan on Sunday September 29 2019, @10:19PM (1 child)

      by toddestan (4982) on Sunday September 29 2019, @10:19PM (#900558)

      The block-surfing sounds more like an exploit. It sounds like they can "grab" an object and when they move the object moves with them. Someone didn't put a check in the physics engine that disallows them from "grabbing" an object while on top of it.

      • (Score: 2) by FatPhil on Monday September 30 2019, @10:07AM

        by FatPhil (863) <{pc-soylent} {at} {asdf.fi}> on Monday September 30 2019, @10:07AM (#900748) Homepage
        Possibly, and a very interesting possibility at that, but I would presume there is some kind of where-is-the-block-relative-to-me check, such that you can't grab something that's behind you. Someone was clearly not very good at defining "in front" if that's the case.
        --
        Great minds discuss ideas; average minds discuss events; small minds discuss people; the smallest discuss themselves