Anonymous Coders Could Be Identified Even from Compiled Code

posted by martyb on Wednesday March 21 2018, @04:35PM

from the rigid-coding-guidelines++ dept.

Anonymous coders can be identified using stylometry and machine learning techniques applied to executable binaries:

Source code stylometry – analyzing the syntax of source code for clues about the author – is an established technique used in digital forensics. As the US Army Research Laboratory (ARL) puts it, "Stylometry research has proven that anonymous code contributors can be de-anonymized to reveal the original author, provided the author has published code before."
The technique can help identify virus makers as well as unmask the creators of anti-censorship tools and other outlawed programs. It has the potential to pierce the privacy that many programmers assume they have.
Source code is designed to be human-readable, but binaries – typically produced by compiling or assembling source code – have fewer characteristics that may suggest authorship. Toolchains can be instructed to strip out variable names, function names and other symbols and metadata – which may say something about the author – and alter the structure of code through optimization.
Nonetheless, the researchers – Aylin Caliskan, Fabian Yamaguchi, Edwin Dauber, Richard Harang, Konrad Rieck, Rachel Greenstadt and Arvind Narayanan – building on work described in a 2011 paper, demonstrate that binary files can be analyzed using machine-learning and stylometric techniques.

If you want to remain an anonymous coder, you'd better not contribute anything under your own name publicly:

When Coding Style Survives Compilation: De-anonymizing Programmers from Executable Binaries (arXiv:1512.08546 [cs.CR])

We evaluate our approach on data from the Google Code Jam, obtaining attribution accuracy of up to 96% with 100 and 83% with 600 candidate programmers. We present an executable binary authorship attribution approach, for the first time, that is robust to basic obfuscations, a range of compiler optimization settings, and binaries that have been stripped of their symbol tables. We perform programmer de-anonymization using both obfuscated binaries, and real-world code found "in the wild" in single-author GitHub repositories and the recently leaked Nulled.IO hacker forum. We show that programmers who would like to remain anonymous need to take extreme countermeasures to protect their privacy.
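To make the general idea concrete, here is a toy sketch of attributing a binary to one of several known authors. It is not the paper's method: it swaps in raw byte n-gram counts and cosine similarity as a simple stand-in for whatever features and classifier the researchers actually used. The function names (`ngram_profile`, `cosine`, `attribute`) and the sample byte strings are made up for illustration.

```python
from collections import Counter
from math import sqrt


def ngram_profile(data: bytes, n: int = 2) -> Counter:
    """Count overlapping byte n-grams in a blob of machine code."""
    return Counter(data[i:i + n] for i in range(len(data) - n + 1))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def attribute(unknown: bytes, known: dict[str, list[bytes]], n: int = 2) -> str:
    """Return the candidate author whose merged n-gram profile is most similar."""
    profiles = {}
    for author, samples in known.items():
        merged = Counter()
        for sample in samples:
            merged.update(ngram_profile(sample, n))
        profiles[author] = merged
    target = ngram_profile(unknown, n)
    return max(profiles, key=lambda author: cosine(target, profiles[author]))


if __name__ == "__main__":
    # Hypothetical training samples: made-up byte strings standing in for
    # previously published binaries from two candidate authors.
    known = {
        "alice": [b"\x55\x48\x89\xe5\x31\xc0\x5d\xc3" * 4],
        "bob": [b"\x53\x48\x83\xec\x10\x48\x83\xc4\x10\x5b\xc3" * 4],
    }
    unknown = b"\x55\x48\x89\xe5\x31\xc0\x31\xc0\x5d\xc3"
    print(attribute(unknown, known))
```

A sketch like this also shows why the caveat in the ARL quote matters: the approach can only pick among candidates for whom labelled samples already exist, so an author with no prior published code has no profile to match against.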



SilkGithub.onion Here We Come... (Score: 0) by Anonymous Coward on Wednesday March 21 2018, @04:47PM

Just adopt a different style (Score: 2) by meustrus on Wednesday March 21 2018, @05:11PM (8 children)

Re:Just adopt a different style (Score: 2) by Snotnose on Wednesday March 21 2018, @05:52PM (5 children)

Re:Just adopt a different style (Score: 2, Insightful) by Anonymous Coward on Wednesday March 21 2018, @07:06PM (2 children)

Re:Just adopt a different style (Score: 0) by Anonymous Coward on Wednesday March 21 2018, @11:24PM (1 child)

Re:Just adopt a different style (Score: 2) by Wootery on Thursday March 22 2018, @09:50AM

Re:Just adopt a different style (Score: 2) by pipedwho on Thursday March 22 2018, @03:34AM

Re:Just adopt a different style (Score: 2) by Wootery on Thursday March 22 2018, @04:44PM

Re:Just adopt a different style (Score: 3, Funny) by Anonymous Coward on Wednesday March 21 2018, @09:53PM (1 child)

Re:Just adopt a different style (Score: 0) by Anonymous Coward on Thursday March 22 2018, @08:43AM

Look at me, I know how to bullshit a paper (Score: 0, Offtopic) by cocaine overdose on Wednesday March 21 2018, @05:14PM (6 children)

Re:Look at me, I know how to bullshit a paper (Score: 0) by Anonymous Coward on Wednesday March 21 2018, @05:22PM (1 child)

Re:Look at me, I know how to bullshit a paper (Score: 1, Funny) by cocaine overdose on Wednesday March 21 2018, @05:30PM

Re:Look at me, I know how to bullshit a paper (Score: 0) by Anonymous Coward on Wednesday March 21 2018, @06:04PM (3 children)

Re:Look at me, I know how to bullshit a paper (Score: 3, Touché) by cocaine overdose on Wednesday March 21 2018, @06:27PM (2 children)

Re:Look at me, I know how to bullshit a paper (Score: 0) by Anonymous Coward on Wednesday March 21 2018, @11:44PM (1 child)

Re:Look at me, I know how to bullshit a paper (Score: 2) by cocaine overdose on Thursday March 22 2018, @12:25AM

Comment Below Threshold

no taxes for you (Score: -1, Offtopic) by Anonymous Coward on Wednesday March 21 2018, @05:14PM

Identified anonymous coders eh (Score: 3, Interesting) by captain normal on Wednesday March 21 2018, @05:28PM

Outlawed (Score: 3, Insightful) by Virindi on Wednesday March 21 2018, @05:31PM (2 children)

Re:Outlawed (Score: 4, Informative) by khallow on Wednesday March 21 2018, @05:54PM (1 child)

Unauthorized (Score: 2) by fyngyrz on Wednesday March 21 2018, @08:49PM

Working from a set of 100 programmers is easy.. (Score: 3, Informative) by jimtheowl on Wednesday March 21 2018, @06:16PM (4 children)

Re:Working from a set of 100 programmers is easy.. (Score: 2) by bob_super on Wednesday March 21 2018, @06:34PM (2 children)

Re:Working from a set of 100 programmers is easy.. (Score: 2) by jimtheowl on Wednesday March 21 2018, @07:06PM (1 child)

Re:Working from a set of 100 programmers is easy.. (Score: 5, Funny) by Anonymous Coward on Wednesday March 21 2018, @07:10PM

Re:Working from a set of 100 programmers is easy.. (Score: 2) by c0lo on Thursday March 22 2018, @02:24AM

Fight it with more algorithms (Score: 4, Interesting) by DannyB on Wednesday March 21 2018, @06:24PM (4 children)

Re:Fight it with more algorithms (Score: 0) by Anonymous Coward on Wednesday March 21 2018, @09:52PM

Re:Fight it with more algorithms (Score: 0) by Anonymous Coward on Thursday March 22 2018, @03:03AM (2 children)

Re:Fight it with more algorithms (Score: 2) by maxwell demon on Thursday March 22 2018, @12:35PM (1 child)

Re:Fight it with more algorithms (Score: 2) by DannyB on Thursday March 22 2018, @02:29PM

Interesting paper, but anonymity lives on (Score: 5, Insightful) by bradley13 on Wednesday March 21 2018, @07:16PM (2 children)

Re:Interesting paper, but anonymity lives on (Score: 2) by Common Joe on Wednesday March 21 2018, @08:10PM

Re:Interesting paper, but anonymity lives on (Score: 2) by maxwell demon on Thursday March 22 2018, @12:42PM

This Is Why ... (Score: 4, Funny) by Anonymous Coward on Wednesday March 21 2018, @07:45PM (2 children)

Re:This Is Why ... (Score: 2) by LoRdTAW on Thursday March 22 2018, @02:46AM (1 child)

Re:This Is Why ... (Score: 1, Funny) by Anonymous Coward on Thursday March 22 2018, @09:52AM

Freedom of speech (Score: 1, Interesting) by Anonymous Coward on Thursday March 22 2018, @10:45AM