Sometimes the Noise is Signals, Too

posted by Fnord666 on Tuesday December 13 2016, @07:46AM

from the come-on-feel-the-noize dept.

This insight into the information that can be gleaned from data is cool and worrisome in equal measure.

Early in his talk, computer scientist John Hopcroft noted a funny fact about clustering algorithms: they work better on synthetic data than real data. But this is more than an odd tidbit about software.
[...] When we invent our own synthetic data, we try to mimic real data by mixing true information with random distraction–combining "signal" with "noise." But in real data, the divide isn't so clear. What often looks like noise turns out to be the deep structure we haven't grasped yet.
Hopcroft's insight: data doesn't just have one structure. It has many. If I scanned notebooks from a hundred people, and made a database of all the individual letters, I could sort them lots of ways. Alphabetically. Capital/lowercase. Size. Darkness. Handwriting. Each of these is a different layer of structure.
And to understand data–and the world–you've got to reckon with all those layers.
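The "many layers" idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the records, field names (`char`, `writer`), and the `group_by` helper are made up for the example and do not come from the talk. The same four letters fall into different groups depending on which attribute is used as the key.

```python
from collections import defaultdict

# Hypothetical records: each scanned letter carries several independent attributes.
letters = [
    {"char": "a", "writer": "p1"},
    {"char": "A", "writer": "p2"},
    {"char": "b", "writer": "p2"},
    {"char": "B", "writer": "p1"},
]

def group_by(records, key):
    """Partition the records' characters by the value key(record) returns."""
    groups = defaultdict(list)
    for record in records:
        groups[key(record)].append(record["char"])
    return dict(groups)

# Three different "structures" over the same data.
by_letter = group_by(letters, lambda r: r["char"].lower())  # alphabetical identity
by_case = group_by(letters, lambda r: r["char"].isupper())  # capital vs. lowercase
by_writer = group_by(letters, lambda r: r["writer"])        # who wrote it

print(by_letter)
print(by_case)
print(by_writer)
```

None of the three groupings is "the" correct clustering. An algorithm that recovers one of them would treat the variation explained by the other two as noise.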

The part of the video which discusses the above starts around 5:45.




Too easy? (Score: 0) by Anonymous Coward on Tuesday December 13 2016, @08:39AM


media (Score: 3, Insightful) by Anonymous Coward on Tuesday December 13 2016, @08:45AM

Far out, man (Score: 2) by wonkey_monkey on Tuesday December 13 2016, @08:50AM

no video (Score: 2) by VLM on Tuesday December 13 2016, @12:48PM

Re:no video (Score: 1) by charon on Tuesday December 13 2016, @06:21PM

Re:no video (Score: 2) by VLM on Tuesday December 13 2016, @06:44PM


Acid test (Score: 0) by Anonymous Coward on Tuesday December 13 2016, @07:28PM

guessing (Score: 0) by Anonymous Coward on Tuesday December 13 2016, @11:37PM
