posted on Saturday February 04 2017, @09:51AM
from the if-at-first-you-don't-succeed dept.

Humanity would understand very little about cancer, and be hard-pressed to find cures, without scientific research. But what if, when teams recreated each other's research, they didn't arrive at the same result?

That's what the Center for Open Science's Reproducibility Project: Cancer Biology is trying to find out by redoing parts of 50 important cancer studies and comparing the results. The project released its first five replications today, and not all of the data matches up. At least once in every paper, a result reported as statistically significant (the way scientists calculate whether an effect is caused by more than chance alone) was not statistically significant in the replicated study. In two of the cases, the differences between the initial and replicated studies were even more striking, giving the Center for Open Science researchers cause for concern.
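
To make the "statistically significant" parenthetical concrete, here is a toy illustration in Python with scipy, using invented numbers that have nothing to do with the actual papers: the same treatment-versus-control comparison can clear the conventional p < 0.05 bar in one experiment and miss it in a rerun.

    from scipy import stats

    # Invented measurements purely to illustrate the p < 0.05 convention;
    # these are not data from the replicated studies.
    control_orig = [10.1, 10.3, 9.9, 10.2, 10.0]
    treated_orig = [11.0, 11.2, 10.8, 11.1, 10.9]

    control_repl = [10.1, 10.5, 9.7, 10.4, 9.8]
    treated_repl = [10.4, 10.0, 10.6, 9.9, 10.6]

    # Welch's t-test; "significant" conventionally means p < 0.05.
    _, p_orig = stats.ttest_ind(treated_orig, control_orig, equal_var=False)
    _, p_repl = stats.ttest_ind(treated_repl, control_repl, equal_var=False)

    print(f"original:    p={p_orig:.4f}  significant: {p_orig < 0.05}")
    print(f"replication: p={p_repl:.4f}  significant: {p_repl < 0.05}")

The first comparison comes out significant and the second does not; a mismatch of that shape, repeated across papers, is what the project is reporting.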

"I was surprised by the results because of all that homework that we did" to make sure the studies were being reproduced accurately, Tim Errington, Project Manager at the Center for Open Science told Gizmodo. "We thought we were crossing every T and dotting every I... Seeing some of these experimental systems not behave the same was something I was not expecting to happen."


Original Submission

 
This discussion has been archived. No new comments can be posted.
  • (Score: 2) by TheRaven (270) on Saturday February 04 2017, @04:25PM (#462862) Journal

    At least they do some statistical analysis. In computer science, many systems research papers, even at top-tier conferences, don't even report error margins and will happily report a mean to 6 significant figures with no mention of variance. The slightly better ones will give a standard deviation, but often the distribution isn't normal, so that's barely meaningful. The only ones that do any kind of t-test will use Student's t-test and forget that their two samples have different variances, so the result is largely meaningless. I think I've only ever seen one use Welch's t-test, which actually does account for this.
    --
    sudo mod me up
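
    As a sketch of both complaints above, in Python with scipy (synthetic benchmark numbers, not from any paper): report a spread alongside the mean, and note that scipy.stats.ttest_ind runs Student's t-test by default and only becomes Welch's t-test with equal_var=False.

        import numpy as np
        from scipy import stats

        # Synthetic timings (illustrative only): two systems with similar
        # means but very different variances and sample sizes.
        rng = np.random.default_rng(0)
        a = rng.normal(loc=100.0, scale=1.0, size=30)   # low-variance sample
        b = rng.normal(loc=100.8, scale=8.0, size=10)   # high-variance sample

        # Report the mean together with its spread, not bare significant figures.
        print(f"a: mean={a.mean():.2f} sd={a.std(ddof=1):.2f} n={a.size}")
        print(f"b: mean={b.mean():.2f} sd={b.std(ddof=1):.2f} n={b.size}")

        # Student's t-test pools the two variances (scipy's default)...
        t_s, p_s = stats.ttest_ind(a, b)                 # equal_var=True
        # ...Welch's t-test does not, which suits unequal variances better.
        t_w, p_w = stats.ttest_ind(a, b, equal_var=False)

        print(f"Student: t={t_s:.2f} p={p_s:.3f}")
        print(f"Welch:   t={t_w:.2f} p={p_w:.3f}")

    With unequal variances and unequal sample sizes the two tests can disagree noticeably, which is why Welch's version is the safer default.
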
  • (Score: 3, Insightful) by mhajicek (51) on Saturday February 04 2017, @05:53PM (#462878)

    Perhaps it should become standard to publish the full data set, so that anyone can do their own statistical analysis.

    --
    The spacelike surfaces of time foliations can have a cusp at the surface of discontinuity. - P. Hajicek
    • (Score: 2) by deimtee (3272) on Saturday February 04 2017, @10:57PM (#462960) Journal

      I agree that publishing all the data would be best, but in human studies it is possible you will run into patient confidentiality problems. At the very least there would be extra work involved in anonymizing patient data. (However, any animal-based studies should be regarded as very suspect if they don't make the full dataset available.)

      --
      If you cough while drinking cheap red wine it really cleans out your sinuses.
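
      On the anonymizing point, a minimal sketch in Python of what that extra work can look like, assuming a hypothetical patients.csv whose only direct identifier is a patient_id column: replace each ID with a salted one-way hash. This is pseudonymization only; genuinely de-identifying human data takes considerably more, since quasi-identifiers like age and location can still single people out.

          import csv
          import hashlib
          import secrets

          # A random salt, kept secret by the study team, prevents
          # re-identification by hashing guessed patient IDs.
          salt = secrets.token_hex(16)

          def pseudonymize(patient_id: str) -> str:
              """Replace a patient ID with a salted, one-way hash."""
              return hashlib.sha256((salt + patient_id).encode()).hexdigest()[:12]

          # patients.csv is a hypothetical input with a patient_id column.
          with open("patients.csv", newline="") as src, \
               open("patients_anon.csv", "w", newline="") as dst:
              reader = csv.DictReader(src)
              writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
              writer.writeheader()
              for row in reader:
                  row["patient_id"] = pseudonymize(row["patient_id"])
                  writer.writerow(row)
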
  • (Score: 0) by Anonymous Coward on Saturday February 04 2017, @09:56PM (#462949)

    > I think I've only ever seen one use Welch's t-test, which actually does account for this.

    The t-test that R uses by default is Welch's t-test. I don't think there is even a useful case where Student's t-test should be used over Welch's.
    In many biological cases one would do an ANOVA anyway, instead of multiple t-tests.
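
    On that last point, a short sketch with made-up numbers: instead of running three pairwise t-tests across three groups (each carrying its own false-positive rate), a one-way ANOVA asks once whether any group mean differs; in Python, scipy.stats.f_oneway does this.

        from scipy import stats

        # Made-up measurements for three treatment groups (illustrative only).
        control = [4.2, 4.0, 4.5, 4.1, 4.3]
        drug_a  = [4.8, 5.1, 4.9, 5.0, 4.7]
        drug_b  = [4.3, 4.4, 4.1, 4.5, 4.2]

        # One-way ANOVA: a single test of whether any group mean differs,
        # rather than three pairwise comparisons.
        f_stat, p_value = stats.f_oneway(control, drug_a, drug_b)
        print(f"ANOVA: F={f_stat:.2f} p={p_value:.4f}")

    If the ANOVA comes out significant, a post-hoc test (e.g. Tukey's HSD) then identifies which groups differ.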