Stories
Slash Boxes
Comments

SoylentNews is people

SoylentNews is powered by your submissions, so send in your scoop. Only 15 submissions in the queue.
posted by takyon on Friday July 10 2015, @12:50PM   Printer-friendly
from the social-growth-medium dept.

In the age of Big Data, it turns out that the largest, fastest growing data source lies within your cells.

Quantitative biologists at the University of Illinois Urbana-Champaign and Cold Spring Harbor Laboratory, in New York, found that genomics reigns as champion over three of the biggest data domains around: astronomy, Twitter, and YouTube.

The scientists determined which would expand the fastest by evaluating acquisition, storage, distribution, and analysis of each set of data. Genomes are quantified by their chemical constructs, or base pairs. Genomics trumps other data generators because the genome sequencing rate doubles every seven months. If it maintains this rate, by 2020 more than one billion billion bases will be sequenced and stored per year, or 1 exabase. By 2025, researchers estimate the rate will be almost one zettabase, one trillion billion bases, per sequence per year.

Cripes, wouldn't you hate to be the guy paying the electricity bill to process that much data?

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 2) by ikanreed on Friday July 10 2015, @03:05PM

    by ikanreed (3164) Subscriber Badge on Friday July 10 2015, @03:05PM (#207470) Journal

    Interesting. I'll have to rethink (and further research) my understanding of cancer.

    Starting Score:    1  point
    Karma-Bonus Modifier   +1  

    Total Score:   2  
  • (Score: 0) by Anonymous Coward on Friday July 10 2015, @03:39PM

    by Anonymous Coward on Friday July 10 2015, @03:39PM (#207491)

    You can see the error in this paper. The parameter p refers to probability of a tumor cell forming:

    For small p (which is always the case in realistic scenarios) , eqn (1) can be usefully (and very accurately) simplified

    http://www.ncbi.nlm.nih.gov/pubmed/23396311 [nih.gov]

    The models they use have simplifying assumptions built in that are incompatible with the peak in age specific incidence. Equation 5 in that paper will only increase with age. To get the peaks, the tumor cells must form much more often, but most do not generate tumors (either they go dormant or die off). It is also possible the multistage theory (cancer is caused by some kind of accumulation of errors) is totally wrong, but it makes a lot of sense.