Beijing Genomics Institute - Bioinformatics Technology

De novo sequencing requires assembling billions of short DNA reads into a full genome, which for humans is about three billion base pairs long.

BGI’s computational biologists developed the first successful algorithm, based on graph theory, for assembling the billions of 25- to 75-base-pair reads produced during de novo sequencing by next-generation sequencers, specifically Illumina’s Genome Analyzer. The algorithm, called SOAPdenovo, can assemble a genome in two days and has been used to assemble an array of plant and animal genomes.
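The text above says only that the algorithm is based on graph theory. As an illustration of how a graph can turn overlapping short reads into a longer sequence, the following is a minimal toy sketch of a de Bruijn graph, a common graph structure in short-read assembly. It is not SOAPdenovo's actual code: the function names, the reads, and the value of k are hypothetical, and a real assembler must also handle sequencing errors, repeats, reverse complements, and paired-end information.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Each k-mer is an edge from its prefix to its suffix.
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def extend_contig(graph, start):
    """Walk forward from `start` for as long as the path is unambiguous."""
    in_degree = defaultdict(int)
    for successors in graph.values():
        for s in successors:
            in_degree[s] += 1

    contig = start
    node = start
    visited = {start}
    # Stop at a branch (several successors) or a dead end (no successor).
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        # Also stop where paths merge or where the walk would loop.
        if in_degree[nxt] != 1 or nxt in visited:
            break
        contig += nxt[-1]
        visited.add(nxt)
        node = nxt
    return contig


# Hypothetical toy reads, all drawn from the sequence ATGGCGTGCA.
reads = ["ATGGCG", "GGCGTG", "CGTGCA"]
graph = build_de_bruijn(reads, k=4)
print(extend_contig(graph, "ATG"))  # prints ATGGCGTGCA
```

In this toy case the three overlapping reads collapse into a single unbranched path, so the walk recovers the full original sequence. The reads are never compared with one another pairwise, which is one reason this style of graph suits very large numbers of short reads.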

BGI’s 500-node supercomputer processes 10 terabytes of raw sequencing data every 24 hours from its roughly 30 Illumina Genome Analyzers. The annual budget for the computer center is US$9 million.

SOAPdenovo is part of the Short Oligonucleotide Analysis Package (SOAP), a suite of tools developed by BGI for de novo assembly of human-sized genomes, alignment, SNP detection, resequencing, indel finding, and structural variation analysis. Built for the short reads of Illumina sequencers, SOAPdenovo has been used to assemble multiple human genomes (identifying an eight-kilobase insertion not detected by mapping to the human reference genome) and animal genomes, such as that of the giant panda.
