Abstract

A new approach to assembling large, random shotgun sequencing projects has been developed. The TIGR Assembler overcomes several major obstacles to assembling such projects: the large number of pairwise comparisons required, the presence of repeat regions, chimeras introduced in the cloning process, and sequencing errors. A fast initial comparison of fragments based on oligonucleotide content is used to eliminate the need for a more sensitive comparison between most fragment pairs, thus greatly reducing computer search time. Potential repeat regions are recognized by determining which fragments have more potential overlaps than expected given a random distribution of fragments. Repeat regions are dealt with by increasing the match criteria stringency and by assembling these regions last so that maximum information from nonrepeat regions can be used. The algorithm also incorporates a number of constraints, such as clone length and the placement of sequences from the opposite ends of a clone. TIGR Assembler has been used to assemble the complete 1.8 Mbp Haemophilus influenzae (Fleischmann et al., 1995) and 0.58 Mbp Mycoplasma genitalium (Fraser et al., 1995) genomes.

Keywords

Shotgun sequencingFragment (logic)Computational biologyBiologyGenomeSequence assemblyComputer scienceHybrid genome assemblyShotgunPairwise comparisonNanopore sequencingGeneticsAlgorithmGeneArtificial intelligence

Related Publications

Publication Info

Year
1995
Type
article
Volume
1
Issue
1
Pages
9-19
Citations
408
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

408
OpenAlex

Cite This

Granger G. Sutton, Owen White, Mark D. Adams et al. (1995). TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing Projects. Genome Science and Technology , 1 (1) , 9-19. https://doi.org/10.1089/gst.1995.1.9

Identifiers

DOI
10.1089/gst.1995.1.9