Tabix: fast retrieval of sequence features from generic TAB-delimited files

Heng Li Heng Li
2011 Bioinformatics 642 citations

Abstract

Abstract Summary: Tabix is the first generic tool that indexes position sorted files in TAB-delimited formats such as GFF, BED, PSL, SAM and SQL export, and quickly retrieves features overlapping specified regions. Tabix features include few seek function calls per query, data compression with gzip compatibility and direct FTP/HTTP access. Tabix is implemented as a free command-line tool as well as a library in C, Java, Perl and Python. It is particularly useful for manually examining local genomic features on the command line and enables genome viewers to support huge data files and remote custom tracks over networks. Availability and Implementation: http://samtools.sourceforge.net. Contact: hengli@broadinstitute.org

Keywords

PerlComputer sciencePython (programming language)File Transfer ProtocolSQLJavaFile formatSoftwareASCIIWorld Wide WebInformation retrievalDatabaseOperating systemThe Internet

Affiliated Institutions

Related Publications

Publication Info

Year
2011
Type
article
Volume
27
Issue
5
Pages
718-719
Citations
642
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

642
OpenAlex

Cite This

Heng Li (2011). Tabix: fast retrieval of sequence features from generic TAB-delimited files. Bioinformatics , 27 (5) , 718-719. https://doi.org/10.1093/bioinformatics/btq671

Identifiers

DOI
10.1093/bioinformatics/btq671