Abstract

Abstract Background The cost of high-throughput sequencing is rapidly decreasing, allowing researchers to investigate genomic variations across hundreds or even thousands of samples in the post-genomic era. The management and exploration of these large-scale genomic variation data require programming skills. The public genotype querying databases of many species are usually centralized and implemented independently, making them difficult to update with new data over time. Currently, there is a lack of a widely used framework for setting up user-friendly web servers to explore new genomic variation data in diverse species. Results Here, we present SnpHub, a Shiny/R-based server framework for retrieving, analysing, and visualizing large-scale genomic variation data that can be easily set up on any Linux server. After a pre-building process based on the provided VCF files and genome annotation files, the local server allows users to interactively access single-nucleotide polymorphisms and small insertions/deletions with annotation information by locus or gene and to define sample sets through a web page. Users can freely analyse and visualize genomic variations in heatmaps, phylogenetic trees, haplotype networks, or geographical maps. Sample-specific sequences can be accessed as replaced by detected sequence variations. Conclusions SnpHub can be applied to any species, and we build up a SnpHub portal website for wheat and its progenitors based on published data in recent studies. SnpHub and its tutorial are available at http://guoweilong.github.io/SnpHub/. The wheat-SnpHub-portal website can be accessed at http://wheat.cau.edu.cn/Wheat_SnpHub_Portal/.

Keywords

AnnotationComputer scienceGenome browserGenomicsSet (abstract data type)Web serverGenomeBiologyWorld Wide WebThe InternetGeneticsGene

Affiliated Institutions

Related Publications

Ensembl 2020

Abstract The Ensembl (https://www.ensembl.org) is a system for generating and distributing genome annotation such as genes, variation, regulation and comparative genomics across...

2019 Nucleic Acids Research 1174 citations

Ensembl 2021

Abstract The Ensembl project (https://www.ensembl.org) annotates genomes and disseminates genomic data for vertebrate species. We create detailed and comprehensive annotation of...

2020 Nucleic Acids Research 1694 citations

Publication Info

Year
2020
Type
article
Volume
9
Issue
6
Citations
94
Access
Closed

External Links

Citation Metrics

94
OpenAlex

Cite This

Wenxi Wang, Zihao Wang, Xintong Li et al. (2020). SnpHub: an easy-to-set-up web server framework for exploring large-scale genomic variation data in the post-genomic era with applications in wheat. GigaScience , 9 (6) . https://doi.org/10.1093/gigascience/giaa060

Identifiers

DOI
10.1093/gigascience/giaa060