MMseqs software suite for fast and deep clustering and searching of large protein sequence sets
Abstract Motivation: Sequence databases are growing fast, challenging existing analysis pipelines. Reducing the redundancy of sequence databases by similarity clustering improve...