All Highest Scoring Paths in Weighted Grid Graphs and Their Application to Finding All Approximate Repeats in Strings

1998 SIAM Journal on Computing 141 citations

Abstract

Weighted paths in directed grid graphs of dimension (m X n) can be used to model the string edit problem, which consists of obtaining optimal (weighted) alignments between substrings of A, |A|=m, and substrings of B, |B|=n. We build a data structure (in O(mn log m) time) that supports O(log m) time queries about the weight of any of the O(m2n) best paths from the vertices in column 0 of the graph to all other vertices. Using these techniques we present a simple O(n2 log n) time and $\Theta(n^2)$ space algorithm to find all (the locally optimal) approximate tandem (or nontandem) repeats xy within a string of size n. This improves (by a factor of log n) upon several previous algorithms for this problem and is the first algorithm to find all locally optimal repeats. For edit graphs with weights in {0, -1, 1}, a slight modification of our techniques yields an O(n2) algorithm for the cyclic string comparison problem, as compared to O(n2 log n) for the case of general weights.

Keywords

SubstringCombinatoricsMathematicsBinary logarithmString (physics)Edit distanceString searching algorithmDimension (graph theory)Running timeDiscrete mathematicsData structureAlgorithmComputer science

Related Publications

Publication Info

Year
1998
Type
article
Volume
27
Issue
4
Pages
972-992
Citations
141
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

141
OpenAlex

Cite This

Jeanette P. Schmidt (1998). All Highest Scoring Paths in Weighted Grid Graphs and Their Application to Finding All Approximate Repeats in Strings. SIAM Journal on Computing , 27 (4) , 972-992. https://doi.org/10.1137/s0097539795288489

Identifiers

DOI
10.1137/s0097539795288489