Solving Some Sequence Problems on Run-length Encoded Strings: Longest Common Subsequences, Edit Distances, and Squares - Jia-jie Liu Y. L. Wang G. S. Huang - Books - VDM Verlag - 9783639022650 - May 16, 2008

Solving Some Sequence Problems on Run-length Encoded Strings: Longest Common Subsequences, Edit Distances, and Squares

Jia-jie Liu, Y. L. Wang, G. S. Huang


Measuring the similarity or difference between two strings is a fundamental problem in many applications. In bioinformatics, one has to predict the structures of RNA and proteins, classify the functions of molecules, infer the phylogeny of organisms, and search entries in huge sequence databases. When processing electronic documents, one needs fast and flexible indexing techniques to perform searches. Many measures have been defined for this purpose. The longest common subsequence and the edit distance are among the most studied problems in string processing. In this book, we propose an O(min{mN, Mn}) time algorithm for finding a longest common subsequence of strings X and Y with lengths m and n, respectively, and run-length-encoded lengths M and N, respectively. We also improve the time bound for finding the edit distance between X and Y to O(min{mN, Mn}). Squares play a central role in word combinatorics and in applications. We show how to locate all squares in a run-length encoded string in O(N log N) time. This time complexity is optimal and is independent of the length of the original uncompressed string.
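
To make the quantities m, n, M and N in the description concrete, here is a minimal Python sketch. It is not the algorithm from the book: it only run-length encodes two example strings and computes the LCS length with the classical dynamic program over the uncompressed strings, which takes O(mn) time. The function names and the example strings are assumptions chosen for illustration.

```python
from itertools import groupby


def run_length_encode(s):
    """Return the runs of s as (character, run length) pairs."""
    return [(ch, len(list(group))) for ch, group in groupby(s)]


def lcs_length(x, y):
    """Classical LCS dynamic program on the uncompressed strings.

    This is the textbook O(m * n) baseline, NOT the run-length based
    algorithm described in the book.
    """
    prev = [0] * (len(y) + 1)
    for a in x:
        curr = [0]
        for j, b in enumerate(y, start=1):
            if a == b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


# Hypothetical example strings.
X = "aaaabbbcc"
Y = "aabbbbc"

m, n = len(X), len(Y)                 # uncompressed lengths
M = len(run_length_encode(X))         # number of runs in X
N = len(run_length_encode(Y))         # number of runs in Y

print(run_length_encode(X))           # [('a', 4), ('b', 3), ('c', 2)]
print(m, n, M, N)                     # 9 7 3 3
print(lcs_length(X, Y))               # 6
print(min(m * N, M * n), m * n)       # 21 63
```

For these strings, min{mN, Mn} is 21 while mn is 63. The gap between the two widens as the runs get longer, which is the setting the stated bound targets.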

Media Books, Paperback Book (book with soft cover and glued back)
Released May 16, 2008
ISBN13 9783639022650
Publishers VDM Verlag
Pages 78
Weight 113 g
Language English