Similarity search for multidimensional data sequences

Seok Lyong Lee, Seok Ju Chun, Deok Hwan Kim, Ju Hong Lee, Chin Wan Chung

Research output: Contribution to conferencePaperpeer-review

127 Scopus citations

Abstract

Time-series data, which are a series of one-dimensional real numbers, have been studied in various database applications. In this paper, we extend the traditional similarity search methods on time-series data to support a multidimensional data sequence, such as a video stream. We investigate the problem of retrieving similar multidimensional data sequences from a large database. To prune irrelevant sequences in a database, we introduce correct and efficient similarity functions. Both data sequences and query sequences are partitioned into subsequences, and each of them is represented by a Minimum Bounding Rectangle (MBR). The query processing is based upon these MBRs, instead of scanning data elements of entire sequences. Our method is designed (1) to select candidate sequences in a database, and (2) to find the subsequences of a selected sequence, each of which falls under the given threshold. The latter is of special importance in the case of retrieving subsequences from large and complex sequences such as video. By using it, we do not need to browse the whole of the selected video stream, but just browse the substreams to find a scene we want. We have performed an extensive experiment on synthetic, as well as real data sequences (a collection of TV news, dramas, and documentary videos) to evaluate our proposed method. The experiment demonstrates that 73-94 percent of irrelevant sequences are pruned using the proposed method, resulting in 16-28 times faster response time compared with that of the sequential search.

Original languageEnglish
Pages599-608
Number of pages10
StatePublished - 2000
Externally publishedYes
Event2000 IEEE 16th International Conference on Data Engineering (ICDE'00) - San Diego, CA, USA
Duration: 29 Feb 20003 Mar 2000

Conference

Conference2000 IEEE 16th International Conference on Data Engineering (ICDE'00)
CitySan Diego, CA, USA
Period29/02/003/03/00

Fingerprint

Dive into the research topics of 'Similarity search for multidimensional data sequences'. Together they form a unique fingerprint.

Cite this