Similarity measurement of XML documents based on structure and contents

Tae Soon Kim, Ju Hong Lee, Jae Won Song, Deok Hwan Kim

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

Research on similarity measures for XML documents is ongoing, with the aim of managing and retrieving diverse XML documents effectively. Most previous work proposes similarity measures that consider only the tag structure of XML documents, and as a result they miscalculate the semantic similarity of XML content. In this paper, we propose a new similarity measure that considers not only the structural information of tags in XML documents but also the semantic information of the tags and the text content associated with them. Our experiments demonstrate that the proposed method improves the accuracy of the similarity measurement compared with previous work.
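To make the idea in the abstract concrete, the following is a minimal sketch of how structural, tag-semantic, and text-content signals could be combined into one score. It is not the method from the paper, whose details are not given here. Everything in it is an assumption made for illustration: the `TAG_SYNONYMS` table (a stand-in for a real thesaurus lookup), the use of Jaccard overlap on root-to-node tag paths for structure, cosine similarity on word counts for content, and the weight `alpha`.

```python
import math
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical synonym table standing in for a real thesaurus lookup.
TAG_SYNONYMS = {"writer": "author", "name": "title"}


def normalize_tag(tag):
    """Map a tag to a canonical name so synonymous tags compare equal."""
    tag = tag.lower()
    return TAG_SYNONYMS.get(tag, tag)


def tag_paths(root):
    """Collect the set of root-to-node tag paths, using canonical tag names."""
    paths = set()

    def walk(node, prefix):
        path = prefix + "/" + normalize_tag(node.tag)
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def term_counts(root):
    """Count lowercase word occurrences across all text in the document."""
    text = " ".join(root.itertext())
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard overlap of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cosine(a, b):
    """Cosine similarity of two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def xml_similarity(xml_a, xml_b, alpha=0.5):
    """Weighted mix of structural (tag path) and content (text) similarity."""
    root_a, root_b = ET.fromstring(xml_a), ET.fromstring(xml_b)
    structure = jaccard(tag_paths(root_a), tag_paths(root_b))
    content = cosine(term_counts(root_a), term_counts(root_b))
    return alpha * structure + (1 - alpha) * content


doc_a = "<book><title>XML similarity</title><author>Kim</author></book>"
doc_b = "<book><name>XML similarity</name><writer>Kim</writer></book>"
doc_c = "<movie><director>Lee</director></movie>"
score_ab = xml_similarity(doc_a, doc_b)
score_ac = xml_similarity(doc_a, doc_c)
```

In this sketch `doc_a` and `doc_b` use different tag names for the same fields, so a purely literal tag comparison would treat them as structurally different, while the synonym mapping lets them score as near-identical. `doc_c` shares neither tag paths nor words with `doc_a` and scores zero.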

Original language: English
Title of host publication: Computational Science - ICCS 2007 - 7th International Conference, Proceedings
Publisher: Springer Verlag
Pages: 902-905
Number of pages: 4
Edition: PART 3
ISBN (Print): 9783540725879
DOIs
State: Published - 2007
Event: 7th International Conference on Computational Science, ICCS 2007 - Beijing, China
Duration: 27 May 2007 to 30 May 2007

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 3
Volume: 4489 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 7th International Conference on Computational Science, ICCS 2007
Country/Territory: China
City: Beijing
Period: 27/05/07 to 30/05/07
