Enhancing Xml Data Retrieval Performance with Clustering and Indexing

Wanjari Ravindra Shankar, F. Rahman

Published: Dec 31, 2023

Keywords:

Similarity, Indexing, Retrieval, Database, Server

Abstract

Using SQL Server 2005 and Berkeley DBXML (BDBXML) as case studies, this research assesses how well a similarity-based clustering method supports XML data retrieval. In SQL Server 2005, combining clustering with indexing speeds up retrieval: applying both reduces the retrieval time for 10,000 items from 0.88 seconds to 0.46 seconds. In BDBXML, the most efficient retrieval for 10,000 entries, obtained with both clustering and indexing, took 2.389 seconds. The results indicate that indexing and clustering improve retrieval performance, and the study reports that BDBXML retrieves faster than SQL Server 2005 on large datasets. Overall, the study shows that clustering combined with indexing is an effective way to improve XML data retrieval for the large datasets used in similarity-based clustering tasks.

How to Cite

Wanjari Ravindra Shankar, F. Rahman. (2023). Enhancing Xml Data Retrieval Performance with Clustering and Indexing. International Journal on Recent and Innovation Trends in Computing and Communication, 11(11), 1295–1302. Retrieved from https://www.ijritcc.org/index.php/ijritcc/article/view/11371