Online Index Recommendations for High-Dimensional Databases Using Query Workloads

Abstract

High-dimensional databases pose a challenge with respect to efficient access. High-dimensional indexes do not work because of the often-cited “curse of dimensionality. However, users are usually interested in querying data over a relatively small subset of the entire attribute set at a time.

A potential solution is to use lower dimensional indexes that accurately represent the user access patterns. A query response using the physical database design that is developed based on a static snapshot of the query workload may significantly degrade if the query patterns change.

To address these issues, we introduce a parameterizable technique to recommend indexes based on index types that are frequently used for high-dimensional data sets and to dynamically adjust indexes as the underlying query workload changes.

We incorporate a query pattern change detection mechanism to determine when the access patterns have changed enough to warrant change in the physical database design. By adjusting analysis parameters,

We trade off analysis speed against analysis resolution. We perform experiments with a number of data sets, query sets, and parameters to show the effect that varying these characteristics has on analysis results. Online Index Recommendations for High-Dimensional Databases Using Query Workloads

Hardware Requirements
  • SYSTEM : Pentium IV 2.4 GHz
  • HARD DISK : 40 GB
  • FLOPPY DRIVE : 1.44 MB
  • MONITOR : 15 VGA colour
  • MOUSE : Logitech.
  • RAM : 256 MB
  • KEYBOARD            : 110 keys enhanced.
Software Requirements
  • Front End :J2EE
  • Back End :MS SQL
Existing System:
  • Query response does not perform well if query patterns change.
  • Because it uses static query workload.
  • Its performance may degrade if the database size gets increased.
  • Tradition feature selection technique may offer less or no data pruning capability given query attributes.
Proposed System:
  • We develop a flexible index selection frame work to achieve index selection for high dimensional data.
  • A control feedback technique is introduced for measuring the performance.
  • Through this a database could benefit from an index change.
  • The index selection minimizes the cost of the queries in the work load.
  • Online index selection is designed in the motivation if the query pattern changes over time.
  • By monitoring the query workload and detecting when there is a change on the query pattern, able to evolve good performance as query patterns evolve.

Related Post