Citation Connor R, MacKenzie-Leigh S & Moss R (2014) High dimensional search using polyhedral query. In: Juci Machado Traina A, Traina Jr. C & Leonardo Ferreira Cordeiro R (eds.) Similarity Search and Applications. SISAP 2014. Lecture Notes in Computer Science, 8821. 7th International Conference, SISAP 2014: Similarity Search and Applications, 29.10.2014-31.10.2014. Los Cabos, Mexico: Springer, pp. 176-188. https://doi.org/10.1007/978-3-319-11988-5_16
Abstract It is well known that, as the dimensionality of a metric space increases, metric search techniques become less effective and the cost of indexing mechanisms becomes greater than the saving they give. This is due to the so-called curse of dimensionality.
One effect of increasing dimensionality is that the ratio of unit hypersphere to unit hypercube volume decreases rapidly, making the solution to a similarity query (the query ball, or hypersphere) ever more difficult to identify by using metric invariants such as triangle inequality.
In this paper we take a different approach, by identifying points within a query polyhedron rather than a ball. We show how this can be achieved by constructing a surrogate metric space, such that a query ball in the surrogate space corresponds to a polyhedron in the original space. If the polyhedron contains the ball, the overall cost of the query is likely to be increased in high dimensions; however, we show that shrinking the polyhedron can capture a surprisingly high proportion of the points within the ball, whilst at the same time giving a more efficient, and more scalable, search.
We show results which confirm our underlying hypothesis. In some cases we can retrieve significant volumes of query results from spaces which are otherwise intractable.
Journal Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Connor, Richard; MacKenzie-Leigh, Stewart; Moss, Robert