Most research into similarity search in metric spaces relies on the triangle inequality property. This property allows the space to be arranged according to relative distances to avoid searching some subspaces. We show that many common metric spaces, notably including those using Euclidean and Jensen-Shannon distances, also have a stronger property, sometimes called the four-point property: In essence, these spaces allow an isometric embedding of any four points in three-dimensional Euclidean space, as well as any three points in two-dimensional Euclidean space. In fact, we show that any space that is isometrically embeddable in Hilbert space has the stronger property. This property gives stronger geometric guarantees, and one in particular, which we name the Hilbert Exclusion property, allows any indexing mechanism which uses hyperplane partitioning to perform better. One outcome of this observation is that a number of state-of-the-art indexing mechanisms over high-dimensional spaces can be easily refined to give a significant increase in performance; furthermore, the improvement given is greater in higher dimensions. This therefore leads to a significant improvement in the cost of metric search in these spaces.

Hilbert exclusion: improved metric search through finite isometric embeddings

Cardillo F A;Vadicamo L;Rabitti F
2017

Abstract

Most research into similarity search in metric spaces relies on the triangle inequality property. This property allows the space to be arranged according to relative distances to avoid searching some subspaces. We show that many common metric spaces, notably including those using Euclidean and Jensen-Shannon distances, also have a stronger property, sometimes called the four-point property: In essence, these spaces allow an isometric embedding of any four points in three-dimensional Euclidean space, as well as any three points in two-dimensional Euclidean space. In fact, we show that any space that is isometrically embeddable in Hilbert space has the stronger property. This property gives stronger geometric guarantees, and one in particular, which we name the Hilbert Exclusion property, allows any indexing mechanism which uses hyperplane partitioning to perform better. One outcome of this observation is that a number of state-of-the-art indexing mechanisms over high-dimensional spaces can be easily refined to give a significant increase in performance; furthermore, the improvement given is greater in higher dimensions. This therefore leads to a significant improvement in the cost of metric search in these spaces.
Campo DC Valore Lingua
dc.authority.ancejournal ACM TRANSACTIONS ON INFORMATION SYSTEMS -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.orgunit Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI -
dc.authority.people Connor R it
dc.authority.people Cardillo F A it
dc.authority.people Vadicamo L it
dc.authority.people Rabitti F it
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI *
dc.contributor.appartenenza.mi 973 *
dc.date.accessioned 2024/02/20 07:30:48 -
dc.date.available 2024/02/20 07:30:48 -
dc.date.issued 2017 -
dc.description.abstracteng Most research into similarity search in metric spaces relies on the triangle inequality property. This property allows the space to be arranged according to relative distances to avoid searching some subspaces. We show that many common metric spaces, notably including those using Euclidean and Jensen-Shannon distances, also have a stronger property, sometimes called the four-point property: In essence, these spaces allow an isometric embedding of any four points in three-dimensional Euclidean space, as well as any three points in two-dimensional Euclidean space. In fact, we show that any space that is isometrically embeddable in Hilbert space has the stronger property. This property gives stronger geometric guarantees, and one in particular, which we name the Hilbert Exclusion property, allows any indexing mechanism which uses hyperplane partitioning to perform better. One outcome of this observation is that a number of state-of-the-art indexing mechanisms over high-dimensional spaces can be easily refined to give a significant increase in performance; furthermore, the improvement given is greater in higher dimensions. This therefore leads to a significant improvement in the cost of metric search in these spaces. -
dc.description.affiliations University of Strathclyde, UK; CNR-ILC, Pisa, Italy; CNR-ISTI, Pisa, Italy; CNR-ISTI, Pisa, Italy -
dc.description.allpeople Connor, R; Cardillo, F A; Vadicamo, L; Rabitti, F -
dc.description.allpeopleoriginal Connor R.; Cardillo F. A.; Vadicamo L.; Rabitti F. -
dc.description.fulltext partially_open en
dc.description.note Online first: dicembre 2016. Pubblicazione print: Vol. 35, Issue 3, giugno 2017 -
dc.description.numberofauthors 4 -
dc.identifier.doi 10.1145/3001583 -
dc.identifier.isi WOS:000395475200002 -
dc.identifier.scopus 2-s2.0-85008222749 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/313924 -
dc.identifier.url http://doi.acm.org/10.1145/3001583 -
dc.language.iso eng -
dc.miur.last.status.update 2025-01-31T10:10:54Z *
dc.relation.firstpage 17 -
dc.relation.issue 3 -
dc.relation.lastpage 27 -
dc.relation.volume 35 -
dc.subject.keywords Similarity search -
dc.subject.keywords Metric space -
dc.subject.keywords Metric indexing -
dc.subject.keywords Four-point property -
dc.subject.keywords Hilbert embedding -
dc.subject.keywords H. Information systems. Data structures -
dc.subject.keywords H. Information systems. Multidimensional range search -
dc.subject.keywords H. Information systems. Proximity search -
dc.subject.keywords H. Information systems. Database query processing -
dc.subject.keywords H. Information systems. Retrieval models and ranking -
dc.subject.keywords Information systems. Retrieval efficiency -
dc.subject.keywords H. Information systems. Multimedia information systems -
dc.subject.keywords F. Theory of computation. Random projections and metric embeddings -
dc.subject.singlekeyword Similarity search *
dc.subject.singlekeyword Metric space *
dc.subject.singlekeyword Metric indexing *
dc.subject.singlekeyword Four-point property *
dc.subject.singlekeyword Hilbert embedding *
dc.subject.singlekeyword H. Information systems. Data structures *
dc.subject.singlekeyword H. Information systems. Multidimensional range search *
dc.subject.singlekeyword H. Information systems. Proximity search *
dc.subject.singlekeyword H. Information systems. Database query processing *
dc.subject.singlekeyword H. Information systems. Retrieval models and ranking *
dc.subject.singlekeyword Information systems. Retrieval efficiency *
dc.subject.singlekeyword H. Information systems. Multimedia information systems *
dc.subject.singlekeyword F. Theory of computation. Random projections and metric embeddings *
dc.title Hilbert exclusion: improved metric search through finite isometric embeddings en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 363052 -
iris.isi.extIssued 2017 -
iris.isi.extTitle Hilbert Exclusion: Improved Metric Search through Finite Isometric Embeddings -
iris.mediafilter.data 2025/03/28 03:34:15 *
iris.orcid.lastModifiedDate 2025/03/13 01:12:43 *
iris.orcid.lastModifiedMillisecond 1741824763232 *
iris.scopus.extIssued 2016 -
iris.scopus.extTitle Hilbert exclusion: Improved metric search through finite isometric embeddings -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost repository *
iris.unpaywall.bestoaversion submittedVersion *
iris.unpaywall.doi 10.1145/3001583 *
iris.unpaywall.hosttype repository *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://openportal.isti.cnr.it/doc?id=people______::221755482928b2d2aea325dee9a54a7c *
iris.unpaywall.license other-oa *
iris.unpaywall.metadataCallLastModified 26/04/2026 07:30:27 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1777181427043 -
iris.unpaywall.oastatus green *
isi.authority.ancejournal ACM TRANSACTIONS ON INFORMATION SYSTEMS###1046-8188 *
isi.category ET *
isi.contributor.affiliation University of Strathclyde -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.country Scotland -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Richard -
isi.contributor.name Franco Alberto -
isi.contributor.name Lucia -
isi.contributor.name Fausto -
isi.contributor.researcherId DVM-3552-2022 -
isi.contributor.researcherId AAP-5764-2021 -
isi.contributor.researcherId P-5138-2018 -
isi.contributor.researcherId DNK-4218-2022 -
isi.contributor.subaffiliation Dept Comp & Informat Sci -
isi.contributor.subaffiliation Inst Computat Linguist ILC -
isi.contributor.subaffiliation Informat Sci & Technol Inst ISTI -
isi.contributor.subaffiliation Informat Sci & Technol Inst ISTI -
isi.contributor.surname Connor -
isi.contributor.surname Cardillo -
isi.contributor.surname Vadicamo -
isi.contributor.surname Rabitti -
isi.date.issued 2017 *
isi.description.abstracteng Most research into similarity search in metric spaces relies on the triangle inequality property. This property allows the space to be arranged according to relative distances to avoid searching some subspaces. We show that many common metric spaces, notably including those using Euclidean and Jensen-Shannon distances, also have a stronger property, sometimes called the four-point property: In essence, these spaces allow an isometric embedding of any four points in three-dimensional Euclidean space, as well as any three points in two-dimensional Euclidean space. In fact, we show that any space that is isometrically embeddable inHilbert space has the stronger property. This property gives stronger geometric guarantees, and one in particular, which we name the Hilbert Exclusion property, allows any indexing mechanism which uses hyperplane partitioning to perform better. One outcome of this observation is that a number of state-of-the-art indexing mechanisms over high-dimensional spaces can be easily refined to give a significant increase in performance; furthermore, the improvement given is greater in higher dimensions. This therefore leads to a significant improvement in the cost of metric search in these spaces. *
isi.description.allpeopleoriginal Connor, R; Cardillo, FA; Vadicamo, L; Rabitti, F; *
isi.document.sourcetype WOS.SCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.1145/3001583 *
isi.identifier.eissn 1558-2868 *
isi.identifier.isi WOS:000395475200002 *
isi.journal.journaltitle ACM TRANSACTIONS ON INFORMATION SYSTEMS *
isi.journal.journaltitleabbrev ACM T INFORM SYST *
isi.language.original English *
isi.publisher.place 2 PENN PLAZA, STE 701, NEW YORK, NY 10121-0701 USA *
isi.relation.issue 3 *
isi.relation.volume 35 *
isi.title Hilbert Exclusion: Improved Metric Search through Finite Isometric Embeddings *
scopus.authority.ancejournal ACM TRANSACTIONS ON INFORMATION SYSTEMS###1046-8188 *
scopus.category 1710 *
scopus.category 1400 *
scopus.category 1706 *
scopus.contributor.affiliation University of Strathclyde -
scopus.contributor.affiliation National Research Council of Italy (CNR) -
scopus.contributor.affiliation National Research Council of Italy (CNR) -
scopus.contributor.affiliation National Research Council of Italy (CNR) -
scopus.contributor.afid 60024724 -
scopus.contributor.afid 60021199 -
scopus.contributor.afid 60085207 -
scopus.contributor.afid 60085207 -
scopus.contributor.auid 7102097432 -
scopus.contributor.auid 57191090133 -
scopus.contributor.auid 56419939400 -
scopus.contributor.auid 6602132101 -
scopus.contributor.country United Kingdom -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 113500075 -
scopus.contributor.dptid 104078586 -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Richard -
scopus.contributor.name Franco Alberto -
scopus.contributor.name Lucia -
scopus.contributor.name Fausto -
scopus.contributor.subaffiliation Department of Computer and Information Sciences; -
scopus.contributor.subaffiliation Institute for Computational Linguistics (ILC); -
scopus.contributor.subaffiliation Information Science and Technology Institute (ISTI); -
scopus.contributor.subaffiliation Information Science and Technology Institute (ISTI); -
scopus.contributor.surname Connor -
scopus.contributor.surname Cardillo -
scopus.contributor.surname Vadicamo -
scopus.contributor.surname Rabitti -
scopus.date.issued 2016 *
scopus.description.abstracteng Most research into similarity search in metric spaces relies on the triangle inequality property. This property allows the space to be arranged according to relative distances to avoid searching some subspaces. We show that many common metric spaces, notably including those using Euclidean and Jensen-Shannon distances, also have a stronger property, sometimes called the four-point property: In essence, these spaces allow an isometric embedding of any four points in three-dimensional Euclidean space, as well as any three points in two-dimensional Euclidean space. In fact, we show that any space that is isometrically embeddable in Hilbert space has the stronger property. This property gives stronger geometric guarantees, and one in particular, which we name the Hilbert Exclusion property, allows any indexing mechanism which uses hyperplane partitioning to perform better. One outcome of this observation is that a number of state-of-the-art indexing mechanisms over high-dimensional spaces can be easily refined to give a significant increase in performance; furthermore, the improvement given is greater in higher dimensions. This therefore leads to a significant improvement in the cost of metric search in these spaces. *
scopus.description.allpeopleoriginal Connor R.; Cardillo F.A.; Vadicamo L.; Rabitti F. *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.date.issued *
scopus.document.type ar *
scopus.document.types ar *
scopus.funding.funders 501100000266 - Engineering and Physical Sciences Research Council; *
scopus.funding.ids EP/G012407/1; *
scopus.identifier.doi 10.1145/3001583 *
scopus.identifier.eissn 1558-2868 *
scopus.identifier.pui 613967738 *
scopus.identifier.scopus 2-s2.0-85008222749 *
scopus.journal.sourceid 18997 *
scopus.language.iso eng *
scopus.publisher.name Association for Computing Machinery *
scopus.relation.article 17 *
scopus.relation.issue 3 *
scopus.relation.volume 35 *
scopus.subject.keywords Four-point property; Hilbert embedding; Metric indexing; Metric Space; Similarity search; *
scopus.title Hilbert exclusion: Improved metric search through finite isometric embeddings *
scopus.titleeng Hilbert exclusion: Improved metric search through finite isometric embeddings *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
File Dimensione Formato  
prod_363052-doc_119660.pdf

solo utenti autorizzati

Descrizione: Hilbert exclusion: improved metric search through finite isometric embeddings
Tipologia: Versione Editoriale (PDF)
Dimensione 1.26 MB
Formato Adobe PDF
1.26 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
prod_363052-doc_166091.pdf

accesso aperto

Descrizione: Hilbert exclusion: improved metric search through finite isometric embeddings
Tipologia: Versione Editoriale (PDF)
Dimensione 2.38 MB
Formato Adobe PDF
2.38 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/313924
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 25
  • ???jsp.display-item.citation.isi??? 23
social impact