Memory-efficient key/Foreign-key join size estimation via multiplicity and intersection size
Müller, Magnus
;
Flachs, Daniel
;
Moerkotte, Guido
DOI:
|
https://doi.org/10.1109/ICDE51399.2021.00090
|
URL:
|
https://store.computer.org/csdl/proceedings-articl...
|
Weitere URL:
|
https://www.researchgate.net/publication/352675505...
|
Dokumenttyp:
|
Konferenzveröffentlichung
|
Erscheinungsjahr:
|
2021
|
Buchtitel:
|
2021 IEEE 37th International Conference on Data Engineering (ICDE)
|
Seitenbereich:
|
984-995
|
Veranstaltungstitel:
|
ICDE 2021
|
Veranstaltungsort:
|
Online
|
Veranstaltungsdatum:
|
19.-22.04.2021
|
Herausgeber:
|
Ailamaki, Anastasia
;
Garofalakis, Minos N.
|
Ort der Veröffentlichung:
|
Chania
|
Verlag:
|
IEEE
|
ISBN:
|
78-1-7281-9185-0 , 978-1-7281-9184-3
|
ISSN:
|
1063-6382 , 2375-026X
|
Verwandte URLs:
|
|
Sprache der Veröffentlichung:
|
Englisch
|
Einrichtung:
|
Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik > Practical Computer Science III (Moerkotte 1996-)
|
Fachgebiet:
|
004 Informatik
|
Freie Schlagwörter (Englisch):
|
query processing , conferences , estimation , data structures , data engineering
|
Abstract:
|
Join size estimation plays a crucial role in query optimization. In this paper, we present a technique to estimate the size of a key/foreign-key join of two filtered relations. We build on a model by Allen Van Gelder, in which there is no notion of join selectivity. Instead, the size of a join is estimated as a multiple of the intersection size of the join attributes. We present both a data structure to approximate the number of distinct values in a join attribute after a filter operation, and formulas to estimate the factor by which a join size exceeds the intersection size. In addition, we evaluate three existing intersection size estimation methods that are based on HyperLogLog sketches, to which our approach is closely linked. For both real-world and generated data sets, our estimator competes well, in terms of accuracy and memory footprint, against several industry-strength and state-of-the-art join size estimation methods. In particular, our experiments indicate that our approach is less prone to heavy underestimates.
|
| Dieser Eintrag ist Teil der Universitätsbibliographie. |
Suche Autoren in
Sie haben einen Fehler gefunden? Teilen Sie uns Ihren Korrekturwunsch bitte hier mit: E-Mail
Actions (login required)
|
Eintrag anzeigen |
|
|