Abstract
Approximate query processing (AQP) plays a critical role in modern data analytics. Although machine learning models are used for AQP, existing methods fail to uncover implicit relationships among the underlying data, the aggregate functions in queries, and the query predicates. In this work, we propose a Graph REpresentation Learning-based AQP model (GRELA for short) for answering queries with multiple aggregate functions. GRELA models the aggregate functions and the query predicates as task and clause nodes respectively in a graph and then learns appropriate node representations via its two modules. In particular, the Encoder module coalesces query predicates and underlying data into the representations of clause nodes. The Graph module bridges task nodes and clause nodes such that each task node can aggregate the information from its neighborhood into its representation. Through the inner products of clause and task representations, GRELA is able to make accurate estimates for queries with multiple aggregate functions. Extensive experimental results verify that GRELA outperforms the state-of-the-art AQP methods on different kinds of datasets.
Originalsprog | Engelsk |
---|---|
Artikelnummer | 35 |
Tidsskrift | VLDB Journal |
Vol/bind | 34 |
Udgave nummer | 3 |
Antal sider | 26 |
ISSN | 1066-8888 |
DOI | |
Status | Udgivet - 24 apr. 2025 |
Emneord
- AQP
- Graph representation learning