A NEAR-OPTIMAL PARALLEL ALGORITHM FOR JOINING BINARY RELATIONS

Bas Ketsman, Dan Suciu, Yufei Tao

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)
28 Downloads (Pure)

Abstract

We present a constant-round algorithm in the massively parallel computation (MPC) model for evaluating a natural join where every input relation has two attributes. Our algorithm achieves a load ofÕ(m/p1/ρ) where m is the total size of the input relations, p is the number of machines, ρ is the join’s fractional edge covering number, and Õ(.) hides a polylogarithmic factor. The load matches a known lower bound up to a polylogarithmic factor. At the core of the proposed algorithm is a new theorem (which we name the isolated cartesian product theorem) that provides fresh insight into the problem’s mathematical structure. Our result implies that the subgraph enumeration problem, where the goal is to report all the occurrences of a constant-sized subgraph pattern, can be settled optimally (up to a polylogarithmic factor) in the MPC model.

Original languageEnglish
Pages (from-to)6:1-6:22
Number of pages <span style="color:red"p> <font size="1.5"> ✽ </span> </font>22
JournalLogical Methods in Computer Science
Volume18
Issue number2
DOIs
Publication statusPublished - 1 Jan 2022

Keywords

  • Conjunctive Queries
  • Database Theory
  • Joins
  • Parallel Computing

Fingerprint

Dive into the research topics of 'A NEAR-OPTIMAL PARALLEL ALGORITHM FOR JOINING BINARY RELATIONS'. Together they form a unique fingerprint.

Cite this