Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 147 tok/s
Gemini 2.5 Pro 53 tok/s Pro
GPT-5 Medium 29 tok/s Pro
GPT-5 High 33 tok/s Pro
GPT-4o 120 tok/s Pro
Kimi K2 221 tok/s Pro
GPT OSS 120B 449 tok/s Pro
Claude Sonnet 4.5 38 tok/s Pro
2000 character limit reached

GraphMatch: Subgraph Query Processing on FPGAs (2402.17559v1)

Published 27 Feb 2024 in cs.DB and cs.AR

Abstract: Efficiently finding subgraph embeddings in large graphs is crucial for many application areas like biology and social network analysis. Set intersections are the predominant and most challenging aspect of current join-based subgraph query processing systems for CPUs. Previous work has shown the viability of utilizing FPGAs for acceleration of graph and join processing. In this work, we propose GraphMatch, the first genearl-purpose stand-alone subgraph query processing accelerator based on worst-case optimal joins (WCOJ) that is fully designed for modern, field programmable gate array (FPGA) hardware. For efficient processing of various graph data sets and query graph patterns, it leverages a novel set intersection approach, called AllCompare, tailor-made for FPGAs. We show that this set intersection approach efficiently solves multi-set intersections in subgraph query processing, superior to CPU-based approaches. Overall, GraphMatch achieves a speedup of over 2.68x and 5.16x, compared to the state-of-the-art systems GraphFlow and RapidMatch, respectively.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (27)
  1. EmptyHeaded: A Relational Engine for Graph Processing. ACM Trans. Database Syst. 42, 4 (2017), 20:1–20:44.
  2. CECI: Compact Embedding Cluster Index for Scalable Subgraph Matching. In SIGMOD. ACM, 1447–1462.
  3. Efficient Subgraph Matching by Postponing Cartesian Products. In SIGMOD. ACM, 1199–1214.
  4. ForeGraph: Exploring Large-scale Graph Processing on Multi-FPGA Architecture. In FPGA. 217–226.
  5. GraphScale: Scalable Bandwidth-Efficient Graph Processing on FPGAs. In FPL. IEEE, 24–32.
  6. Non-relational Databases on FPGAs: Survey, Design Decisions, Challenges. ACM Comput. Surv. 55, 11 (2023), 225:1–225:37.
  7. PipeJSON: Parsing JSON at Line Speed on FPGAs. In DaMoN. 3:1–3:7.
  8. Adopting Worst-Case Optimal Joins in Relational Database Systems. PVLDB 13, 11 (2020), 1891–1904.
  9. EdgeFrame: Worst-Case Optimal Joins for Graph-Pattern Matching in Spark. In GRADES-NDA, Akhil Arora, Semih Salihoglu, and Nikolay Yakovets (Eds.). ACM, 4:1–4:11.
  10. Efficient Subgraph Matching: Harmonizing Dynamic Programming, Adaptive Matching Order, and Failing Set Together. In SIGMOD. ACM, 1429–1446.
  11. Speeding Up Set Intersections in Graph Algorithms using SIMD Instructions. In SIGMOD. ACM, 1587–1602.
  12. FAST: FPGA-based Subgraph Matching on Massive Graphs. In 37th IEEE International Conference on Data Engineering, ICDE 2021, Chania, Greece, April 19-22, 2021. IEEE, 1452–1463. https://doi.org/10.1109/ICDE51399.2021.00129
  13. Bandwidth-optimal Relational Joins on FPGAs. In EDBT. 1:27–1:39.
  14. An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases. Proc. VLDB Endow. 6, 2 (2012), 133–144.
  15. Jure Leskovec and Andrej Krevl. 2014. SNAP Datasets: Stanford Large Network Dataset Collection. http://snap.stanford.edu/data.
  16. GraphZero: A High-Performance Subgraph Matching System. ACM SIGOPS Oper. Syst. Rev. 55, 1 (2021), 21–37.
  17. Amine Mhedhbi and Semih Salihoglu. 2019. Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins. PVLDB 12, 11 (2019), 1692–1704.
  18. Worst-case Optimal Join Algorithms. J. ACM 65, 3 (2018), 16:1–16:40.
  19. Skew strikes back: new developments in the theory of join algorithms. SIGMOD Rec. 42, 4 (2013), 5–16.
  20. Efficient estimation of graphlet frequency distributions in protein-protein interaction networks. Bioinform. 22, 8 (2006), 974–980.
  21. Ryan A. Rossi and Nesreen K. Ahmed. 2015. The Network Data Repository with Interactive Graph Analytics and Visualization. http://networkrepository.com. In AAAI.
  22. The ubiquity of large graphs and surprising challenges of graph processing: extended survey. VLDB J. 29, 2-3 (2020), 595–618.
  23. New specifications for exponential random graph models. Sociological methodology 36, 1 (2006), 99–153.
  24. Shixuan Sun and Qiong Luo. 2020. In-Memory Subgraph Matching: An In-depth Study. In SIGMOD. ACM, 1083–1098.
  25. RapidMatch: A Holistic Approach to Subgraph Query Processing. Proc. VLDB Endow. 14, 2 (2020), 176–188.
  26. Julian R. Ullmann. 1976. An Algorithm for Subgraph Isomorphism. J. ACM 23, 1 (1976), 31–42.
  27. Todd L. Veldhuizen. 2012. Leapfrog Triejoin: a worst-case optimal join algorithm. CoRR abs/1210.0481 (2012).

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.