A 3D Parallel Algorithm for QR Decomposition (1805.05278v1)

Published 14 May 2018 in cs.DC

Abstract: Interprocessor communication often dominates the runtime of large matrix computations. We present a parallel algorithm for computing QR decompositions whose bandwidth cost (communication volume) can be decreased at the cost of increasing its latency cost (number of messages). By varying a parameter to navigate the bandwidth/latency tradeoff, we can tune this algorithm for machines with different communication costs.
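The bandwidth/latency tradeoff described in the abstract can be illustrated with the standard alpha-beta communication model, in which each message costs a fixed latency `alpha` and each word sent costs `beta`. The sketch below is only a toy: the function `toy_costs`, its tuning parameter `t`, and the constants in it are hypothetical and are not the cost bounds from the paper. It only shows how the best parameter setting might shift between a high-latency machine and a low-latency one.

```python
def comm_time(messages, words, alpha, beta):
    """Alpha-beta model: alpha seconds per message, beta seconds per word."""
    return alpha * messages + beta * words


def toy_costs(t):
    """Hypothetical tradeoff (NOT the paper's bounds).

    Raising t sends fewer words in total but uses more messages.
    """
    messages = 100 * t
    words = 1_000_000 / t
    return messages, words


def best_setting(alpha, beta, candidates):
    """Pick the candidate t with the lowest modeled communication time."""
    return min(candidates, key=lambda t: comm_time(*toy_costs(t), alpha, beta))


candidates = [1, 2, 4, 8, 16, 32, 64]

# Machine where each message is expensive: prefer few messages (small t).
high_latency = best_setting(alpha=1e-3, beta=1e-9, candidates=candidates)

# Machine where messages are cheap relative to volume: prefer larger t.
low_latency = best_setting(alpha=1e-7, beta=1e-8, candidates=candidates)

print(high_latency, low_latency)
```

Under these made-up constants the high-latency machine favors the smallest `t`, while the low-latency machine favors a larger one, which is the kind of machine-dependent tuning the abstract refers to.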

Authors (5)
  1. Grey Ballard (36 papers)
  2. James Demmel (54 papers)
  3. Laura Grigori (30 papers)
  4. Mathias Jacquelin (14 papers)
  5. Nicholas Knight (5 papers)
Citations (4)
