Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Relax: Composable Abstractions for End-to-End Dynamic Machine Learning (2311.02103v1)

Published 1 Nov 2023 in cs.LG, cs.AI, and cs.PL

Abstract: Dynamic shape computations have become critical in modern machine learning workloads, especially in emerging LLMs. The success of these models has driven demand for deploying them to a diverse set of backend environments. In this paper, we present Relax, a compiler abstraction for optimizing end-to-end dynamic machine learning workloads. Relax introduces first-class symbolic shape annotations to track dynamic shape computations globally across the program. It also introduces a cross-level abstraction that encapsulates computational graphs, loop-level tensor programs, and library calls in a single representation to enable cross-level optimizations. We build an end-to-end compilation framework using the proposed approach to optimize dynamic shape models. Experimental results on LLMs show that Relax delivers performance competitive with state-of-the-art hand-optimized systems across platforms and enables deployment of emerging dynamic models to a broader set of environments, including mobile phones, embedded devices, and web browsers.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (19)
  1. Ruihang Lai (9 papers)
  2. Junru Shao (11 papers)
  3. Siyuan Feng (55 papers)
  4. Steven S. Lyubomirsky (1 paper)
  5. Bohan Hou (9 papers)
  6. Wuwei Lin (5 papers)
  7. Zihao Ye (16 papers)
  8. Hongyi Jin (6 papers)
  9. Yuchen Jin (5 papers)
  10. Jiawei Liu (156 papers)
  11. Lesheng Jin (3 papers)
  12. Yaxing Cai (2 papers)
  13. Ziheng Jiang (23 papers)
  14. Yong Wu (56 papers)
  15. Sunghyun Park (38 papers)
  16. Prakalp Srivastava (2 papers)
  17. Jared G. Roesch (1 paper)
  18. Todd C. Mowry (10 papers)
  19. Tianqi Chen (77 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.