OMPGPT: A Generative Pre-trained Transformer Model for OpenMP (2401.16445v3)

Published 28 Jan 2024 in cs.SE, cs.DC, and cs.LG

Abstract: LLMs such as ChatGPT have significantly advanced the field of NLP. This trend led to the development of code-based LLMs such as StarCoder, WizardCoder, and CodeLlama, which are trained extensively on vast repositories of code and programming languages. While the generic abilities of these code LLMs are useful for many programmers in tasks like code generation, the area of high-performance computing (HPC) has a narrower set of requirements that make a smaller and more domain-specific model a smarter choice. This paper presents OMPGPT, a novel domain-specific model meticulously designed to harness the inherent strengths of LLMs for OpenMP pragma generation. Furthermore, we leverage prompt engineering techniques from the NLP domain to create Chain-of-OMP, an innovative strategy designed to enhance OMPGPT's effectiveness. Our extensive evaluations demonstrate that OMPGPT outperforms existing LLMs specialized in OpenMP tasks and maintains a notably smaller size, aligning it more closely with the typical hardware constraints of HPC environments. We consider our contribution a pivotal bridge connecting the advantages of LLMs with the specific demands of HPC tasks.
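
For readers who want a concrete picture of the task, the sketch below shows what OpenMP pragma generation with a small causal code LLM might look like in practice. It is a minimal illustration under stated assumptions, not the authors' released pipeline: the checkpoint name, prompt format, and decoding settings here are placeholders chosen for this example, and Chain-of-OMP layers additional prompt-engineering steps on top of a basic completion like this.

```python
# Hypothetical sketch: asking a small causal code LLM to suggest an OpenMP
# pragma for a loop. The checkpoint name and prompt format are assumptions
# for illustration; they are not the OMPGPT artifacts described in the paper.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "your-org/small-code-lm"  # placeholder; substitute a real code LLM checkpoint

loop_source = """for (int i = 0; i < n; i++) {
    c[i] = a[i] + b[i];
}"""

# Frame the task as text completion: the model continues after "#pragma omp".
prompt = (
    "// Add an OpenMP pragma to parallelize the following loop:\n"
    f"{loop_source}\n"
    "#pragma omp"
)

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=32,                     # pragmas are short, so a small budget suffices
    do_sample=False,                       # greedy decoding for a deterministic suggestion
    pad_token_id=tokenizer.eos_token_id,
)

# Keep only the newly generated tokens and the first line of the completion.
completion = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print("#pragma omp" + completion.split("\n")[0])  # e.g. "#pragma omp parallel for"
```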

References (30)
  1. OpenMP Offload Features and Strategies for High Performance across Architectures and Compilers. In 2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). 564–573.
  2. GPT-NeoX-20B: An Open-Source Autoregressive Language Model. arXiv:2204.06745 [cs.CL]
  3. GPT-Neo: Large Scale Autoregressive Language Modeling with Mesh-Tensorflow. https://doi.org/10.5281/zenodo.5297715
  4. CompCodeVet: A Compiler-guided Validation and Enhancement Approach for Code Dataset. https://doi.org/10.48550/arXiv.2311.06505 arXiv:2311.06505 [cs]
  5. LM4HPC: Towards effective language model application in high-performance computing. In International Workshop on OpenMP. Springer, 18–33.
  6. Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation. Proceedings of Machine Learning and Systems 5 (2023).
  7. Evaluating Large Language Models Trained on Code. arXiv:2107.03374 [cs.LG]
  8. L. Dagum and R. Menon. 1998. OpenMP: an industry standard API for shared-memory programming. IEEE Computational Science and Engineering 5, 1 (1998), 46–55. https://doi.org/10.1109/99.660313
  9. HPC-GPT: Integrating Large Language Model for High-Performance Computing. In Proceedings of the SC ’23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis (Denver, CO, USA). ACM, 951–960. https://doi.org/10.1145/3624062.3624172
  10. Performance Optimization using Multimodal Modeling and Heterogeneous GNN. In Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing. 45–57.
  11. Power Constrained Autotuning using Graph Neural Networks. arXiv preprint arXiv:2302.11467 (2023).
  12. Sidong Feng and Chunyang Chen. [n. d.]. Prompting Is All You Need: Automated Android Bug Replay with Large Language Models. arXiv. https://doi.org/10.48550/arXiv.2306.01987 arXiv:2306.01987 [cs]
  13. The Pile: An 800GB Dataset of Diverse Text for Language Modeling. arXiv preprint arXiv:2101.00027 (2020).
  14. Quantifying OpenMP: Statistical insights into usage and adoption. In 2023 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 1–7.
  15. Domain-Specific Code Language Models: Unraveling the Potential for HPC Codes and Tasks. arXiv preprint arXiv:2312.13322 (2023).
  16. Advising OpenMP Parallelization via a Graph-Based Approach with Transformers. arXiv preprint arXiv:2305.11999 (2023).
  17. The Stack: 3 TB of permissively licensed source code. Preprint (2022).
  18. Structured Chain-of-Thought Prompting for Code Generation. arXiv:2305.06599 [cs] http://arxiv.org/abs/2305.06599
  19. StarCoder: may the source be with you! arXiv:2305.06161 [cs.CL]
  20. Improving ChatGPT Prompt for Code Generation. https://doi.org/10.48550/arXiv.2305.08360 arXiv:2305.08360 [cs]
  21. WizardCoder: Empowering Code Large Language Models with Evol-Instruct. arXiv:2306.08568 [cs.CL]
  22. Modeling Parallel Programs using Large Language Models. arXiv:2306.17281 [cs.DC]
  23. Code Llama: Open Foundation Models for Code. arXiv:2308.12950 [cs.CL]
  24. Achieving High-Level Software Component Summarization via Hierarchical Chain-of-Thought Prompting and Static Code Analysis. In 2023 IEEE International Conference on Data and Software Engineering (ICoDSE). IEEE, 7–12.
  25. Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv:2307.09288 [cs.CL]
  26. Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation. arXiv:2309.07103 [cs.SE]
  27. Attention is all you need. Advances in neural information processing systems 30 (2017).
  28. Ben Wang and Aran Komatsuzaki. 2021. GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model. https://github.com/kingoflolz/mesh-transformer-jax.
  29. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arXiv:2201.11903 [cs.CL]
  30. A systematic evaluation of large language models of code. In Proceedings of the 6th ACM SIGPLAN International Symposium on Machine Programming (San Diego, CA, USA) (MAPS 2022). Association for Computing Machinery, New York, NY, USA, 1–10. https://doi.org/10.1145/3520312.3534862
Citations (8)