CSA-Trans: Code Structure Aware Transformer for AST (2404.05767v1)

Published 7 Apr 2024 in cs.SE and cs.AI

Abstract: When applying the Transformer architecture to source code, designing a good self-attention mechanism is critical, as it affects how node relationships are extracted from the Abstract Syntax Trees (ASTs) of the source code. We present Code Structure Aware Transformer (CSA-Trans), which uses a Code Structure Embedder (CSE) to generate a node-specific Positional Encoding (PE) for each node in the AST. CSE generates the node PE using disentangled attention. To further extend the self-attention capability, we adopt Stochastic Block Model (SBM) attention. Our evaluation shows that our PE captures the relationships between AST nodes better than other graph-related PE techniques. We also show, through quantitative and qualitative analysis, that SBM attention generates more node-specific attention coefficients. We demonstrate that CSA-Trans outperforms 14 baselines in code summarization tasks for both Python and Java, while being 41.92% faster and 25.31% more memory efficient on the Java dataset compared to AST-Trans and SG-Trans, respectively.
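
The abstract names two mechanisms: disentangled attention, which the CSE uses to produce a node-specific positional encoding, and SBM attention, which softly clusters nodes and reweights attention by learned block connectivity. The PyTorch sketch below is only an illustrative approximation of those two ideas under stated assumptions, not the authors' implementation; the class names (DisentangledPE, SBMAttention), the simple soft gating, and all hyperparameters are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class DisentangledPE(nn.Module):
    """Produce a positional encoding per AST node by attending over initial
    positional features with content-to-position and position-to-content terms,
    in the spirit of DeBERTa-style disentangled attention."""

    def __init__(self, d_model: int):
        super().__init__()
        self.q_c = nn.Linear(d_model, d_model)  # content query
        self.k_c = nn.Linear(d_model, d_model)  # content key
        self.q_p = nn.Linear(d_model, d_model)  # position query
        self.k_p = nn.Linear(d_model, d_model)  # position key
        self.v = nn.Linear(d_model, d_model)    # value over positional features

    def forward(self, content: torch.Tensor, position: torch.Tensor) -> torch.Tensor:
        # content, position: (batch, num_nodes, d_model)
        scale = content.size(-1) ** 0.5
        a_cc = self.q_c(content) @ self.k_c(content).transpose(-1, -2)   # content-to-content
        a_cp = self.q_c(content) @ self.k_p(position).transpose(-1, -2)  # content-to-position
        a_pc = self.q_p(position) @ self.k_c(content).transpose(-1, -2)  # position-to-content
        attn = F.softmax((a_cc + a_cp + a_pc) / scale, dim=-1)
        return attn @ self.v(position)  # node-specific positional encoding


class SBMAttention(nn.Module):
    """Self-attention gated by a soft stochastic-block-model structure: nodes are
    softly assigned to K blocks, and pairwise scores are weighted by a learned
    inter-block connectivity matrix."""

    def __init__(self, d_model: int, num_blocks: int = 4):
        super().__init__()
        self.qkv = nn.Linear(d_model, 3 * d_model)
        self.cluster = nn.Linear(d_model, num_blocks)              # soft block assignment
        self.block_conn = nn.Parameter(torch.zeros(num_blocks, num_blocks))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        scores = F.softmax(q @ k.transpose(-1, -2) / (x.size(-1) ** 0.5), dim=-1)
        z = F.softmax(self.cluster(x), dim=-1)                     # (batch, nodes, K)
        gate = z @ torch.sigmoid(self.block_conn) @ z.transpose(-1, -2)
        attn = scores * gate                                       # block-aware reweighting
        attn = attn / attn.sum(dim=-1, keepdim=True).clamp_min(1e-9)
        return attn @ v


if __name__ == "__main__":
    nodes = torch.randn(2, 16, 64)      # toy AST node (content) embeddings
    pos_feats = torch.randn(2, 16, 64)  # toy initial positional features
    pe = DisentangledPE(64)(nodes, pos_feats)
    out = SBMAttention(64)(nodes + pe)
    print(pe.shape, out.shape)          # both torch.Size([2, 16, 64])
```

In the paper, the CSE-produced PE is combined with the node embeddings before the encoder, and the reported gains (e.g. the 41.92% speedup) come from the full model; this sketch is only meant to make the two attention variants concrete.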

References (47)
  1. A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, “Attention is all you need,” in Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems NIPS, 2017.
  2. A. Dosovitskiy, L. Beyer, A. Kolesnikov, et al., “An image is worth 16x16 words: Transformers for image recognition at scale,” in 9th International Conference on Learning Representations, ICLR, 2021.
  3. T. B. Brown, B. Mann, N. Ryder, et al., “Language models are few-shot learners,” in Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems NeurIPS, 2020.
  4. A. Nambiar, M. Heflin, S. Liu, S. Maslov, M. Hopkins, and A. M. Ritz, “Transforming the language of life: Transformer neural networks for protein prediction tasks,” in International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB, 2020.
  5. P. Shaw, J. Uszkoreit, and A. Vaswani, “Self-attention with relative position representations,” in North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, 2018.
  6. X. Chu, B. Zhang, Z. Tian, X. Wei, and H. Xia, “Do we really need explicit position encodings for vision transformers?” 2021.
  7. V. P. Dwivedi, C. K. Joshi, T. Laurent, Y. Bengio, and X. Bresson, “Benchmarking graph neural networks,” 2020.
  8. V. P. Dwivedi, A. T. Luu, T. Laurent, Y. Bengio, and X. Bresson, “Graph neural networks with learnable structural and positional representations,” in The Tenth International Conference on Learning Representations, ICLR, 2022.
  9. R. Minelli, A. Mocci, and M. Lanza, “I know what you did last summer - an investigation of how developers spend their time,” in IEEE 23rd International Conference on Program Comprehension (ICPC), 2015.
  10. X. Xia, L. Bao, D. Lo, Z. Xing, A. E. Hassan, and S. Li, “Measuring program comprehension: A large-scale field study with professionals,” 2018.
  11. X. Hu, G. Li, X. Xia, D. Lo, and Z. Jin, “Deep code comment generation,” in IEEE/ACM 26th International Conference on Program Comprehension (ICPC), 2018.
  12. P. W. McBurney and C. McMillan, “Automatic documentation generation via source code summarization of method context,” in 22nd International Conference on Program Comprehension, ICPC 2014, Hyderabad, India, June 2-3, 2014, 2014.
  13. S. Haiduc, J. Aponte, and A. Marcus, “Supporting program comprehension with source code summarization,” in Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 2, ICSE 2010, Cape Town, South Africa, 1-8 May 2010, 2010.
  14. Z. Tang, X. Shen, C. Li, J. Ge, L. Huang, Z. Zhu, and B. Luo, “Ast-trans: Code summarization with efficient tree-structured attention,” in IEEE/ACM 44th International Conference on Software Engineering (ICSE), 2022.
  15. S. Gao, C. Gao, Y. He, J. Zeng, L. Y. Nie, and X. Xia, “Code structure guided transformer for source code summarization,” ACM Transactions on Software Engineering and Methodology, 2021.
  16. J. Guo, J. Liu, Y. Wan, L. Li, and P. Zhou, “Modeling hierarchical syntax structure with triplet position for source code summarization,” in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022, 2022.
  17. S. Cho, S. Min, J. Kim, M. Lee, H. Lee, and S. Hong, “Transformers meet stochastic block models: Attention with data-adaptive sparsity and cost,” in Annual Conference on Neural Information Processing Systems, NeurIPS, 2022.
  18. V. L. Shiv and C. Quirk, “Novel positional encodings to enable tree-based transformers,” in Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems NeurIPS, 2019.
  19. S. Iyer, I. Konstas, A. Cheung, and L. Zettlemoyer, “Summarizing source code using a neural attention model,” in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL, 2016.
  20. A. Eriguchi, K. Hashimoto, and Y. Tsuruoka, “Tree-to-sequence attentional neural machine translation,” in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL, 2016.
  21. H. Wang, H. Yin, M. Zhang, and P. Li, “Equivariant and stable positional encoding for more powerful graph neural networks,” in The Tenth International Conference on Learning Representations, ICLR, 2022.
  22. P. He, X. Liu, J. Gao, and W. Chen, “Deberta: decoding-enhanced bert with disentangled attention,” in 9th International Conference on Learning Representations, ICLR, 2021.
  23. X. Hu, G. Li, X. Xia, D. Lo, S. Lu, and Z. Jin, “Summarizing source code with transferred API knowledge,” in Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI, 2018.
  24. Y. Wan, Z. Zhao, M. Yang, G. Xu, H. Ying, J. Wu, and P. S. Yu, “Improving automatic source code summarization via deep reinforcement learning,” in Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, ASE, 2018.
  25. K. Papineni, S. Roukos, T. Ward, and W. Zhu, “Bleu: a method for automatic evaluation of machine translation,” in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics ACL, 2002.
  26. S. Jiang, A. Armaly, and C. McMillan, “Automatically generating commit messages from diffs using neural machine translation,” in Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, ASE, 2017.
  27. J. Zhang, M. Utiyama, E. Sumita, G. Neubig, and S. Nakamura, “Guiding neural machine translation with retrieved translation pieces,” in Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, 2018.
  28. S. Banerjee and A. Lavie, “METEOR: an automatic metric for MT evaluation with improved correlation with human judgments,” in Proceedings of the Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization@ACL, 2005.
  29. C.-Y. Lin, “Rouge: A package for automatic evaluation of summaries,” 2004.
  30. B. Wei, G. Li, X. Xia, Z. Fu, and Z. Jin, “Code generation as a dual task of code summarization,” in Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems NeurIPS, 2019.
  31. W. U. Ahmad, S. Chakraborty, B. Ray, and K. Chang, “A transformer-based approach for source code summarization,” in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020, 2020.
  32. U. Alon, S. Brody, O. Levy, and E. Yahav, “code2seq: Generating sequences from structured representations of code,” in 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019, 2019.
  33. Y. Choi, J. Bak, C. Na, and J. Lee, “Learning sequential and structural information for source code summarization,” in Findings of the Association for Computational Linguistics: ACL/IJCNLP, 2021.
  34. V. J. Hellendoorn, C. Sutton, R. Singh, P. Maniatis, and D. Bieber, “Global relational models of source code,” in 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020, 2020.
  35. Y. Wang, W. Wang, S. R. Joty, and S. C. H. Hoi, “Codet5: Identifier-aware unified pre-trained encoder-decoder models for code understanding and generation,” 2021.
  36. I. Loshchilov and F. Hutter, “Decoupled weight decay regularization,” in 7th International Conference on Learning Representations, ICLR, 2019.
  37. H. Peng, G. Li, W. Wang, Y. Zhao, and Z. Jin, “Integrating tree path in transformer for code representation,” in 35th Conference on Neural Information Processing Systems, NeurIPS, 2021.
  38. S. Liu, Y. Chen, X. Xie, J. K. Siow, and Y. Liu, “Retrieval-augmented generation for code summarization via hybrid GNN,” in 9th International Conference on Learning Representations ICLR, 2021.
  39. J. Qiu, J. Tang, H. Ma, Y. Dong, K. Wang, and J. Tang, “DeepInf: Social Influence Prediction with Deep Learning,” in International Conference on Knowledge Discovery and Data Mining, SIGKDD, 2018.
  40. S. Wu, Y. Tang, Y. Zhu, L. Wang, X. Xie, and T. Tan, “Session-based recommendation with graph neural networks,” 2019.
  41. J. Gilmer, S. S. Schoenholz, P. F. Riley, O. Vinyals, and G. E. Dahl, “Neural message passing for quantum chemistry,” in International Conference on Machine Learning, ICML, 2017.
  42. T. N. Kipf and M. Welling, “Semi-supervised classification with graph convolutional networks,” in 5th International Conference on Learning Representations, ICLR, 2017.
  43. P. Velickovic, G. Cucurull, A. Casanova, A. Romero, P. Liò, and Y. Bengio, “Graph attention networks,” in 6th International Conference on Learning Representations, ICLR, 2018.
  44. C. Ying, T. Cai, S. Luo, S. Zheng, G. Ke, D. He, Y. Shen, and T.-Y. Liu, “Do transformers really perform bad for graph representation?” in 35th Conference on Neural Information Processing Systems NeurIPS, 2021.
  45. H. Maron, H. Ben-Hamu, N. Shamir, and Y. Lipman, “Invariant and equivariant graph networks,” in 7th International Conference on Learning Representations, ICLR, 2019.
  46. J. Kim, S. Oh, and S. Hong, “Transformers generalize deepsets and can be extended to graphs & hypergraphs,” in Annual Conference on Neural Information Processing Systems, NeurIPS, 2021.
  47. J. Kim, T. D. Nguyen, S. Min, S. Cho, M. Lee, H. Lee, and S. Hong, “Pure transformers are powerful graph learners,” in Annual Conference on Neural Information Processing Systems, NeurIPS, 2022.
Authors (2)
  1. Saeyoon Oh (4 papers)
  2. Shin Yoo (49 papers)
