Formal Specifications from Natural Language (2206.01962v2)

Published 4 Jun 2022 in cs.SE, cs.LG, and cs.PL

Abstract: We study the generalization abilities of LLMs when translating natural language into formal specifications with complex semantics. In particular, we fine-tune LLMs on three datasets consisting of English sentences and their corresponding formal representations: 1) regular expressions (regex), frequently used in programming and search; 2) first-order logic (FOL), commonly used in software verification and theorem proving; and 3) linear-time temporal logic (LTL), which forms the basis for industrial hardware specification languages. Our experiments show that, in these diverse domains, the LLMs retain the generalization capabilities gained from pre-training on natural language, generalizing, e.g., to new variable names or operator descriptions. Additionally, they achieve competitive performance, and even outperform the state of the art for translating into regular expressions, with the benefits of being easy to access, efficient to fine-tune, and having no particular need for domain-specific reasoning.

Citations (23)
