Emergent Mind

Abstract

The prominent LLMs of today differ from past language models not only in size, but also in the fact that they are trained on a combination of natural language and formal language (code). As a medium between humans and computers, code translates high-level goals into executable steps, featuring standard syntax, logical consistency, abstraction, and modularity. In this survey, we present an overview of the various benefits of integrating code into LLMs' training data. Specifically, beyond enhancing LLMs in code generation, we observe that these unique properties of code help (i) unlock the reasoning ability of LLMs, enabling their application to a range of more complex natural language tasks; (ii) steer LLMs to produce structured and precise intermediate steps, which can then be connected to external execution ends through function calls; and (iii) take advantage of the code compilation and execution environment, which also provides diverse feedback for model improvement. In addition, we trace how these profound capabilities of LLMs, brought by code, have led to their emergence as intelligent agents (IAs) in situations where the ability to understand instructions, decompose goals, plan and execute actions, and refine from feedback are crucial to their success on downstream tasks. Finally, we present several key challenges and future directions of empowering LLMs with code.

Overview

  • Including code in LLM training enhances their programming proficiency, allowing them to write and assess code in multiple languages.

  • LLMs demonstrate improved complex reasoning and 'chain of thought' skills, beneficial for breaking down and solving intricate tasks.

  • Training with code enhances LLMs' capabilities to comprehend and generate content with structured data like HTML or tables.

  • Code-trained LLMs evolve into sophisticated IAs, capable of decision-making and executing complex tasks involving different tools and APIs.

  • Future challenges include determining the extent of enhanced reasoning due to code pre-training and perfecting feedback-based self-improvement.

Overview of Integrating Code into LLM Training

Training data plays a pivotal role in developing LLMs. The inclusion of formal language or code in the training materials greatly broadens the scope of what these models can do. Code, by nature, is structured and logically consistent, qualities that when learned can significantly enhance the capabilities of an LLM. The paper explores how code not only aids in improving LLMs' core programming abilities but also in boosting their problem-solving skills and connecting them to a variety of functional ends.

Enhancements in LLM Abilities

Programming Proficiency

One of the most noticeable improvements with code integration is in programming competence. LLMs excel in both writing and assessing code, outperforming earlier language models on standard code benchmarks. They can manage multiple programming languages and have been applied to an array of tasks from software development to data analysis. This proficiency has also extended to evaluating code for errors and performance, indicating a comprehensive understanding of code beyond mere generation.

Complex Reasoning Capabilities

Another critical enhancement is in sophisticated reasoning skills. LLMs show improved performance in tasks that require methodical, step-by-step processing—key elements of programming logic. This has been evident in models showing improved 'chain of thought' capabilities, which allow them to break down complex tasks into smaller, more manageable steps, resulting in better decision-making processes.
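This style of step-by-step decomposition can be sketched with a program-aided reasoning example, where the model emits small executable steps for a word problem instead of a direct answer. The function name and the example problem below are illustrative, not taken from the survey:

```python
# Hypothetical sketch: rather than answering a word problem in one
# shot, the model decomposes it into named, executable steps, and
# running the program yields the final answer.

def solve_word_problem():
    # "Roger has 5 tennis balls. He buys 2 cans of 3 balls each.
    #  How many balls does he have now?"
    initial_balls = 5                 # step 1: starting count
    cans = 2                          # step 2: cans purchased
    balls_per_can = 3                 # step 3: balls in each can
    bought = cans * balls_per_can     # step 4: total balls bought
    return initial_balls + bought     # step 5: combine

print(solve_word_problem())  # → 11
```

Because each intermediate quantity is explicit, errors in individual steps are easy to localize, which is part of why code-style reasoning traces help on intricate tasks.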

Structured Knowledge Capture

LLMs trained with code also show advanced abilities to understand and generate content that involves structured information. The models can handle diverse forms of structured data, such as HTML or tabular representations. This ability is particularly important for interpreting markup-based and multimedia information, and it enhances the LLMs' interaction with graph-structured data.
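The kind of mapping between flat tabular data and markup that such models learn can be mimicked with a small deterministic helper. The function below is an illustrative stand-in, not anything from the survey:

```python
# Hypothetical illustration of structured-data handling: converting
# rows of cells into the equivalent HTML table markup, the sort of
# text-to-markup correspondence a code-trained model picks up.

def rows_to_html(rows):
    body = "".join(
        "<tr>" + "".join(f"<td>{cell}</td>" for cell in row) + "</tr>"
        for row in rows
    )
    return f"<table>{body}</table>"

print(rows_to_html([["name", "score"], ["Ada", "9"]]))
```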

LLMs as Intelligent Agents

Decision Making and Execution

Through code training, LLMs have developed into more sophisticated IAs. They can perceive environments, plan, and execute actions more effectively, thanks to their improved ability to process structured information. The models can also use code to interact with multiple tools and APIs, allowing them to perform a wider range of tasks, including those that involve complex interplays between different modalities and physical-world interactions.
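The tool-use pattern described above can be sketched in a few lines: the model emits a structured function call rather than free-form text, and a dispatcher maps it onto a real API. The tool registry, the `get_weather` stub, and the model output shown here are all made-up placeholders for illustration:

```python
import json

# Hypothetical sketch of LLM tool use: the model's output is a
# machine-parseable function call, which a dispatcher executes.

def get_weather(city: str) -> str:
    # stand-in for a real weather API call
    return f"Sunny in {city}"

TOOLS = {"get_weather": get_weather}

# What the model might emit instead of a natural-language answer:
model_output = '{"tool": "get_weather", "arguments": {"city": "Paris"}}'

call = json.loads(model_output)
result = TOOLS[call["tool"]](**call["arguments"])
print(result)  # Sunny in Paris
```

The key point is that code training makes the structured, parseable output format natural for the model, which is what lets it be wired reliably to external tools.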

Self-improvement through Feedback

An important aspect of intelligent behavior is the ability to learn from feedback and adapt. Code-driven environments allow LLMs to collect highly reliable and precise feedback—essential for refining their performance. This is a self-reinforcing loop as better performance opens up avenues for receiving more feedback, which in turn leads to further improvements.
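The feedback loop can be sketched as follows: each candidate program is executed, and the interpreter's error message serves as the precise repair signal fed back to the model. Here the `candidates` list stands in for successive model generations, which would in practice be produced by re-prompting with the traceback:

```python
import traceback

# Hypothetical sketch of execution-based self-improvement: run a
# candidate, and on failure capture the traceback as feedback for
# the next generation attempt.

candidates = [
    "result = 10 / 0",   # first attempt: raises ZeroDivisionError
    "result = 10 / 2",   # revised attempt after seeing the error
]

namespace = {}
for attempt, code in enumerate(candidates, start=1):
    try:
        exec(code, namespace)
        print(f"attempt {attempt} succeeded: result = {namespace['result']}")
        break
    except Exception:
        feedback = traceback.format_exc()  # returned to the model in practice
        print(f"attempt {attempt} failed:\n{feedback}")
```

Unlike human preference judgments, this signal is automatic, exact, and grounded in actual program behavior, which is what makes code environments such a reliable source of feedback.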

Challenges and Future Directions

Despite the progress, some key questions and challenges remain. For one, it is not clear exactly how much of the enhancement in reasoning abilities comes directly from code pre-training. Whether reasoning abilities beyond those code directly teaches can be acquired is another area that invites further exploration. Lastly, methods to fully leverage feedback for performance improvement in multi-turn interactions are still being developed, with potential avenues such as reinforcement learning yet to be thoroughly explored.

Conclusion

The survey presents a robust argument for the value of including code in LLM training. It underscores the significant impact this inclusion has on the models' capabilities and their application as IAs. Nevertheless, it also offers a perspective on the challenges that remain, providing a catalyst for further research in this domain.
