Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Generative Models as a Complex Systems Science: How can we make sense of large language model behavior? (2308.00189v1)

Published 31 Jul 2023 in cs.LG, cs.AI, and cs.CL

Abstract: Coaxing out desired behavior from pretrained models, while avoiding undesirable ones, has redefined NLP and is reshaping how we interact with computers. What was once a scientific engineering discipline-in which building blocks are stacked one on top of the other-is arguably already a complex systems science, in which emergent behaviors are sought out to support previously unimagined use cases. Despite the ever increasing number of benchmarks that measure task performance, we lack explanations of what behaviors LLMs exhibit that allow them to complete these tasks in the first place. We argue for a systematic effort to decompose LLM behavior into categories that explain cross-task performance, to guide mechanistic explanations and help future-proof analytic research.

Citations (10)

Summary

We haven't generated a summary for this paper yet.