Competition Dynamics Shape Algorithmic Phases of In-Context Learning

stp2y, December 23, 2024

[Submitted on 1 Dec 2024 (v1), last revised 20 Dec 2024 (this version, v2)]

Authors: Core Francisco Park and 3 other authors

Abstract: In-Context Learning (ICL) has significantly expanded the general-purpose nature of large language models, allowing them to adapt to novel tasks using only the provided context. This has motivated a series of papers that analyze tractable synthetic domains and postulate precise mechanisms that may underlie ICL. However, these studies use relatively distinct setups that often lack a sequence-modeling character, which makes it unclear how general their reported insights are. Motivated by this, we propose a synthetic sequence modeling task that involves learning to simulate a finite mixture of Markov chains. As we show, models trained on this task reproduce most well-known results on ICL, hence offering a unified setting for studying the concept. Building on this setup, we demonstrate that we can explain a model’s behavior by decomposing it into four broad algorithms that combine a fuzzy retrieval vs. inference approach with either unigram or bigram statistics of the context. These algorithms engage in competition dynamics to dominate model behavior, with the precise experimental conditions dictating which algorithm ends up superseding the others: e.g., we find that merely varying context size or amount of training yields (at times sharp) transitions between which algorithm dictates the model behavior, revealing a mechanism that explains the transient nature of ICL. In this sense, we argue ICL is best thought of as a mixture of different algorithms, each with its own peculiarities, instead of a monolithic capability. This also implies that making general claims about ICL that hold universally across all settings may be infeasible.
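
To make the setup in the abstract concrete, here is a minimal Python sketch of sampling from a finite mixture of Markov chains and of three simple next-token predictors built from context statistics. It is an illustration under stated assumptions, not the authors' implementation: the function names (random_chain, sample_sequence, unigram_inference, bigram_inference, bigram_retrieval), the add-one smoothing, the uniform prior over chains, and the reading of "retrieval" as a posterior-weighted lookup over known chains are all hypothetical choices made here. The abstract only states that the four algorithms combine retrieval vs. inference with unigram or bigram statistics.

```python
import math
import random


def random_chain(num_states, rng):
    """Draw a row-stochastic transition matrix by normalizing random weights."""
    rows = []
    for _ in range(num_states):
        weights = [rng.random() + 1e-6 for _ in range(num_states)]
        total = sum(weights)
        rows.append([w / total for w in weights])
    return rows


def sample_sequence(chain, length, rng):
    """Sample a token sequence (length >= 1) from a single Markov chain."""
    states = range(len(chain))
    seq = [rng.randrange(len(chain))]
    while len(seq) < length:
        seq.append(rng.choices(states, weights=chain[seq[-1]])[0])
    return seq


def unigram_inference(context, num_states):
    """Assumed 'unigram inference': predict from smoothed token frequencies."""
    counts = [1.0] * num_states  # add-one smoothing (an assumption)
    for token in context:
        counts[token] += 1.0
    total = sum(counts)
    return [c / total for c in counts]


def bigram_inference(context, num_states):
    """Assumed 'bigram inference': predict from transitions out of the last token."""
    counts = [1.0] * num_states  # add-one smoothing (an assumption)
    last = context[-1]
    for prev, nxt in zip(context, context[1:]):
        if prev == last:
            counts[nxt] += 1.0
    total = sum(counts)
    return [c / total for c in counts]


def bigram_retrieval(context, chains):
    """Assumed 'bigram retrieval': weight known chains by context likelihood."""
    log_liks = [
        sum(math.log(chain[p][n]) for p, n in zip(context, context[1:]))
        for chain in chains
    ]
    top = max(log_liks)  # subtract the max for numerical stability
    weights = [math.exp(ll - top) for ll in log_liks]
    total = sum(weights)  # uniform prior over chains (an assumption)
    last = context[-1]
    return [
        sum(w / total * chain[last][s] for w, chain in zip(weights, chains))
        for s in range(len(chains[0]))
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    chains = [random_chain(3, rng) for _ in range(4)]  # the finite mixture
    context = sample_sequence(rng.choice(chains), 50, rng)
    print(unigram_inference(context, 3))
    print(bigram_inference(context, 3))
    print(bigram_retrieval(context, chains))
```

In this sketch the retrieval predictor can only reproduce chains it already knows, while the inference predictors estimate statistics from the context alone, so which one predicts better depends on context length and on whether the context came from a known chain.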

Submission history

From: Core Francisco Park
[v1] Sun, 1 Dec 2024 23:35:53 UTC (27,227 KB)
[v2] Fri, 20 Dec 2024 07:53:56 UTC (27,405 KB)
