×
Emergent properties of LLMs puzzle AI researchers
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

The emergence of new capabilities in large language models (LLMs) follows predictable mathematical patterns rather than appearing mysteriously. Understanding these threshold-based behaviors can help researchers better anticipate and potentially accelerate the development of advanced AI capabilities. This mathematical perspective on emergence offers valuable insights into why LLMs suddenly demonstrate new abilities when scaled beyond certain parameter thresholds.

The big picture: Emergence—the sudden appearance of new capabilities at specific thresholds—occurs naturally in many systems from physics to mathematics, making similar patterns in LLMs mathematically expected rather than surprising.

  • Examples in nature include phase changes like ice suddenly becoming water, or a car becoming functional only after receiving its fourth wheel.
  • The phenomenon also appears in simple machine learning scenarios, such as regression models that suddenly achieve perfect accuracy when parameter counts reach specific thresholds.

Key examples in mathematics: Simple regression models demonstrate how emergent behaviors arise naturally from mathematical principles.

  • A regression using monomials will have non-zero error until the degree reaches N-1 (for N data points), at which point the error suddenly drops to zero.
  • Similarly, k-means clustering algorithms show dramatic error reduction when the number of cluster centers matches the actual number of clusters in the data.

How this applies to LLMs: The complexity requirements for language tasks create natural thresholds that explain why new capabilities emerge abruptly.

  • Boolean circuits illustrate how some functions require a minimum complexity threshold to be implemented at all—until that threshold is reached, the capability simply cannot exist.
  • Similarly, language tasks like understanding multi-step instructions or solving complex logic problems may be impossible until LLM parameter counts reach specific thresholds.

Between the lines: What appears as “emergent” behavior may simply reflect the inherent minimum complexity requirement for implementing certain capabilities.

  • The apparent suddenness of new capabilities likely reflects crossing mathematical thresholds rather than unexplainable emergence.
  • These patterns suggest some LLM capabilities cannot be achieved incrementally but instead require reaching specific complexity thresholds.

The bottom line: While the emergence of new capabilities in LLMs may seem surprising, it follows mathematically predictable patterns seen across natural and computational systems.

  • Predicting specific emergent behaviors remains challenging but understanding the mathematical basis makes the phenomenon itself less mysterious.
  • This perspective helps explain why scaling laws in AI development often show sudden performance jumps rather than smooth, continuous improvement.
Why do LLMs have emergent properties?

Recent News

AI chip tracking bill aims to curb China’s tech access

New legislation would mandate location-tracking systems on export-controlled AI semiconductors to prevent smuggling to China, as bipartisan concerns grow over technology diversion that could enhance military capabilities.

AI transforms motivation and learning approaches, study finds

AI systems showcase learning without traditional motivation, suggesting educators should focus more on designing effective learning environments rather than inspiring students.

Chrome’s new AI mode blocks scams: How it safeguards users

Chrome's new AI protection system analyzes suspicious patterns to identify and block previously undetected scams across Google's platforms, keeping users twice as safe from phishing attempts.