[AARR] What’s the Magic Word? A Control Theory Of LLM Prompting

Post Details

Company

Align AI

Date Published

July 1, 2024

Author

Align AI R&D Team

Word Count

644

Language

English

Hacker News Points

-

Source URL

tryalign.ai/resources/blog/aarr-what-s-the-magic-word-a-control-theory-of-llm-prompting

Summary

Caltech's new study investigates how effectively designed input prompts can significantly impact large language model (LLM) outcomes, changing unlikely predictions into likely ones. The research conceptualizes LLMs as discrete stochastic dynamical systems and uses control theory to understand and modify their outputs. Prompt engineering is shown to have a major impact on LLM behavior. Limitations in existing work include the reliance on heuristics for prompt optimization, dependence on gradient information at the token embedding layer, and restricted analysis of LLM controllability to 'meaningful sentences.' The proposed system formalizes LLMs as a type of discrete stochastic dynamical system and analyzes the reachable set of system outputs. Empirical findings indicate that short prompt sequences can significantly change the chance of specific outputs, even transforming the least likely tokens into the most likely ones.