Optimization is ubiquitous. While derivative-based algorithms have been powerful tools for various problems, the absence of gradient imposes challenges on many real-world applications. In this work, we propose Optimization by PROmpting (OPRO), a simple and effective approach to leverage large language models (LLMs) as optimizers, where the optimization task is described in natural language. In each optimization step, the LLM generates new solutions from the prompt that contains previously generated solutions with their values, then the new solutions are evaluated and added to the prompt for the next optimization step. We first showcase OPRO on linear regression and traveling salesman problems, then move on to our main application in prompt optimization, where the goal is to find instructions that maximize the task accuracy. With a variety of LLMs, we demonstrate that the best prompts optimized by OPRO outperform human-designed prompts by up to 8% on GSM8K, and by up to 50% on Big-Bench Hard tasks. Code at https://github.com/google-deepmind/opro.

## Overview

- Optimization is a common task, but traditional gradient-based methods have limitations when gradients are not available.
- The paper proposes a new approach called "Optimization by PROmpting" (OPRO) that uses large language models (LLMs) as optimizers, where the optimization task is described in natural language.
- OPRO generates new solutions iteratively, evaluates them, and adds them to the prompt for the next step.
- The authors demonstrate OPRO's effectiveness on linear regression, traveling salesman, and prompt optimization problems, showing significant improvements over human-designed prompts.

## Plain English Explanation

Optimization is a fundamental problem that arises in many real-world situations, such as [finding the best route for a delivery truck](https://aimodels.fyi/papers/arxiv/automatic-prompt-selection-large-language-models) or [selecting the most effective prompts for a language model](https://aimodels.fyi/papers/arxiv/language-model-prompt-selection-via-simulation-optimization). Traditional optimization methods that rely on calculating gradients can work well, but they struggle when gradients are not available, which is common in many practical applications.

To address this, the researchers propose a new approach called "Optimization by PROmpting" (OPRO). The key idea is to use powerful [large language models (LLMs)](https://aimodels.fyi/papers/arxiv/cseprompts-benchmark-introductory-computer-science-prompts) as the optimizers, where the optimization task is described in natural language. In each optimization step, the LLM generates new candidate solutions based on the prompt, which contains information about the previously generated solutions and their values. These new solutions are then evaluated, and the best ones are added to the prompt for the next optimization step.

The researchers demonstrate OPRO's effectiveness on several problems, including [linear regression](https://aimodels.fyi/papers/arxiv/llm-rec-personalized-recommendation-via-prompting-large) and the [traveling salesman problem](https://aimodels.fyi/papers/arxiv/large-language-models-are-good-prompt-learners). They also show that OPRO can be used to optimize the prompts themselves, finding instructions that significantly outperform human-designed prompts on challenging language model tasks.

## Technical Explanation

The key innovation in this work is the use of large language models (LLMs) as optimization engines, where the optimization task is described in natural language. This approach, called "Optimization by PROmpting" (OPRO), iteratively generates new candidate solutions based on the current prompt, evaluates them, and adds the best ones to the prompt for the next iteration.

In each optimization step, the LLM takes the current prompt, which includes information about the previously generated solutions and their values, and generates new candidate solutions. These new solutions are then evaluated, and the best ones are added to the prompt for the next step. This process continues until a stopping criterion is met, such as a maximum number of iterations or a target objective value.

The researchers demonstrate OPRO's effectiveness on several problems, including linear regression, the traveling salesman problem, and prompt optimization for language models. In the prompt optimization task, they show that the best prompts found by OPRO can outperform human-designed prompts by up to 8% on the GSM8K benchmark and up to 50% on the more challenging Big-Bench Hard tasks.

## Critical Analysis

One potential limitation of the OPRO approach is that it relies on the ability of the LLM to generate high-quality candidate solutions based on the current prompt. If the LLM struggles to understand the optimization problem or to generate promising new solutions, the optimization process may not converge to a good result. Additionally, the authors note that OPRO can be computationally expensive, as each optimization step requires running the LLM to generate new solutions.

Another concern is the need for careful prompt engineering to ensure that the LLM understands the optimization problem correctly. If the prompt is not well-designed, the LLM may generate irrelevant or suboptimal solutions, leading to poor optimization performance.

Despite these potential limitations, the OPRO approach represents an interesting and novel application of large language models, demonstrating their potential as powerful optimization tools. The authors have provided an open-source implementation of OPRO, which should encourage further research and experimentation in this area.

## Conclusion

The paper presents a novel approach called "Optimization by PROmpting" (OPRO) that leverages the power of large language models (LLMs) to optimize complex problems where traditional gradient-based methods may struggle. By describing the optimization task in natural language and iteratively generating and evaluating candidate solutions, OPRO has been shown to outperform human-designed prompts on a range of tasks, including prompt optimization for language models.

While OPRO has some potential limitations, such as the need for careful prompt engineering and computational expense, the authors' work highlights the exciting potential of using LLMs as optimization tools. As language models continue to advance, the OPRO approach may become an increasingly valuable tool for tackling a wide range of optimization challenges in the real world.