The problem with writing prompts
If you've spent any time building with LLMs, you know the cycle. You write a prompt for one task. It works. Then you need a slightly different task, so you write another prompt. Then another. Eventually you have a folder full of carefully crafted prompts, each tuned to one specific thing.
Metaprompting is the idea that instead of writing better prompts for each task, you write a prompt that teaches the model how to approach tasks in general.
What metaprompting actually is
The core concept is simple: instead of telling an LLM what to do, you tell it how to figure out what to do.
A regular prompt says: "Summarize this article in 3 bullet points."
A meta-prompt says something closer to: "When given a task, first identify what type of problem it is. Then determine what approach would work best for that type of problem. Then execute that approach. Then verify your output."
The model gets a reasoning scaffold — a structure for thinking — rather than step-by-step instructions for a specific task. One well-designed meta-prompt can handle tasks across completely different domains.
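To make the contrast concrete, here's a minimal sketch in Python. Nothing in it comes from a specific paper or library: the helper names and prompt strings are my own, and the meta-prompt version just prepends the same reasoning scaffold to any task.

```python
# Hypothetical sketch: a task-specific prompt vs. a task-agnostic meta-prompt.

def build_direct_prompt(article: str) -> str:
    """A regular prompt: tied to one specific task."""
    return f"Summarize this article in 3 bullet points:\n\n{article}"

META_PROMPT = """When given a task:
1. Identify what type of problem it is.
2. Determine what approach would work best for that type of problem.
3. Execute that approach.
4. Verify your output before returning it."""

def build_meta_prompt(task: str) -> str:
    """A meta-prompt: the same reasoning scaffold, whatever the task."""
    return f"{META_PROMPT}\n\nTask: {task}"

# The meta-prompt works unchanged across unrelated tasks:
for task in ["Summarize this article.", "Debug this function.", "Draft a reply."]:
    prompt = build_meta_prompt(task)
```

The direct prompt has to be rewritten for every new task; the meta-prompt only changes in the final line.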
Two ways people think about it
There are two main framings in the research:
The conductor model — described in Suzgun and Kalai's 2024 paper from Stanford and Microsoft Research. The idea is that a single LLM acts as a "conductor" that breaks a problem into subtasks, spawns specialized "expert" personas for each subtask, collects their outputs, and verifies the result. The conductor and the experts are all the same model — the differentiation is purely through instructions.
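The conductor loop can be sketched in a few lines, assuming a generic `call_llm` function that wraps whatever chat-completion API you use. The prompts and the one-subtask-per-line decomposition format below are simplified placeholders of my own, not the paper's actual prompts.

```python
# Hypothetical sketch of the conductor pattern: one model, three roles,
# differentiated purely through instructions.
from typing import Callable, List

def run_conductor(task: str, call_llm: Callable[[str], str]) -> str:
    # 1. The conductor decomposes the problem into subtasks.
    plan = call_llm(f"Break this task into subtasks, one per line:\n{task}")
    subtasks = [line.strip() for line in plan.splitlines() if line.strip()]

    # 2. Spawn a specialized "expert" persona per subtask (same model,
    #    different instructions).
    expert_outputs: List[str] = []
    for sub in subtasks:
        expert_outputs.append(
            call_llm(f"You are an expert at this kind of subtask. Solve:\n{sub}")
        )

    # 3. The conductor collects the outputs and verifies the result.
    combined = "\n".join(expert_outputs)
    return call_llm(
        f"Task: {task}\nExpert outputs:\n{combined}\n"
        "Verify these against the task and produce the final answer."
    )
```

Note that a task with N subtasks costs N + 2 model calls, which matters for the cost discussion below.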
The structural template model — from Zhang, Yuan, and Yao's work. Instead of giving the model few-shot examples (concrete input-output pairs), you give it an abstract template for how to reason about a category of problems. The focus is on structure over content.
Both approaches share the same insight: LLMs perform better when you give them a framework for reasoning rather than a recipe for a specific task.
A concrete example
Say you want an LLM to solve math problems. The standard approach is few-shot prompting — you show it 3-5 solved problems and hope it picks up the pattern.
With metaprompting, you'd instead give it something like:
For any mathematical problem: identify the problem type and relevant domain. State the key variables and their relationships. Select and apply the appropriate solution method. Verify by substituting the answer back into the original problem.
No concrete examples needed. That single structural prompt works across algebra, geometry, calculus — because it describes how to think about math problems, not how to solve any particular one.
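In code, that structural prompt is just one reusable constant. This is an illustrative sketch (the function name and example problems are mine), but it shows the point: the template never changes, only the problem appended to it does.

```python
# Hypothetical sketch: one structural template reused across math domains.

MATH_TEMPLATE = """For any mathematical problem:
1. Identify the problem type and relevant domain.
2. State the key variables and their relationships.
3. Select and apply the appropriate solution method.
4. Verify by substituting the answer back into the original problem."""

def wrap_math_problem(problem: str) -> str:
    """Attach the same reasoning template to any math problem."""
    return f"{MATH_TEMPLATE}\n\nProblem: {problem}"

# The same template covers algebra, geometry, and calculus:
prompts = [wrap_math_problem(p) for p in (
    "Solve 2x + 3 = 11.",
    "Find the area of a triangle with base 6 and height 4.",
    "Differentiate f(x) = x^2 sin(x).",
)]
```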
Why it works
It sounds almost too simple, but the results are surprisingly strong. Suzgun and Kalai's experiments with GPT-4 showed metaprompting outperforming standard prompting by around 17% across a range of tasks. Zhang et al. demonstrated that an untuned base model with meta-prompts could match or beat fine-tuned models on math benchmarks — using fewer tokens than approaches like tree-of-thought.
The likely reason: large models already have the knowledge to solve most tasks. What they often lack is the right structure for applying that knowledge. Metaprompting provides that structure.
Where it falls short
This isn't a silver bullet. A few honest limitations:
- It needs capable models. Metaprompting works well with GPT-4 class models. With smaller models, the conductor can't reliably decompose and delegate tasks.
- It costs more per call. The conductor/expert pattern means multiple LLM invocations for a single task. More tokens, more latency, more cost.
- Novel tasks are still hard. If the model has no prior knowledge of a domain, no amount of structural scaffolding will help. It's still zero-shot under the hood.
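The cost point is easy to make concrete. For a conductor that spawns one expert per subtask, a single user request fans out into several model calls. A rough back-of-envelope (the accounting assumes the pattern sketched earlier, not any particular implementation):

```python
# Hypothetical back-of-envelope: call count for the conductor/expert pattern.

def conductor_call_count(num_subtasks: int) -> int:
    """1 decomposition call + one expert call per subtask + 1 verification call."""
    return 1 + num_subtasks + 1

# A task decomposed into 3 subtasks costs 5 LLM invocations instead of 1.
```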
How I think about it practically
When I'm building AI features, I don't use metaprompting for everything. For straightforward, well-defined tasks — extracting structured data from text, generating a summary, classifying inputs — a direct prompt is fine. Writing a meta-prompt for those would be overengineering.
Where metaprompting starts to make sense is when:
- The task varies a lot between inputs (customer support queries that range from billing to technical issues)
- You're building a general-purpose agent that needs to handle unpredictable requests
- You find yourself maintaining dozens of task-specific prompts and want to consolidate
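If you wanted to encode that decision in code, it can be as simple as a router that sends narrow, well-defined task kinds to direct prompts and everything else to a meta-prompt. The task kinds here are made up for illustration:

```python
# Hypothetical sketch: route stable, well-defined tasks to direct prompts,
# and variable or open-ended ones to a meta-prompt.

DIRECT_TASK_KINDS = {"extract", "summarize", "classify"}  # narrow, well-defined

def choose_prompt_style(task_kind: str) -> str:
    """Pick 'direct' for narrow task kinds, 'meta' for everything else."""
    return "direct" if task_kind in DIRECT_TASK_KINDS else "meta"
```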
Getting started
If you want to experiment with metaprompting, start small. Take a direct prompt you already use and try rewriting it as a structural template — describe the reasoning process instead of the specific task. Compare the outputs.
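A small A/B harness makes the comparison systematic. This is a generic sketch (the function and its parameters are my own, and `call_llm` again stands in for whatever API you use): run both prompt styles on the same inputs and eyeball the outputs side by side.

```python
# Hypothetical sketch: compare a direct prompt against its structural-template
# rewrite on the same inputs.
from typing import Callable, Dict, List

def compare_prompts(
    inputs: List[str],
    direct: Callable[[str], str],      # builds the task-specific prompt
    structural: Callable[[str], str],  # builds the meta-prompt version
    call_llm: Callable[[str], str],    # any chat-completion call
) -> List[Dict[str, str]]:
    """Run both prompt styles on each input for side-by-side comparison."""
    return [
        {
            "input": x,
            "direct": call_llm(direct(x)),
            "structural": call_llm(structural(x)),
        }
        for x in inputs
    ]
```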
For deeper reading:
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding — the Suzgun and Kalai paper, with code on GitHub
- Meta Prompting for AI Systems — the Zhang, Yuan, and Yao paper
- DAIR.AI Prompt Engineering Guide — a practical reference that covers metaprompting alongside other techniques
The mental shift is small but useful: instead of asking "what should the model do?", ask "how should the model think about this type of problem?" That reframing is the core of metaprompting.