Skip to Main Content
Labor Efficiencype-citation-116P2

Automated prompt optimisation outperforms human-written prompts.

APO-generated prompts outperformed human…APO-generated prompts outperformed human expert prompts by 3-8% on BIG-Bench Hard tasks, while requiring zero human iteration time.

Context & Methodology

Automatic prompt optimisation uses the LLM itself to generate, evaluate, and refine prompts — eliminating the trial-and-error cycle of manual prompt engineering.

Applies To

openaianthropicgoogle

Confidence Level

Medium

Implementation Effort

medium

Recommendation

test

Execution Priority

P2

Put This Evidence to Work

Use the STCO framework to implement findings like this in structured, testable prompts.

Marking user-provided text with special delimiters and encoding transformations reduced injection attack success from 56.Hines et al., 'Defending Against Indirect Prompt I…