Back to AI Failure Dictionary
AI Failure Dictionary

Agent & Prompting Failures

Agent & Prompting Failures terms and explanations from the AI Failure Dictionary.

36 terms in this chapter
01

Ambiguous Prompt

Definition

The instruction is unclear, causing poor output.

Solution

Specify the goal, context, constraints, audience, and desired format.

02

Context Overload

Definition

The prompt contains too much information for the model to use effectively.

Solution

Remove noise and prioritize the most relevant context.

03

Context Window Overflow

Definition

The input exceeds the model's maximum context length.

Solution

Use chunking, summarization, retrieval, or hierarchical processing.

04

Few-Shot Failure

Definition

Examples in the prompt confuse the model or teach the wrong pattern.

Solution

Use clear, consistent, relevant examples and test them across cases.

05

Instruction Dilution

Definition

Important instructions are weakened by too much surrounding text.

Solution

Place critical rules clearly and use structured sections.

06

Prompt Injection

Definition

A user or retrieved document tries to override trusted instructions.

Solution

Use instruction hierarchy, content isolation, filtering, and output validation.

07

Prompt Leakage

Definition

Hidden or private prompt instructions are exposed.

Solution

Avoid sensitive prompt content and filter outputs for prompt disclosure.

08

Prompt Sensitivity

Definition

Small wording changes cause large output differences.

Solution

Test prompt variants and use robust templates.

09

Prompt Overfitting

Definition

The prompt works only for narrow examples and fails on real cases.

Solution

Evaluate prompts against diverse, realistic, and adversarial examples.

10

Prompt Contradiction

Definition

The prompt contains conflicting rules.

Solution

Simplify the prompt and prioritize instructions clearly.

11

Prompt Ambiguity

Definition

The model cannot tell which instruction matters most.

Solution

Use hierarchy, numbered requirements, and examples.

12

Prompt Fragility

Definition

The prompt works in demos but breaks with real users.

Solution

Use prompt evaluation sets and continuous prompt improvement.

13

Planning Failure

Definition

The agent creates a poor, incomplete, or impossible plan.

Solution

Use planning constraints, examples, plan validation, and step limits.

14

Goal Drift

Definition

The agent shifts away from the original objective.

Solution

Restate the goal and check progress after each step.

15

Agent Loop

Definition

The agent keeps repeating steps without finishing the task.

Solution

Add loop limits, stopping rules, and escalation paths.

16

Infinite Retry Loop

Definition

The agent keeps retrying failed actions without escalation.

Solution

Use retry limits, fallback strategies, and failure escalation.

17

Action Failure

Definition

The agent chooses or performs the wrong action.

Solution

Add action validation, safer tool APIs, and approval gates.

18

Tool Selection Failure

Definition

The agent fails to choose the best available tool.

Solution

Improve tool descriptions, routing logic, examples, and tool-use evaluation.

19

Tool Execution Failure

Definition

A tool call fails because of bad input, timeout, or system error.

Solution

Use input validation, retries, error handling, and timeout policies.

20

Tool Misuse

Definition

The agent uses a tool incorrectly.

Solution

Use strict schemas, tool-specific guardrails, and test cases.

21

Tool Output Misinterpretation

Definition

The agent reads the tool result incorrectly.

Solution

Use structured tool outputs and verification prompts.

22

Tool Permission Failure

Definition

The agent tries to use a tool it is not allowed to use.

Solution

Apply least privilege and permission checks before tool calls.

23

Memory Corruption

Definition

Stored memory becomes inaccurate, outdated, or misleading.

Solution

Use memory validation, expiration, review, and user control.

24

Memory Leakage

Definition

Private or irrelevant memory appears in an answer.

Solution

Use memory access rules, privacy filters, and memory scoping.

25

Long-Term Memory Drift

Definition

Saved knowledge becomes less accurate over time.

Solution

Refresh, expire, or revalidate memory regularly.

26

Memory Retrieval Failure

Definition

The agent fails to retrieve useful memory.

Solution

Improve memory indexing, relevance scoring, and retrieval tests.

27

Verification Failure

Definition

The agent does not check whether its answer or action is correct.

Solution

Add verifier steps before final output or external action.

28

Reflection Failure

Definition

The agent reviews its work but fails to catch mistakes.

Solution

Use external checks, automated tests, or human review.

29

Autonomy Failure

Definition

The agent takes unsafe or unnecessary actions without supervision.

Solution

Add approval gates, scope limits, and risk-based controls.

30

Unbounded Autonomy

Definition

The agent has too much freedom and too few limits.

Solution

Use permission boundaries, budgets, step limits, and human approval for risky actions.

31

Permission Boundary Failure

Definition

The agent accesses or changes resources it should not.

Solution

Apply least-privilege access and enforce tool permissions.

32

State Management Failure

Definition

The agent loses track of task state, progress, or decisions.

Solution

Use explicit state tracking, checkpoints, and structured task memory.

33

Multi-Agent Coordination Failure

Definition

Multiple agents disagree, duplicate work, or block each other.

Solution

Define clear roles, shared state, coordination protocols, and conflict resolution.

34

Handoff Failure

Definition

One agent passes incomplete or incorrect context to another agent.

Solution

Use structured handoff summaries and required context fields.

35

Human Escalation Failure

Definition

The agent should ask for human help but continues alone.

Solution

Use uncertainty thresholds, risk rules, and escalation triggers.

36

Tool Chain Failure

Definition

One failed tool call breaks the entire workflow.

Solution

Design recovery paths, partial success handling, and fallback tools.

Explore more chapters or test your knowledge with quizzes.