Human-in-the-Loop: Complete Guide

What You’ll Learn

This tutorial series teaches you how to build interactive AI agents that can pause execution, ask questions, and get user approval before taking actions. You’ll learn three different approaches, from the simplest (Claude Code) to the most flexible (model-agnostic).

graph TB
    A[Choose Your Path] --> B{Requirements?}
    B -->|Quick & Simple| C[Claude Code<br/>Built-in Tool]
    B -->|OpenAI Only| D[OpenAI<br/>Function Calling + SDK]
    B -->|Multi-Model| E[Model Agnostic<br/>LangChain + OpenRouter]

    C --> F[1. Overview]
    D --> F
    E --> F

    F --> G[2. Claude Code]
    G --> H[3. OpenAI]
    H --> I[4. Model Agnostic]

Tutorial Series

1. Overview

Start here to understand the fundamentals.

Learn:

What human-in-the-loop is and why it matters
Common use cases (clarification, approval, configuration)
Architecture patterns and constraints
Comparison of different approaches
When to use HITL vs autonomous agents

Read this if: You’re new to human-in-the-loop or want to understand the big picture.

2. Claude Code Implementation

The simplest approach - zero setup required.

Learn:

How Claude Code’s built-in AskUserQuestion works
Creating interactive skills with automatic UI rendering
Design patterns (sequential, multi-select, progressive disclosure)
Best practices for questions and options
Limitations and when to use other approaches

Read this if: You’re building Claude Code skills or want the fastest way to add HITL.

Pros:

✅ Zero code - just write instructions
✅ Automatic UI rendering in terminal
✅ No state management needed

Cons:

❌ Claude Code CLI only
❌ Can’t customize UI
❌ Not portable to other platforms

3. OpenAI Implementation

Two powerful approaches for OpenAI users.

Learn:

Function Calling: Manual implementation with full control
Agents SDK: Built-in approval workflows with needsApproval
Structured Outputs for guaranteed schema compliance
Handling parallel tool calls
Error handling and validation

Read this if: You’re using OpenAI and want built-in approval features or need to understand function calling.

Pros:

✅ Provider-supported patterns
✅ Agents SDK has built-in approvals
✅ Structured Outputs guarantee compliance
✅ Good documentation and examples

Cons:

❌ OpenAI models only
❌ Still need to implement UI layer
❌ SDK adds dependency

4. Model-Agnostic Implementation

Maximum flexibility - works with any LLM provider.

Learn:

LangChain + OpenRouter architecture
Custom tool definition and execution loops
Pluggable UI handlers (CLI, web, mobile)
State management across pauses
Constraints and trade-offs
Production-ready patterns

Read this if: You need to support multiple LLM providers or build web/mobile apps.

Pros:

✅ Works with any model (Claude, GPT, Gemini, Llama, etc.)
✅ Full control over UI (terminal, web, mobile)
✅ Custom validation and logic
✅ Production-ready architecture

Cons:

❌ More code (~200+ lines)
❌ Manual state management
❌ Higher complexity

Quick Comparison

By Complexity

graph LR
    A[Simplest] --> B[Claude Code<br/>0 LOC]
    B --> C[OpenAI SDK<br/>50 LOC]
    C --> D[Model Agnostic<br/>200+ LOC]
    D --> E[Most Control]

By Use Case

Use Case	Best Approach
Claude Code skill development	Claude Code built-in
Quick prototyping	Claude Code built-in
OpenAI-only production app	OpenAI Agents SDK
Custom approval logic	OpenAI Function Calling
Web/mobile application	Model Agnostic
Multi-provider support	Model Agnostic
Maximum flexibility	Model Agnostic

Feature Matrix

Feature	Claude Code	OpenAI SDK	Model Agnostic
Setup Time	Instant	Minutes	Hours
Lines of Code	0	~50	~200+
UI Rendering	Automatic	Manual	Fully Custom
Model Support	Claude only	OpenAI only	Any model
UI Options	Terminal only	Any (DIY)	Any (DIY)
State Management	Automatic	SDK helps	Manual
Approval Flow	Manual	Built-in	Manual
Production Ready	Demos	Yes	Yes
Customization	Low	Medium	High
Portability	None	Low	High

Decision Tree

flowchart TD
    A[I need human-in-the-loop] --> B{What are you building?}

    B -->|Claude Code skill| C[Use Claude Code<br/>AskUserQuestion]
    B -->|OpenAI-only app| D{Need approvals?}
    B -->|Multi-model app| E[Use Model Agnostic<br/>LangChain + OpenRouter]
    B -->|Web/mobile app| E

    D -->|Yes| F[Use OpenAI<br/>Agents SDK]
    D -->|No| G[Use OpenAI<br/>Function Calling]

    C --> H[Tutorial 2:<br/>Claude Code]
    F --> I[Tutorial 3:<br/>OpenAI SDK]
    G --> I
    E --> J[Tutorial 4:<br/>Model Agnostic]

Architecture Comparison

Claude Code: Integrated

graph TB
    U[User] --> CC[Claude Code CLI]
    CC --> C[Claude]
    C --> AUQ[AskUserQuestion<br/>Tool Call]
    AUQ --> CC
    CC --> UI[Auto UI Render]
    UI --> U

Characteristics:

Single, integrated system
No separation between agent and UI
Automatic everything
Least flexible, easiest to use

OpenAI: Semi-Integrated

graph TB
    U[User] --> App[Your App]
    App --> OA[OpenAI API]
    OA --> GPT[GPT-4]
    GPT --> FC[Function Call]
    FC --> App
    App --> YUI[Your UI Layer]
    YUI --> U

Characteristics:

Semi-integrated (SDK helps)
Some separation (UI is yours)
Manual UI implementation
Medium flexibility, medium effort

Model-Agnostic: Fully Separated

graph TB
    U[User] --> UI[UI Layer<br/>Pluggable]
    UI --> App[Application Layer]
    App --> LC[LangChain]
    LC --> OR[OpenRouter]
    OR --> M1[Claude]
    OR --> M2[GPT-4]
    OR --> M3[Gemini]
    OR --> M4[Llama]

Characteristics:

Fully separated layers
Complete independence
Maximum flexibility
Most work, most control

Common Patterns

Pattern 1: Sequential Questions

Ask questions one after another based on previous answers.

Example: Project setup wizard

“What type of project?” → Web App
“Which framework?” → React
“Which styling?” → Tailwind CSS

Best implemented in:

✅ All three approaches
Easiest in Claude Code
Most flexible in Model Agnostic

Pattern 2: Approval Gates

Require confirmation before sensitive operations.

Example: Database operations

“Delete 10,000 records?”
“Deploy to production?”
“Send email to all users?”

Best implemented in:

✅ OpenAI Agents SDK (built-in needsApproval)
⚠️ Manual in Claude Code
⚠️ Manual in Model Agnostic

Pattern 3: Conditional Branching

Different follow-up questions based on initial answer.

Example: Setup complexity

Beginner → Use defaults, skip questions
Intermediate → Ask key questions
Advanced → Ask all configuration details

Best implemented in:

✅ All three approaches
Most elegant in Model Agnostic (full control)

Pattern 4: Multi-Select Features

Allow users to select multiple non-exclusive options.

Example: Feature selection

Select all that apply:
- ☑ Authentication
- ☑ Database
- ☐ Email
- ☑ Testing

Best implemented in:

✅ Claude Code (multiSelect: true)
✅ OpenAI (custom logic)
✅ Model Agnostic (custom logic)

Key Constraints Across All Approaches

1. Tool Calling Reliability

Not all models are equally good at tool calling:

Model	Reliability	Notes
Claude (Anthropic)	95%+	Best-in-class
GPT-4 (OpenAI)	95%+	Very reliable
Gemini (Google)	85%+	Generally good
Llama 3	60-80%	Depends on fine-tuning
Mistral	50-70%	Limited support

Implication: Stick to Claude or GPT-4 for production systems.

2. UI Separation

Only Claude Code has built-in UI. For everything else, you must:

Implement your own UI layer
Handle rendering logic
Manage user input collection
Validate responses

3. State Management

Pausing execution requires preserving conversation state:

Message history must be maintained
Tool calls must be tracked
Answers must be properly formatted
Context must flow back to the LLM

4. User Experience

Consider cognitive load:

2-4 options per question (max)
1-4 questions per interaction
Clear descriptions for each option
Progressive disclosure for complexity

Real-World Examples

Example 1: Feature Development Assistant

Scenario: Help developers implement new features by asking clarifying questions.

Best Approach: Model-Agnostic

Needs to work with multiple LLMs
Web UI for better collaboration
Complex validation logic
Integration with GitHub, Jira

Implementation:

Tutorial 4: Model-Agnostic
Web UI (Streamlit or React)
LangChain for flexibility
OpenRouter for model access

Example 2: Claude Code Skill

Scenario: Interactive database setup skill for Claude Code users.

Best Approach: Claude Code Built-in

Target audience uses Claude Code
Terminal UI is sufficient
Want zero-setup experience
Quick development

Implementation:

Tutorial 2: Claude Code
Use built-in AskUserQuestion
Write skill in Markdown
~0 lines of code

Example 3: Deployment Assistant

Scenario: ChatGPT-based deployment tool with safety checks.

Best Approach: OpenAI Agents SDK

OpenAI-only is acceptable
Need built-in approvals
Production deployments (high risk)
Standardized flow

Implementation:

Tutorial 3: OpenAI Agents SDK
Use needsApproval: true
Conditional approvals for prod
~50 lines of code

Getting Started

Path 1: Complete Beginner

Start: Overview - Understand concepts
Try: Claude Code - Simplest implementation
Explore: Run the examples and modify them
Advance: Try OpenAI or Model-Agnostic when needed

Path 2: OpenAI Developer

Start: Overview - Understand landscape
Deep Dive: OpenAI - Learn both approaches
Implement: Choose Function Calling or Agents SDK
Consider: Model-Agnostic for multi-provider support

Path 3: Production Engineer

Start: Overview - Understand options
Compare: Read all three implementation tutorials
Decide: Based on requirements (single vs multi-provider)
Implement: Model-Agnostic for maximum flexibility

Further Resources

Official Documentation

ReAct Pattern - Simple reasoning and acting loop
Plan-Execute-Verify - Production-grade agent pattern

Community Resources

Need Help?

Conceptual questions? → Start with Overview
Claude Code specific? → See Claude Implementation
OpenAI specific? → Check OpenAI Implementation
Architecture questions? → Review Model-Agnostic Implementation

Ready to start? → Begin with the Overview to understand the fundamentals!

Human-in-the-Loop: Complete Guide

What You’ll Learn

Tutorial Series

1. Overview

2. Claude Code Implementation

3. OpenAI Implementation

4. Model-Agnostic Implementation

Quick Comparison

By Complexity

By Use Case

Feature Matrix

Decision Tree

Architecture Comparison

Claude Code: Integrated

OpenAI: Semi-Integrated

Model-Agnostic: Fully Separated

Common Patterns

Pattern 1: Sequential Questions

Pattern 2: Approval Gates

Pattern 3: Conditional Branching

Pattern 4: Multi-Select Features

Key Constraints Across All Approaches

1. Tool Calling Reliability

2. UI Separation

3. State Management

4. User Experience

Real-World Examples

Example 1: Feature Development Assistant

Example 2: Claude Code Skill

Example 3: Deployment Assistant

Getting Started

Path 1: Complete Beginner

Path 2: OpenAI Developer

Path 3: Production Engineer

Further Resources

Official Documentation

Related Patterns

Community Resources

Need Help?