Core Concepts

GoClaw is built around several key concepts that define how the agent operates. Understanding these will help you configure and extend the system effectively.

Agent Loop

At its heart, GoClaw runs an agent loop that orchestrates LLM interactions:

User Message → LLM → Tool Use? → Execute Tool → LLM → ... → Final Response

The loop continues until the LLM provides a final response without requesting tool use. This enables complex, multi-step reasoning where the agent can read files, execute commands, search memory, and more.
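The loop described above can be sketched in a few lines of Go. This is a minimal illustration, not GoClaw's implementation: the `Model`, `Tool`, `Reply`, and `ToolCall` types, the `RunTurn` function, and the `maxSteps` safety limit are all hypothetical names introduced for this example.

```go
package main

import "fmt"

// ToolCall is a hypothetical request from the model to run a named tool.
type ToolCall struct {
	Name string
	Args string
}

// Reply is one model response: either final text or a list of tool calls.
type Reply struct {
	Text      string
	ToolCalls []ToolCall
}

// Model and Tool are illustrative stand-ins, not GoClaw's real interfaces.
type Model func(history []string) Reply
type Tool func(args string) (string, error)

// RunTurn calls the model repeatedly, executing any requested tools, until
// the model answers without requesting a tool or maxSteps is exhausted.
func RunTurn(model Model, tools map[string]Tool, userMsg string, maxSteps int) (string, error) {
	history := []string{"user: " + userMsg}
	for step := 0; step < maxSteps; step++ {
		reply := model(history)
		if len(reply.ToolCalls) == 0 {
			return reply.Text, nil // a reply with no tool calls ends the loop
		}
		for _, call := range reply.ToolCalls {
			result := "unknown tool: " + call.Name
			if tool, ok := tools[call.Name]; ok {
				out, err := tool(call.Args)
				if err != nil {
					out = "error: " + err.Error()
				}
				result = out
			}
			// Feed the tool result back so the next model call can use it.
			history = append(history, fmt.Sprintf("tool %s: %s", call.Name, result))
		}
	}
	return "", fmt.Errorf("no final response after %d steps", maxSteps)
}
```

The step limit is an assumption added so the sketch cannot loop forever; the text above does not say whether GoClaw enforces one.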

Turns can originate from several sources: an incoming user message on a channel, a cron job, a heartbeat tick, or a memory trigger when a routine memory’s scheduled time arrives. All four feed the same agent loop on the user’s primary session. See Memory Triggers for the routine-memory case and Delegated Runs for cron/subagent origins.

Sessions

A session represents a conversation with persistent state:

Messages — The conversation history (user, assistant, tool calls)
Context window — How much history fits in the LLM’s memory
Compaction — Automatic truncation when context is nearly full
Checkpoints — Rolling snapshots for recovery

Sessions are identified by a session key (for example, primary for owner sessions or user:<id> for non-owner sessions). Channels can also pass explicit session IDs in some flows.
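As a small illustration of the key format mentioned above, the following sketch derives a session key from a user's role. The `SessionKey` function and its `isOwner` parameter are hypothetical; only the two key shapes (`primary` and `user:<id>`) come from the text.

```go
package main

// SessionKey returns the key for a user's session, following the two
// examples in the text: "primary" for owners, "user:<id>" otherwise.
// The function name and the isOwner flag are hypothetical.
func SessionKey(isOwner bool, userID string) string {
	if isOwner {
		return "primary"
	}
	return "user:" + userID
}
```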

See Session Management for details on compaction and checkpoints.

Channels

Channels are communication interfaces between users and the agent:

Channel	Description
Telegram	Bot interface via Telegram
WhatsApp	Personal WhatsApp via linked device
HTTP	Web interface and API
HTTP Voice	Real-time voice conversations
TUI	Interactive terminal UI
Cron	Scheduled task execution

Text channels use the main Gateway and LLM Registry. The voice channel uses a separate VoiceLLM Registry with per-session WebSocket connections to real-time voice APIs.

See Channels for the full overview.

Tools

Tools extend the agent’s capabilities beyond text generation:

Category	Examples
File operations	read, write, edit
System	exec (shell commands), jq
Search	memory_search, memory_get, transcript, web_search
Memory Graph	memory_graph_recall/query/store/update/forget
Orchestration	cron, subagent_spawn/status/cancel, subagent_fanout
Integration	hass (Home Assistant), browser
Communication	message (send to channels)
Utility	media, media_display, skills, goclaw_update, user_auth
Media generation	xai_imagine, xai_video

Tools are registered with the gateway and exposed to the LLM via function calling. Many tools are enabled only when the relevant configuration is present and the channel or runtime supports them (for example, the browser, Home Assistant, and subagent tools).
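One way to picture conditional enablement is a registry that filters tool definitions against a set of feature flags before exposing them to the LLM. The sketch below assumes a hypothetical `ToolSpec` type and `EnabledTools` function; GoClaw's actual registration API may look quite different.

```go
package main

import "sort"

// ToolSpec describes a tool and the feature flag it depends on.
// An empty Requires means the tool is always available.
// Both the type and its fields are hypothetical.
type ToolSpec struct {
	Name     string
	Requires string
}

// EnabledTools returns the sorted names of the tools whose required
// feature is switched on in features.
func EnabledTools(specs []ToolSpec, features map[string]bool) []string {
	var names []string
	for _, s := range specs {
		if s.Requires == "" || features[s.Requires] {
			names = append(names, s.Name)
		}
	}
	sort.Strings(names)
	return names
}
```

With `read` always on and only the `hass` feature enabled, a registry holding `read`, `browser`, and `hass` would expose `hass` and `read` to the model.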

GoClaw also supports subagents for delegated and parallel work:

subagent_spawn starts one worker and reports back later by default
subagent_status / subagent_cancel let you inspect or stop worker runs
subagent_fanout starts several workers in parallel and returns their results in the current turn
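The fanout pattern, where several workers run in parallel and the caller waits for all results, maps naturally onto goroutines. The following is a sketch of that pattern only; `Worker` and `Fanout` are hypothetical names and do not reflect how subagent_fanout is implemented.

```go
package main

import "sync"

// Worker is a hypothetical stand-in for one subagent run.
type Worker func(task string) string

// Fanout runs one worker per task concurrently and returns the results in
// the same order as tasks, blocking until every worker has finished.
func Fanout(tasks []string, run Worker) []string {
	results := make([]string, len(tasks))
	var wg sync.WaitGroup
	for i, task := range tasks {
		wg.Add(1)
		go func(i int, task string) {
			defer wg.Done()
			results[i] = run(task) // each goroutine writes only its own slot
		}(i, task)
	}
	wg.Wait()
	return results
}
```

Because each goroutine writes to a distinct slice index, no mutex is needed, and the caller gets every result within the same call, mirroring the "returns their results in the current turn" behavior described above.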

See Tools for the complete tool reference.

Subagents

Subagents are isolated worker agents for background or parallel work.

Use them when:

one task should continue separately from the main conversation
several tasks can be done in parallel
you want a run you can inspect or cancel later

This is also the system behind the runners dashboard and delegated background execution such as cron-driven work.

See Delegated Runs for the user guide to subagents, fanout, monitoring, and control.

Skills

Skills are markdown files that provide domain-specific knowledge and instructions. They extend the agent’s capabilities without code changes:

skills/
├── weather/
│   └── SKILL.md
├── discord/
│   └── SKILL.md
└── ...

Skills can declare requirements (binaries, environment variables) and are automatically filtered based on availability.
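Availability filtering of this kind can be approximated with two standard-library checks: `exec.LookPath` for binaries and `os.LookupEnv` for environment variables. The `Skill` struct and its field names below are assumptions made for illustration; the real requirement format is defined by the skills system.

```go
package main

import (
	"os"
	"os/exec"
)

// Skill lists what a skill needs before it can be offered to the agent.
// The field names are illustrative, not GoClaw's actual schema.
type Skill struct {
	Name     string
	Binaries []string
	EnvVars  []string
}

// Available reports whether every required binary is on PATH and every
// required environment variable is set.
func Available(s Skill) bool {
	for _, bin := range s.Binaries {
		if _, err := exec.LookPath(bin); err != nil {
			return false
		}
	}
	for _, name := range s.EnvVars {
		if _, ok := os.LookupEnv(name); !ok {
			return false
		}
	}
	return true
}

// FilterSkills keeps only the skills whose requirements are met.
func FilterSkills(skills []Skill) []Skill {
	var out []Skill
	for _, s := range skills {
		if Available(s) {
			out = append(out, s)
		}
	}
	return out
}
```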

See Skills for details on the skills system.

LLM Providers

GoClaw supports multiple LLM providers through a unified registry:

Provider	Use Cases
Anthropic	Agent responses (Claude), extended thinking
OpenAI	GPT models, compatible APIs
Ollama	Local inference, embeddings, summarization
xAI	Grok models, stateful conversations

The registry supports purpose chains — different providers for different tasks (agent, summarization, embeddings) with automatic fallback.
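A purpose chain with fallback can be thought of as an ordered list of providers per purpose, tried in turn until one succeeds. The sketch below assumes that interpretation; `Provider`, `Registry`, `Complete`, and `ErrUnavailable` are hypothetical names, and the real registry may select providers differently.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrUnavailable is a hypothetical error a provider returns when it
// cannot serve a request.
var ErrUnavailable = errors.New("provider unavailable")

// Provider is a hypothetical stand-in for one configured LLM backend.
type Provider struct {
	Name string
	Call func(prompt string) (string, error)
}

// Registry maps a purpose (such as "agent", "summarization", or
// "embeddings") to an ordered chain of providers.
type Registry map[string][]Provider

// Complete tries each provider in the chain for purpose, in order, and
// returns the first successful output plus the name of the provider used.
func (r Registry) Complete(purpose, prompt string) (string, string, error) {
	chain := r[purpose]
	if len(chain) == 0 {
		return "", "", fmt.Errorf("no providers configured for %q", purpose)
	}
	var lastErr error
	for _, p := range chain {
		out, err := p.Call(prompt)
		if err == nil {
			return out, p.Name, nil
		}
		lastErr = fmt.Errorf("%s: %w", p.Name, err)
	}
	return "", "", fmt.Errorf("all providers failed for %q: %w", purpose, lastErr)
}
```

Keeping one chain per purpose lets a deployment route agent responses to one provider and summarization to another while still degrading gracefully when the first choice is down.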

VoiceLLM Registry

A separate VoiceLLM Registry handles real-time voice conversations:

Provider	Description
xAI Voice	Grok-based real-time voice

Voice providers maintain per-session WebSocket connections and handle audio streaming directly.

See LLM Providers for configuration.

Memory

GoClaw has three memory systems:

Workspace Memory

Traditional markdown files that the agent can read and write:

MEMORY.md — Long-term curated memories
memory/*.md — Daily notes and logs

Semantic Memory

Embeddings-based search over memory files and conversation transcripts:

memory_search — Search memory files by meaning
transcript — Search/query past conversations
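Embeddings-based search generally works by ranking stored vectors by their similarity to a query vector. The sketch below uses cosine similarity, a common choice; it is an assumption that GoClaw ranks this way, and the `Doc` type and `Search` function are hypothetical.

```go
package main

import (
	"math"
	"sort"
)

// Doc pairs a memory snippet with its embedding vector.
type Doc struct {
	Text   string
	Vector []float64
}

// cosine returns the cosine similarity of a and b. It returns 0 when the
// lengths differ or either vector has zero magnitude.
func cosine(a, b []float64) float64 {
	if len(a) != len(b) {
		return 0
	}
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// Search returns up to k docs ordered from most to least similar to query.
// The input slice is copied, so the caller's ordering is left untouched.
func Search(docs []Doc, query []float64, k int) []Doc {
	ranked := append([]Doc(nil), docs...)
	sort.SliceStable(ranked, func(i, j int) bool {
		return cosine(ranked[i].Vector, query) > cosine(ranked[j].Vector, query)
	})
	if k < 0 {
		k = 0
	}
	if k < len(ranked) {
		ranked = ranked[:k]
	}
	return ranked
}
```

In practice the vectors would come from an embeddings provider, and a real index would avoid recomputing similarities inside the sort comparator.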

Memory Graph

A semantic knowledge graph for structured facts and relationships:

memory_graph_recall — Retrieve relevant context automatically
memory_graph_store/update/forget — Manage entities and facts
memory_graph_query — Natural language questions over the graph

Memory Graph provides structured, queryable memory that persists across sessions. It’s designed to eventually supersede file-based memory.

See Agent Memory for the memory architecture and Memory Graph for details.

Roles & Access Control

Users have roles that determine their access level:

Role	Description
`owner`	Full access to all tools and settings
`user`	Limited access based on permissions

Users authenticate via identities (Telegram ID, API key, etc.) and can have tool-specific permissions.
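The role model above reduces to a simple check: owners may use everything, and other users may use only the tools they have been granted. The `User` struct and `CanUse` function below are hypothetical and do not reflect the actual users.json schema.

```go
package main

// User is an illustrative record; the real users.json schema may differ.
type User struct {
	Role  string          // "owner" or "user"
	Tools map[string]bool // tool-specific permissions for non-owners
}

// CanUse reports whether u may call the named tool. Owners have full
// access; other users need an explicit grant for the tool.
func CanUse(u User, tool string) bool {
	if u.Role == "owner" {
		return true
	}
	return u.Tools[tool]
}
```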

See Roles for access control configuration.

Workspace

The workspace is the agent’s home directory — where it operates and stores files:

Identity files: SOUL.md, AGENTS.md, USER.md
Memory files: MEMORY.md, memory/
Skills: skills/
Configuration: goclaw.json, users.json

File operations are sandboxed to the workspace by default.
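A common way to enforce this kind of sandbox is to resolve each requested path against the workspace root and reject anything that ends up outside it. The sketch below shows that check with the standard `path/filepath` package; the `Resolve` function is hypothetical, and it deliberately ignores symlinks, which a production sandbox would also need to handle.

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// Resolve maps a requested path onto the workspace root and rejects any
// result that escapes the root. It does not follow symlinks.
func Resolve(workspace, requested string) (string, error) {
	root, err := filepath.Abs(workspace)
	if err != nil {
		return "", err
	}
	target := requested
	if !filepath.IsAbs(target) {
		target = filepath.Join(root, target)
	}
	target = filepath.Clean(target)

	// A relative path that starts with ".." points outside the root.
	rel, err := filepath.Rel(root, target)
	if err != nil {
		return "", err
	}
	if rel == ".." || strings.HasPrefix(rel, ".."+string(filepath.Separator)) {
		return "", fmt.Errorf("path %q is outside the workspace", requested)
	}
	return target, nil
}
```

Comparing via `filepath.Rel` rather than a plain string prefix avoids accepting sibling directories whose names merely begin with the workspace path.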
