Memory Search

GoClaw includes semantic search over memory files using embeddings.

Overview

Memory search allows the agent to find relevant information from:

  • memory/ directory (daily notes)
  • MEMORY.md (long-term memory)
  • Custom paths you configure

Search combines:

  • Vector search (semantic similarity via embeddings)
  • Keyword search (BM25 full-text search)

Setup

1. Use the Built-In Embeddings Provider

Memory search requires an embedding model. By default, GoClaw uses the built-in local Hugot provider for semantic search:

{
  "llm": {
    "providers": {
      "hugot-local": {
        "driver": "hugot",
        "embeddingOnly": true
      }
    },
    "embeddings": {
      "models": ["hugot-local/KnightsAnalytics/all-MiniLM-L6-v2"]
    }
  }
}

The first semantic search may take longer because the model downloads on first use and is then cached under ~/.goclaw/hugot/models/.

2. Enable Memory Search

Turn on memory search in your configuration:

{
  "memory": {
    "enabled": true
  }
}

3. Configure Search Behavior (Optional)

{
  "memory": {
    "enabled": true,
    "query": {
      "maxResults": 6,
      "minScore": 0.35,
      "vectorWeight": 0.7,
      "keywordWeight": 0.3
    },
    "paths": [
      "notes/",
      "journals/"
    ]
  }
}

Alternative: Ollama for Embeddings

If you prefer Ollama for embeddings, configure it explicitly:

# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull an embedding model
ollama pull nomic-embed-text
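After pulling the model, point GoClaw at Ollama the same way the built-in provider is configured above. This is a sketch only; the driver name "ollama" and the provider key "ollama-local" are assumptions to verify against your GoClaw version:

```json
{
  "llm": {
    "providers": {
      "ollama-local": {
        "driver": "ollama",
        "embeddingOnly": true
      }
    },
    "embeddings": {
      "models": ["ollama-local/nomic-embed-text"]
    }
  }
}
```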

Configuration Options

Field                Type      Default  Description
enabled              bool      true     Enable memory search
query.maxResults     int       6        Maximum results per search
query.minScore       float     0.35     Minimum similarity score (0-1)
query.vectorWeight   float     0.7      Weight for semantic search
query.keywordWeight  float     0.3      Weight for keyword search
paths                string[]  []       Additional paths to index

How It Works

Indexing

When enabled, GoClaw indexes:

  1. All .md files in memory/ directory
  2. MEMORY.md in workspace root
  3. Files in custom paths

Files are:

  1. Read and chunked into segments
  2. Embedded via the configured embeddings provider
  3. Stored in an in-memory index
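The chunking step can be sketched as follows. This is an illustrative Python version, not GoClaw's actual Go implementation; the paragraph-based splitting and overlap size are assumptions:

```python
def chunk_text(text: str, max_chars: int = 800, overlap: int = 100) -> list[str]:
    """Split text into roughly max_chars-sized chunks at paragraph boundaries,
    carrying a small overlap so context spans chunk edges.

    A single paragraph longer than max_chars still becomes one chunk."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        if current and len(current) + len(para) + 2 > max_chars:
            chunks.append(current)
            current = current[-overlap:]  # keep the tail as overlap
        current = f"{current}\n\n{para}".strip() if current else para
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk would then be embedded and stored alongside its source file path.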

Searching

When the agent uses memory_search:

1. Query → embedding
2. Vector search (cosine similarity)
3. Keyword search (BM25)
4. Combine scores: vector * vectorWeight + keyword * keywordWeight (defaults: 0.7 / 0.3)
5. Return top N results above minScore
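The steps above can be sketched in Python. This is illustrative only, using the default weights from the configuration; how GoClaw normalizes BM25 keyword scores into the 0-1 range may differ:

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors (step 2)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def hybrid_score(vector_score: float, keyword_score: float,
                 vector_weight: float = 0.7, keyword_weight: float = 0.3) -> float:
    """Step 4: weighted combination; keyword_score assumed normalized to 0-1."""
    return vector_weight * vector_score + keyword_weight * keyword_score

def rank(candidates, max_results: int = 6, min_score: float = 0.35):
    """Step 5: candidates are (doc_id, vector_score, keyword_score) tuples;
    return the top results above the minimum score, best first."""
    scored = [(doc, hybrid_score(v, k)) for doc, v, k in candidates]
    scored = [(doc, s) for doc, s in scored if s >= min_score]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:max_results]
```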

Agent Tools

memory_search

Search memory files:

{
  "tool": "memory_search",
  "input": {
    "query": "What did we discuss about authentication?"
  }
}

Returns:

{
  "results": [
    {
      "file": "memory/2024-01-15.md",
      "content": "Discussed JWT vs session auth...",
      "score": 0.85
    }
  ]
}

memory_get

Read full content from a memory file. Use after memory_search to get complete context around a match.

{
  "tool": "memory_get",
  "input": {
    "path": "memory/2024-01-15.md"
  }
}

Optional parameters:

Field  Type    Description
path   string  Path to memory file (required)
from   int     Start reading from this line (1-indexed)
lines  int     Number of lines to read (omit for entire file)

Example reading a specific section:

{
  "tool": "memory_get",
  "input": {
    "path": "memory/2024-01-15.md",
    "from": 10,
    "lines": 20
  }
}

Memory File Format

Daily Notes (memory/YYYY-MM-DD.md)

# 2024-01-15

## Worked on
- Authentication system
- Database migrations

## Decisions
- Using JWT for API auth
- PostgreSQL for main database

## Notes
User mentioned preference for...

Long-term Memory (MEMORY.md)

# Long-term Memory

## User Preferences
- Prefers concise responses
- Uses VSCode as editor

## Project Context
- Main project: GoClaw
- Tech stack: Go, SQLite, Anthropic

## Important Decisions
- 2024-01-10: Chose SQLite over PostgreSQL for simplicity

Embedding Models

Recommended embedding models:

Model                              Dimensions  Speed    Quality
KnightsAnalytics/all-MiniLM-L6-v2  384         Fast     Good
nomic-embed-text                   768         Fast     Good
mxbai-embed-large                  1024        Medium   Better
all-minilm                         384         Fastest  Basic

Performance

Index Size

Memory usage scales with:

  • Number of files indexed
  • Chunk size
  • Embedding dimensions

Rough estimate: a 768-dim float32 embedding is 768 × 4 bytes ≈ 3 KB per chunk, plus the stored chunk text.
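As a back-of-envelope check (assuming float32 embeddings and an assumed average stored chunk text size of 500 bytes):

```python
def index_size_bytes(num_chunks: int, dims: int = 768, avg_text_bytes: int = 500) -> int:
    """Estimate in-memory index size: one float32 embedding plus stored text per chunk."""
    embedding_bytes = dims * 4  # 4 bytes per float32 dimension
    return num_chunks * (embedding_bytes + avg_text_bytes)

print(index_size_bytes(1000) / 1e6)  # 1000 chunks at 768 dims -> prints 3.572 (MB)
```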

Search Speed

Typical search: 10-50ms for small indexes (<1000 chunks)

For large indexes, consider:

  • Reducing paths scope
  • Increasing minScore threshold
  • Decreasing maxResults

Troubleshooting

“Model download is slow the first time”

The built-in hugot-local provider downloads its model on first use. Later searches reuse the cached model.

“Ollama not available”

# Check Ollama is running
curl http://localhost:11434/api/tags

# Start Ollama if needed
ollama serve

“No results found”

  1. Check files exist in memory/ or configured paths
  2. Lower minScore threshold
  3. Check query is meaningful (not too short/generic)

Slow Indexing

Embedding generation can be slow on CPU. Consider:

  • Reducing number of files indexed
  • Using a smaller model
  • Switching to a different embeddings provider

See Also