Semantic search lets you find recording sessions using natural language queries. It works by generating vector embeddings from session metadata (robot name, operator, tags, labels, track names) and using pgvector for similarity matching.
Semantic search requires PostgreSQL with pgvector. Make sure you’ve completed the PostgreSQL setup first.

Install Dependencies

Install the search extras, which include sentence-transformers and the PostgreSQL driver:
uv pip install repoch[search]
Use the provided configs/search.toml:
[database]
backend = "postgres"

[database.postgres]
# password should be set via environment variable for security

[search]
enabled = true
Start the server:
REPOCH__DATABASE__POSTGRES__PASSWORD=your_secure_password repoch server --config configs/search.toml
When the server starts with search enabled, it automatically creates the pgvector extension and begins generating embeddings for new and updated sessions.
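The `REPOCH__DATABASE__POSTGRES__PASSWORD` variable above follows a common double-underscore convention for nested config keys. A minimal sketch of how such a variable might map onto a config path (the parsing logic here is an assumption for illustration, not repoch's actual implementation):

```python
# Sketch: mapping a double-underscore env var to a nested config key path.
# The parsing convention is an assumption, not repoch's actual implementation.
def env_to_config_path(name: str, prefix: str = "REPOCH") -> list[str]:
    """Split REPOCH__DATABASE__POSTGRES__PASSWORD into nested config keys."""
    head, _, rest = name.partition("__")
    if head != prefix or not rest:
        raise ValueError(f"not a {prefix} variable: {name}")
    return [part.lower() for part in rest.split("__")]


path = env_to_config_path("REPOCH__DATABASE__POSTGRES__PASSWORD")
# path == ["database", "postgres", "password"], i.e. [database.postgres] password
```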

Build the Index

To generate embeddings for existing sessions, run the reindex command against a running server:
repoch db reindex
Use --force to re-embed all sessions, even those that haven’t changed:
repoch db reindex --force

Configuration Options

All search settings are optional and have sensible defaults:
| Setting | Default | Description |
| --- | --- | --- |
| `enabled` | `false` | Enable semantic search |
| `embedding_model` | `all-MiniLM-L6-v2` | Sentence-transformer model to use |
| `device` | `cpu` | Computation device: `cpu`, `cuda`, or `auto` |
| `batch_size` | `100` | Sessions per embedding batch |
| `max_reindex_sessions` | `10000` | Maximum sessions per reindex request |
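Spelled out as a config file, a `[search]` block with every setting made explicit would look roughly like this (values mirror the defaults in the table, except `enabled`):

```toml
[search]
enabled = true                        # defaults to false
embedding_model = "all-MiniLM-L6-v2"  # sentence-transformers model name
device = "cpu"                        # "cpu", "cuda", or "auto"
batch_size = 100                      # sessions per embedding batch
max_reindex_sessions = 10000          # cap per reindex request
```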

GPU Acceleration

For faster embedding generation on machines with a CUDA-compatible GPU:
[search]
enabled = true
device = "cuda"
Set device = "auto" to use the GPU when available and fall back to CPU otherwise.
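The `auto` fallback amounts to a small selection rule. A sketch of that logic (in practice the availability check would be something like `torch.cuda.is_available()`; it is passed in here so the snippet stands alone):

```python
# Sketch: "auto" device selection with CPU fallback. The cuda_available
# flag stands in for a runtime check such as torch.cuda.is_available().
def resolve_device(setting: str, cuda_available: bool) -> str:
    if setting == "auto":
        return "cuda" if cuda_available else "cpu"
    if setting not in ("cpu", "cuda"):
        raise ValueError(f"unknown device setting: {setting}")
    return setting


assert resolve_device("auto", cuda_available=False) == "cpu"
assert resolve_device("auto", cuda_available=True) == "cuda"
```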
The default all-MiniLM-L6-v2 model is lightweight and runs well on CPU. GPU acceleration is most beneficial when reindexing large numbers of sessions.

How It Works

Embeddings are generated from session metadata including:
  • Robot name
  • Operator username
  • Tags
  • Timeslice labels
  • Track, camera, audio source and time series group names
When any of these change, the embedding is automatically regenerated. A source text hash is stored alongside each embedding to detect when sessions need re-embedding. Search queries are embedded using the same model and matched against stored embeddings using cosine distance. Results can be further filtered by robot name, tags, time range and visibility.
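The matching step can be illustrated in a few lines: cosine distance is 0 for vectors pointing the same way and approaches 2 for opposite vectors, so lower is more similar. A minimal sketch, with toy 3-dimensional vectors standing in for the 384-dimensional embeddings `all-MiniLM-L6-v2` produces:

```python
# Sketch: cosine-distance matching between a query embedding and stored
# session embeddings. Toy vectors stand in for real model output.
import math


def cosine_distance(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return 1.0 - dot / (norm_a * norm_b)


query = [0.9, 0.1, 0.0]
sessions = {
    "session-a": [0.8, 0.2, 0.1],  # similar metadata -> small distance
    "session-b": [0.0, 0.1, 0.9],  # unrelated metadata -> large distance
}
ranked = sorted(sessions, key=lambda s: cosine_distance(query, sessions[s]))
# ranked[0] == "session-a"
```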

Next Steps

Configuration System

Learn more about the full configuration system