Memory for AI Agents
(Claude, Copilot, and more)

PersistQ gives your agents long-term memory with local embeddings, pgvector, and Claude MCP — no OpenAI keys, no vector DB setup.

Free: 500 memories, 5,000 API calls/month. No credit card required.

For Developers

  • Simple REST API and official SDKs (TypeScript, Python)
  • JSON metadata, tags, groups for flexible querying
  • Works with any stack that can do HTTP

For AI Agents

  • Perfect for Claude Code and Copilot via MCP
  • Store conversation history, decisions, and user preferences
  • Hybrid semantic + keyword search tuned for agent memory, not generic search

For Privacy-Sensitive Teams

  • Embeddings generated locally with Transformers.js; nothing sent to OpenAI/Anthropic
  • Data stored in standard PostgreSQL + pgvector; no proprietary vector DB lock-in
  • No external dependencies on AI providers for core functionality

See PersistQ in Action

45-second demo

One prompt to set up. Watch how PersistQ stores, searches, and retrieves memories in real time.

Built for Developers, Optimized for AI

Everything you need to add semantic memory to your AI agents.

No external API calls for embeddings

Generate embeddings locally with Transformers.js (384d). Your data never leaves your infrastructure.

Find the right memory instantly with lightning-fast semantic search

Powered by pgvector with cosine similarity. Query your memories in natural language and get results ranked by semantic relevance.
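To make the cosine-similarity ranking concrete, here is a minimal, self-contained sketch. It is not PersistQ's internal code: the function names and the toy 3-dimensional vectors are illustrative assumptions (real embeddings here are 384-dimensional), and in pgvector the equivalent comparison is done in SQL with the cosine distance operator `<=>`, which returns 1 minus the similarity computed below.

```typescript
// Illustrative sketch only: names and data are hypothetical, not part of the PersistQ SDK.

interface StoredVector {
  id: string;
  embedding: number[];
}

// Cosine similarity: dot(a, b) / (|a| * |b|). Ranges from -1 to 1.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error(`dimension mismatch: ${a.length} vs ${b.length}`);
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // Convention chosen for this sketch: a zero vector is similar to nothing.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the ids of the `limit` stored vectors closest to the query, best match first.
function rankBySimilarity(
  query: number[],
  items: StoredVector[],
  limit: number
): string[] {
  return items
    .map((item) => ({ id: item.id, score: cosineSimilarity(query, item.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, limit)
    .map((x) => x.id);
}

// Toy data: 3 dimensions instead of 384 to keep the example readable.
const stored: StoredVector[] = [
  { id: 'dark-mode-preference', embedding: [0.9, 0.1, 0] },
  { id: 'billing-address', embedding: [0, 1, 0] },
  { id: 'ui-font-size', embedding: [0.5, 0.5, 0] },
];

const topTwo = rankBySimilarity([1, 0, 0], stored, 2);
console.log(topTwo);
```

Because cosine similarity ignores vector length and compares direction only, two memories phrased at very different lengths can still score as close matches.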

Combine semantic and exact matching for the most relevant results

Hybrid keyword + vector search helps surface the right memories whether the query is fuzzy or exact.
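One common way to combine the two signals is a weighted blend of a vector score and a keyword-overlap score. The sketch below illustrates that general idea under stated assumptions. It is not PersistQ's actual ranking: the `alpha` weight, the tokenizer, and every name in the block are hypothetical.

```typescript
// Illustrative sketch only: PersistQ's real scoring is not documented here.

interface MemoryRecord {
  id: string;
  content: string;
  embedding: number[];
}

// Lowercase and split on anything that is not a letter or digit.
function tokenize(text: string): string[] {
  return text.toLowerCase().split(/[^a-z0-9]+/).filter(Boolean);
}

// Fraction of query terms that appear verbatim in the content (0 to 1).
function keywordScore(query: string, content: string): number {
  const terms = tokenize(query);
  if (terms.length === 0) return 0;
  const words = new Set(tokenize(content));
  const hits = terms.filter((t) => words.has(t)).length;
  return hits / terms.length;
}

// Cosine similarity between two equal-length vectors.
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let na = 0;
  let nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return na === 0 || nb === 0 ? 0 : dot / Math.sqrt(na * nb);
}

// Blend both signals. alpha = 1 is pure vector search, alpha = 0 is pure keyword search.
function hybridSearch(
  queryText: string,
  queryEmbedding: number[],
  memories: MemoryRecord[],
  alpha = 0.5 // hypothetical default weight
): string[] {
  return memories
    .map((m) => ({
      id: m.id,
      score:
        alpha * cosine(queryEmbedding, m.embedding) +
        (1 - alpha) * keywordScore(queryText, m.content),
    }))
    .sort((x, y) => y.score - x.score)
    .map((x) => x.id);
}

// Toy 2-dimensional embeddings stand in for real 384-dimensional ones.
const exampleMemories: MemoryRecord[] = [
  { id: 'm1', content: 'User prefers dark mode', embedding: [1, 0] },
  { id: 'm2', content: 'Invoice INV-2041 was refunded', embedding: [0, 1] },
];

// The query embedding leans toward m1, but the exact terms only occur in m2.
console.log(hybridSearch('invoice 2041', [0.9, 0.1], exampleMemories));
```

In this toy data the exact identifier match lifts `m2` above `m1` even though the query embedding points closer to `m1`, which is the case where keyword matching earns its place next to vector search.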

Fast enough for real-time agent loops

With an average search latency of ~200ms, memory lookups fit inside interactive agent loops. Optimized for production workloads at scale.

Claude Code
> One prompt, instant setup
"Add PersistQ memory to my AI agent using MCP"
✓ MCP server configured
✓ Memory tools available
✓ Ready to use in seconds

Works with AI tools.
MCP ready.

Integrates seamlessly with Claude Code and GitHub Copilot CLI via Model Context Protocol. Simple setup, powerful memory.

  • Claude Code integration
    Full MCP support with tools and resources
  • GitHub Copilot CLI
    MCP tools integration available
  • Quick setup
    Install globally and configure in minutes

Three ways to integrate

Choose the integration that works best for your workflow

TypeScript
import { createClient } from 'persistq-sdk';

const client = createClient({
  baseUrl: 'https://api.persistq.com',
  apiKey: process.env.PERSISTQ_API_KEY,
});

// Store a memory
await client.createMemory(
  'User prefers dark mode',
  'preferences',
  { tags: ['ui', 'settings'] }
);

// Search memories
const results = await client.searchMemories(
  'user preferences',
  { limit: 10 }
);

Use Cases

Add persistent memory to any AI application

🤖

AI Assistants

Give your assistant long-term memory of preferences, tasks, and interactions.

Example: Claude remembers your tech stack (Next.js + PostgreSQL + Tailwind) and stops suggesting React class components or MongoDB.

⚙️

Customer Support Agents

Add a persistent knowledge layer to multi-step agents.

Example: Your AI support agent remembers each user's last 5 issues and doesn't ask the same onboarding questions again.

💡

Custom AI Apps

Store contextual data, notes, summaries, and conversation history.

🔍

RAG Enhancements

Improve retrieval quality with structured semantic memory.

From the Founder

PersistQ is new — but it was built from real frustration with privacy concerns, overly complex vector databases, and the need to send data to third parties for embeddings.

Developers deserve a memory layer that is fast, private, simple, and easy to integrate.

PersistQ gives you semantic memory with local embeddings, no vendor lock-in, and full Claude MCP support. No external API calls for embeddings means your data never leaves your infrastructure.

If you're building an AI agent, PersistQ is designed for you.

Simple, transparent pricing

Start free. Scale as you grow. No hidden fees.

Hobby: Free

  • API calls: 5,000/mo
  • Memories: 500
  • Storage: 25MB
  • Basic semantic search
  • Local embeddings
  • Community support only

Builder: $5/mo

  • API calls: 50K/mo
  • Memories: 2,500
  • Storage: 250MB
  • Everything in Hobby
  • Advanced search filters
  • Email support (best-effort)
  • 99% uptime SLA

RAG Apps (Most Popular): $12/mo

  • API calls: 500K/mo
  • Memories: 25,000
  • Storage: 5GB
  • Everything in Builder
  • Priority email support (24-48h)
  • Advanced analytics
  • 99.9% uptime SLA

Team / Scale: $29/mo

  • API calls: 2M/mo
  • Memories: 100K
  • Storage: 50GB
  • Everything in RAG Apps
  • Priority support + SLA
  • Advanced analytics
  • 99.9% uptime guaranteed

No credit card required
Cancel anytime
14-day money-back guarantee

All plans include zero embedding costs and local processing
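As a quick way to reason about which tier fits a workload, the published limits can be encoded as data and filtered. This is a rough sketch and not an official calculator: the type and function names are made up for illustration, the prices are the monthly figures listed on this page, and converting GB to MB at 1 GB = 1024 MB is an assumption.

```typescript
// Illustrative sketch only: limits copied from the pricing list on this page.

interface Plan {
  name: string;
  pricePerMonthUsd: number;
  apiCallsPerMonth: number;
  memories: number;
  storageMb: number;
}

const PLANS: Plan[] = [
  { name: 'Hobby', pricePerMonthUsd: 0, apiCallsPerMonth: 5_000, memories: 500, storageMb: 25 },
  { name: 'Builder', pricePerMonthUsd: 5, apiCallsPerMonth: 50_000, memories: 2_500, storageMb: 250 },
  // Assumption: 1 GB = 1024 MB.
  { name: 'RAG Apps', pricePerMonthUsd: 12, apiCallsPerMonth: 500_000, memories: 25_000, storageMb: 5 * 1024 },
  { name: 'Team / Scale', pricePerMonthUsd: 29, apiCallsPerMonth: 2_000_000, memories: 100_000, storageMb: 50 * 1024 },
];

interface ExpectedUsage {
  apiCallsPerMonth: number;
  memories: number;
  storageMb: number;
}

// Cheapest plan whose limits cover every dimension of the expected usage,
// or undefined if the usage exceeds the largest listed plan.
function cheapestPlan(usage: ExpectedUsage): Plan | undefined {
  return PLANS
    .filter(
      (p) =>
        p.apiCallsPerMonth >= usage.apiCallsPerMonth &&
        p.memories >= usage.memories &&
        p.storageMb >= usage.storageMb
    )
    .sort((a, b) => a.pricePerMonthUsd - b.pricePerMonthUsd)[0];
}

// Example: an agent making ~1,000 calls/day that keeps about 2,000 memories.
const suggestion = cheapestPlan({
  apiCallsPerMonth: 1_000 * 30,
  memories: 2_000,
  storageMb: 100,
});
console.log(suggestion?.name);
```

Note that the tightest limit decides the tier: a project with few API calls but several thousand memories still needs a plan with the higher memory cap.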

Built for Scale

Powerful Features

Everything you need to build AI applications with persistent memory

Performance
Fast and reliable for real-time applications

  • Search Speed: ~200ms (average search response)
  • Scalability: Millions (memories per project)
  • Availability: 99.9% (uptime guarantee)
  • Processing: Local (no external dependencies)

Memory Features
Powerful capabilities for AI applications

  • Storage: Flexible (JSON metadata support)
  • Search: Semantic (understands meaning and context)
  • Operations: Full CRUD (create, read, update, delete)
  • Projects: Unlimited (organize by topic/use case)

Privacy & Security
Your data stays private and secure

  • Encryption: AES-256 (data protection at rest)
  • Processing: Local (embeddings never leave your system)
  • Compliance: GDPR Ready (privacy by design)
  • Control: Full (your data, your rules)

Developer Tools
Everything you need to get started quickly

  • TypeScript SDK: Included (full type support)
  • REST API: Simple (standard HTTP endpoints)
  • Documentation: Complete (guides and examples)
  • Integration: Easy (works with any tech stack)

Ready to add memory to your AI?

Start building with PersistQ today. No credit card required.

No external API calls
Privacy-first
Claude MCP ready