> "Intelligence without memory is just a series of disconnected thoughts."
NeuraMemory AI is an open-source, high-performance memory engine designed to give Large Language Models (LLMs) a persistent, cross-session, and model-agnostic long-term memory. It acts like a “second brain,” helping you save, organize, and find information easily, while understanding context, linking related ideas, and giving smart summaries and insights.
Presentation link: https://neuramemory-ai-ea5s372.gamma.site/
Demo Link: https://youtu.be/eSowmleQzQY?si=iYT1t2C6Sxl-blqX
Live Link: https://neura-memory-ai.vercel.app/
Currently, interacting with AI feels like meeting a brilliant person who gets a concussion every time you close the chat window. This creates three critical failures in the developer and user experience:
- **The Context Window "Token Tax":** Every time you start a new session, you have to re-feed the AI your project structure, coding preferences, or personal history. This wastes thousands of tokens and real money on redundant processing. Current context windows are growing, but they are still a "leaky bucket": once the limit is reached, the oldest (and often most important) context is discarded.
- **The Model Silo Problem:** Your "relationship" with an AI is trapped inside a single platform. If you move from Gemini to Claude or a local Llama model, you lose all previous context. There is no interoperable layer for personal or professional AI memory.
- **The "Grounding" Gap:** Generic LLMs lack "Personal Grounding." They know how to write code, but they don't know your specific project's architectural quirks unless you explicitly tell them every single time.
- **Massive Token Efficiency:** By using Retrieval-Augmented Generation (RAG) specifically for personal history, NeuraMemory can represent years of interaction in just a few hundred tokens. It mitigates the "Lost-in-the-Middle" phenomenon by ensuring only the most semantically relevant data is placed in the LLM's working memory.
- **User-Centric Sovereignty:** In a world where big tech companies want to own your "digital twin," NeuraMemory is local-first. You own your memory database. It can be hosted on your local machine or a private server, ensuring that your personal context never becomes someone else's training data.
- **Model Agnosticism:** NeuraMemory is designed with a universal API. Whether you are using a Go-based backend, a React frontend, or a CLI tool, you can hook into the same memory stream. It bridges the gap between different AI providers, making your personal context portable.
- **Hierarchical Memory Management:** Unlike simple databases, NeuraMemory distinguishes between:
  - **Episodic Memory:** Specific events (e.g., "We fixed the bug in the auth controller yesterday").
  - **Semantic Memory:** General facts (e.g., "I prefer using functional programming patterns in TypeScript").
  - **Procedural Memory:** How you like things done (e.g., "Always use snake_case for database schemas").
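The three memory tiers above can be modeled as a simple tagged record. Here is a minimal TypeScript sketch; the type and field names are illustrative, not NeuraMemory's actual schema:

```typescript
// Illustrative sketch only: these types are NOT NeuraMemory's real schema.
type MemoryKind = "episodic" | "semantic" | "procedural";

interface MemoryRecord {
  id: string;
  kind: MemoryKind;
  content: string;      // the remembered event, fact, or preference
  createdAt: Date;
  embedding?: number[]; // vector used for semantic retrieval
}

// Episodic memories are time-anchored, so a retriever might weight them by
// recency; semantic and procedural memories are treated as timeless here.
function recencyBoost(m: MemoryRecord, now: Date): number {
  if (m.kind !== "episodic") return 1;
  const ageDays = (now.getTime() - m.createdAt.getTime()) / 86_400_000;
  return 1 / (1 + ageDays); // newer events score higher
}
```

The tagged-union approach keeps all three tiers in one store while letting retrieval logic branch on `kind`.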
- Multi-Modal Interaction: Users can interact via text, links, files, and documents.
- Memory Management:
- All chats are stored as memories.
- Memories are displayed as cards on the Manage Memory page.
- Users can add, update, or delete memories easily.
- Conversational Memory: Talk to your stored memories anytime.
- Central AI Hub: Acts as a unified interface connecting multiple AI tools and services.
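To illustrate what "add, update, or delete memories" might look like programmatically, here is a hypothetical request builder in TypeScript. The endpoint path, payload shape, and auth header are assumptions for the sketch, not NeuraMemory's documented API:

```typescript
// Hypothetical sketch: the /memories path and Bearer-token auth are assumed,
// not taken from NeuraMemory's actual API reference.
const BASE = "http://localhost:3000/api/v1";

// Build the request descriptor for a memory-create call without sending it,
// so the shape can be inspected (or passed to fetch(url, init) later).
function buildCreateMemoryRequest(apiKey: string, content: string) {
  return {
    url: `${BASE}/memories`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ content }),
    },
  };
}
```

Separating request construction from transport makes the client easy to unit-test without a running server.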
```shell
make dev        # Start development environment
make dev-down   # Stop development environment
make prod-up    # Start production services
make prod-down  # Stop production services
make logs       # View logs
make clean      # Stop and remove containers
```

Equivalent raw Docker Compose commands:

```shell
# Development
docker compose -f docker-compose.yml -f docker-compose.dev.yml up --build

# Production
docker compose up -d
docker compose down

# View logs
docker compose logs -f
```

Quick start:

```shell
git clone https://github.com/Gautam7352/NeuraMemory-AI.git
cd NeuraMemory-AI
cp server/.env.example server/.env
cp client/.env.example client/.env.production
make dev
```

Endpoints:
- Frontend: http://localhost:5173
- API: http://localhost:3000
For more details, see the guides listed below.

To connect via MCP, make sure your environment files are in place:

```shell
cp server/.env.example server/.env
cp client/.env.example client/.env.production
```

Then paste the following into your Claude Desktop config:
```json
{
  "mcpServers": {
    "memories": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-remote",
        "https://neura-memory-ai.vercel.app/api/v1/mcp?apiKey={YOUR_API_KEY}"
      ]
    }
  },
  "preferences": {
    "coworkScheduledTasksEnabled": false,
    "ccdScheduledTasksEnabled": false,
    "sidebarMode": "chat",
    "coworkWebSearchEnabled": true
  }
}
```

NeuraMemory-AI is currently in active development with a stable core. It is deployed as a distributed system:
- Frontend: Hosted on Vercel for high performance and global edge delivery.
- Backend: Hosted on a dedicated GCP VM behind a reverse proxy to avoid CORS issues and performance bottlenecks.
For deep dives into how the system works and how to manage it, see:
- Developer Guide — Architecture, local setup, CI/CD, and repository map.
- Production Ops Guide — Monitoring, logs, backups, and scaling instructions.
- Server Docs — API specifications, database design, and best practices.
- Multi-Modal Interaction: Users can interact via text, links, files, and documents (including OCR support).
- Intelligent Memory Extraction: Automatically distinguishes between episodic facts (bubbles) and semantic knowledge.
- Conversational Retrieval: Talk to your memories using context-grounded RAG.
- Model Context Protocol (MCP): Native integration with AI tools like Claude for remote memory access.
- Local-First / Sovereignty: You own your database; it can be fully self-hosted.
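The context-grounded RAG retrieval mentioned above ultimately comes down to ranking stored memories by similarity to the query embedding. Here is a minimal, illustrative TypeScript sketch; `cosineSim` and `topK` are generic helpers written for this example, not NeuraMemory's actual internals:

```typescript
// Illustrative retrieval sketch: assumes embeddings already exist for each
// memory (e.g., produced by an external embedding model).
function cosineSim(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
}

// Return the k memories most similar to the query embedding; only these
// few records need to be placed into the LLM's context window.
function topK<T extends { embedding: number[] }>(
  query: number[], memories: T[], k: number,
): T[] {
  return [...memories]
    .sort((x, y) => cosineSim(query, y.embedding) - cosineSim(query, x.embedding))
    .slice(0, k);
}
```

Because only the top-k records are injected into the prompt, years of history cost only a few hundred tokens per request.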
...