With the rise of AI Agents and personalized AI, AI Memory (AI memory systems) is gradually becoming a hot topic. Many projects are beginning to experiment with building long-term memory for AI, enabling models to remember user information, historical behavior, and long-term preferences.

However, most current AI memory projects still rely on a relatively simple architecture:

Vector Store + Embedding Retrieval

For example, common implementations include:

Generating embeddings from user conversations or actions
Storing embeddings in a vector database
Generating a query embedding when a new request arrives
Finding the top-k relevant memories via similarity search

This architecture does solve the problem of "historical information retrieval," but it functions more like a searchable log system than a true memory system.

The truly difficult problems actually center around three aspects:

Memory Compaction
Memory Evolution
Memory Conflict Resolution

These three problems determine whether AI memory can evolve from simple "log retrieval" into a genuine "long-term knowledge system."

1. Memory Compaction: Compressing Memories#

The Problem: Memory Grows Indefinitely#

If a system simply stores every conversation in a vector store, the memory size will quickly balloon.

For example, if a user expresses similar opinions multiple times:

I like sushi
I love sushi
Sushi is my favorite food
I enjoy eating sushi

A naive system would save 4 or even more memories.

But the truly valuable memory is just one:

User likes sushi

Therefore, a memory system must possess a capability:

Compress a large number of raw interactions into a higher-level knowledge representation.

Analogy to Database Systems#

This problem is actually very similar to LSM-tree compaction in databases.

Data in a database typically follows this pattern:

event log → compaction → snapshot

Raw logs are compacted into a higher-level state.

AI memory is similar:

raw interactions → memory compaction → structured knowledge

For example:

Raw interactions:

User: I moved to Seattle
User: The weather in Seattle is rainy
User: I like living here

Compacted memory:

User lives in Seattle

Technical Challenges#

Memory compaction is far more than simple summarization.

1. Abstraction Level Problem

Suppose the system observes:

User likes sushi
User likes ramen
User likes pizza

Should the system generate:

User likes food

or:

User likes Japanese food

How to automatically decide the abstraction level is a very difficult problem.

2. When to Perform Compaction

There are two common strategies:

Periodic Compaction For example, perform compression after accumulating N memories.
Similarity-Based Trigger Trigger compaction when the system discovers a cluster of memories in the embedding space.

3. Avoiding Information Loss

The compression process can lead to incorrect abstraction.

For example:

User likes sushi
User is allergic to shellfish

If compressed into:

User likes seafood

This is clearly wrong.

Therefore, memory compaction must be performed very carefully.

2. Memory Evolution: The Evolution of Memories#

Human memory is not static; it constantly updates over time.

For example:

2023: User lives in New York
2024: User moved to Seattle

The system must understand:

New York → outdated information
Seattle → current information

But a vector store does not possess this capability; it is append-only.

The Nature of Memory#

A vector store is more like:

append-only log

Whereas a true memory system needs:

state machine

In other words, memory must support updates and evolution.

Key Problems#

1. Fact Updates

For example:

User favorite language: Python

Later the user says:

I switched to Rust.

What the system should do is:

update memory

Not simply add a new memory.

2. Temporal Dimension

Memory usually needs to include:

timestamp
confidence
validity window

For example:

User lives in NYC (2019–2024)
User lives in Seattle (2024–)

This allows the system to correctly infer the current state.

3. Long-Term vs. Short-Term Memory

Not all memories are valid long-term.

For example:

Long-term stable:

User likes sushi

Short-term information:

User is traveling in Tokyo

In cognitive science, this is typically divided into:

Episodic Memory
Semantic Memory

AI memory systems often need a similar hierarchical structure.

3. Memory Conflict Resolution: Resolving Memory Conflicts#

This is one of the most difficult problems in AI memory.

Because memories can easily contradict each other.

For example:

Memory A
User is vegetarian

Memory B
User likes steak

The system must decide: which one is correct?

Sources of Conflict#

1. Changes in User Behavior

User was vegetarian
User is no longer vegetarian

2. Inconsistent User Statements

User: I hate Python
User: Python is actually great

3. Model Inference Errors

An LLM might infer an incorrect memory based on context.

Common Resolution Strategies#

1. Time Priority (Latest Wins)

The latest information takes precedence:

2023: vegetarian
2024: eats meat

The system adopts the 2024 state.

But this strategy is not always correct.

2. Confidence Mechanism

Memories can be accompanied by:

confidence score

For example:

User explicitly said → high confidence
LLM inference → low confidence

In case of conflict, prioritize the memory with higher confidence.

3. Source Tracking

Record the source of a memory:

source = user_statement
source = inference
source = system

In case of conflict, prioritize direct user statements.

4. Multi-Version Memory

Another strategy is to retain multiple temporal versions:

User was vegetarian (2018–2023)
User eats meat (2023–)

This allows the system to use different memories in different temporal contexts.

Why Are These Three Problems So Difficult?#

Because AI memory is actually not a simple retrieval system, but a knowledge management system.

A vector database solves:

similarity retrieval

Whereas a memory system needs to solve:

knowledge representation
knowledge evolution
knowledge conflict resolution

This is more akin to building a:

Knowledge Graph
Database System
Reasoning Engine

Not just an embedding index.

A More Complete AI Memory Architecture#

A mature AI memory system typically requires the following structure:

Raw interactions
      │
      ▼
Memory extraction (LLM)
      │
      ▼
Structured memory store
      │
      ├── compaction
      ├── evolution
      └── conflict resolution
      │
      ▼
Retrieval layer

Here, the memory store could be:

Graph Database
Document Store
Relational Database

Not just a vector database.

Conclusion#

Currently, many AI memory projects (e.g., mem0) have realized:

Memory ≠ Retrieval

They are beginning to explore:

memory extraction
memory scoring
memory updating

But overall, this still represents only the first generation of AI memory systems.

Truly mature AI memory will likely be much closer to a continuously evolving knowledge system—capable of compressing experience, updating facts, and resolving contradictions.

And the core problem behind this is really just one:

How should we represent "memory" itself?

Agent Cookbook

The Real Challenge of AI Memory: Why Vector Store + Embedding is Far From Enough

1. Memory Compaction: Compressing Memories#

The Problem: Memory Grows Indefinitely#

Analogy to Database Systems#

Technical Challenges#

2. Memory Evolution: The Evolution of Memories#

The Nature of Memory#

Key Problems#

3. Memory Conflict Resolution: Resolving Memory Conflicts#

Sources of Conflict#

Common Resolution Strategies#

Why Are These Three Problems So Difficult?#

A More Complete AI Memory Architecture#

Conclusion#