Chat Memory Enhancements #2890

ThomasVitale · 2025-04-25T15:54:17Z

ChatMemory will become a generic interface to implement different memory management strategies. It’s been moved from the “”spring-ai-client-chat” package to “spring-ai-model” package while retaining the same package, so it’s transparent to users.
A MessageWindowChatMemory has been introduced to provide support for a chat memory that keeps at most N messages in the memory.
A ChatMemoryRepository interface has been introduced to support different storage strategies for the chat memory. It’s meant to be used as part of a ChatMemory implementation. This is different than before, where the storage-specific implementation was directly tied to the ChatMemory. This design is familiar to Spring users since it’s used already in the ecosystem. The goal was to use a programming model similar to Spring Session and Spring Data.
The JdbcChatMemory has been supersed by JdbcChatMemoryRepository.
A ChatMemory bean is auto-configured for you whenever using one of the Spring AI Model starters. By default, it uses the MessageWindowChatMemory implementation and stores the conversation history in memory. If a different repository is already configured (e.g., Cassandra, JDBC, or Neo4j), Spring AI will use that instead.
First-class documentation has been introduced to describe the ChatMemory API and related features.
All the changes introduced in this PR are backward-compatible.

ThomasVitale · 2025-04-25T16:18:57Z

@leijendary we are introducing a new ChatMemoryRepository API to decouple the storage mechanism (JDBC, Cassandra...) from the memory management strategy (last N messages, last N tokens...). In this pull request, as the first mover, I have extended the JDBC chat memory support to use the new APIs. The existing ones are still there, but deprecated. Since you implemented the initial JDBC chat memory support, it would be great if you could have a look at the JDBC changes in this PR and share your feedback. Thank you!

The new documentation explains how these new APIs will work.

The upgrade notes explains how to migrate to the new APIs.

markpollack · 2025-04-25T18:39:12Z

spring-ai-model/src/main/java/org/springframework/ai/chat/memory/ChatMemoryRepository.java


-	void add(String conversationId, List<Message> messages);
+	List<Message> findMessages(String conversationId);


findByConversationId instead? just thinking of spring data repo conventions for finders.

markpollack · 2025-04-25T18:39:24Z

spring-ai-model/src/main/java/org/springframework/ai/chat/memory/ChatMemoryRepository.java


-	void clear(String conversationId);
+	void deleteMessages(String conversationId);


deleteByConversationId ?

markpollack · 2025-04-25T18:39:57Z

spring-ai-model/src/main/java/org/springframework/ai/chat/memory/ChatMemoryRepository.java


-	List<Message> get(String conversationId, int lastN);
+	void saveMessages(String conversationId, List<Message> messages);


markpollack · 2025-04-25T18:42:28Z

...ory-jdbc/src/main/java/org/springframework/ai/chat/memory/jdbc/JdbcChatMemoryRepository.java

+				case USER -> new UserMessage(content);
+				case ASSISTANT -> new AssistantMessage(content);
+				case SYSTEM -> new SystemMessage(content);
+				case TOOL -> null;


is returning null ok here?

why not accomodate tool messages so that if the lower level apis are currently used to store them, they can be retrieved?

Makes sense! I kept the original logic from JdbcChatMemory, but it does make sense to handle tool messages nicely. I'll do that.

I changed it to return an empty ToolResponseMessage, because the storage saves the content of "getText()" and that method returns always an empty string for ToolResponseMessage.

We need a separate enhancement if we want to store the list of tool call results as well.

markpollack · 2025-04-25T18:43:39Z

...ory-jdbc/src/main/java/org/springframework/ai/chat/memory/jdbc/JdbcChatMemoryRepository.java

+ */
+public class JdbcChatMemoryRepository implements ChatMemoryRepository {
+
+	private static final String QUERY_GET_IDS = """


as with vector store schemas, in the future we need to let users customize their schema, e.g. table and row names.

Agreed. For now, I kept the same logic as we have in JdbcChatMemory, but we do need the possibility to choose a custom table name and customise queries.

markpollack · 2025-04-25T19:02:04Z

...ory-jdbc/src/main/java/org/springframework/ai/chat/memory/jdbc/JdbcChatMemoryRepository.java

+
+	private static final String QUERY_GET_IDS = """
+			SELECT conversation_id FROM ai_chat_memory
+			""";


select distinct instead otherwise get duplicates?

Done. This comment made me realise the current SQL schema definition do not define a primary key. I fixed that as well.

Postponed the primary key discussion for a separate task.

markpollack · 2025-04-25T19:12:44Z

spring-ai-model/src/main/java/org/springframework/ai/chat/memory/MessageWindowChatMemory.java

+
+		boolean hasNewSystemMessage = newMessages.stream()
+			.filter(message -> message instanceof SystemMessage)
+			.anyMatch(message -> !memoryMessages.contains(message));


this is a loop within a loop.

Set<Message> memoryMessagesSet = new HashSet<>(memoryMessages); boolean hasNewSystemMessage = newMessages.stream() .filter(SystemMessage.class::isInstance) .anyMatch(message -> !memoryMessagesSet.contains(message));

makes it more efficient

Is it correct that the UserMessage object equals/hashcode doesn't take into account media? we aren't storing it, so yes, but prob it needs to be updated?

About the UserMessage, I guess it's not correct that we are not considering the media. Agreed that it should be updated.

For SystemMessages, that works fine since it only has text. But for other message types, we should probably override the equals/hashcode inherited from the AbstractMessage.

* ChatMemory will become a generic interface to implement different memory management strategies. It’s been moved from the “”spring-ai-client-chat” package to “spring-ai-model” package while retaining the same package, so it’s transparent to users. * A MessageWindowChatMemory has been introduced to provide support for a chat memory that keeps at most N messages in the memory. * A ChatMemoryRepository interface has been introduced to support different storage strategies for the chat memory. It’s meant to be used as part of a ChatMemory implementation. This is different than before, where the storage-specific implementation was directly tied to the ChatMemory. This design is familiar to Spring users since it’s used already in the ecosystem. The goal was to use a programming model similar to Spring Session and Spring Data. * The JdbcChatMemory has been supersed by JdbcChatMemoryRepository. * A ChatMemory bean is auto-configured for you whenever using one of the Spring AI Model starters. By default, it uses the MessageWindowChatMemory implementation and stores the conversation history in memory. If a different repository is already configured (e.g., Cassandra, JDBC, or Neo4j), Spring AI will use that instead. * First-class documentation has been introduced to describe the ChatMemory API and related features. * All the changes introduced in this PR are backward-compatible. Signed-off-by: Thomas Vitale <[email protected]>

ThomasVitale · 2025-04-25T20:24:05Z

Delivered as 0024e4d

linarkou · 2025-04-28T10:46:35Z

@ThomasVitale thank you for the new api, really fine!
But I think fetching all rows from the database, loading it to memory and then keeping only last N seems to be not very efficient - maybe better to keep an ability to fetch only N rows in ChatMemoryRepository?

Moreover, for inserting single message we do need to 1) select all, 2) delete all and 3) insert all, is it right? Seems also not efficient..

And one more thing. I see that current solution leaves only one system message in history. But currently PromptChatMemoryAdvisor and MessageChatMemoryAdvisor save only user and assistant messages. Does there any plans to save all messages to chatMemory, not only user/assistant?

linarkou · 2025-04-28T11:16:18Z

...l-chat-memory-jdbc/src/main/java/org/springframework/ai/chat/memory/jdbc/JdbcChatMemory.java

 public class JdbcChatMemory implements ChatMemory {

 	private static final String QUERY_ADD = """
 			INSERT INTO ai_chat_memory (conversation_id, content, type) VALUES (?, ?, ?)""";

 	private static final String QUERY_GET = """
-			SELECT content, type FROM ai_chat_memory WHERE conversation_id = ? ORDER BY "timestamp" DESC LIMIT ?""";


It wasn't correct to remove descending order here - now it will always fetch only first N rows instead of last N.
I'll fix message order in #2781.

Thanks for the feedback! The ChatMemoryRepository as designed now keeps in storage only the messages allowed by the specific ChatMemory strategy. Right now, the only built-in strategy is MessageWindowChatMemory, which keeps at most M messages. The logic for sorting out which messages to keep/remove is handled within ChatMemory before storing them, whereas ChatMemoryRepository stores exactly what the ChatMemory defines.

When you call JdbcChatMemoryRepository.findByConversationId(), you want to return all the messages because those are the ones determined as relevant to keep in memory. That's why I removed the "DESC". And that's why the lastN() method is deprecated and will be removed (besides leading to wrong results). When you call JdbcChatMemoryRepository.saveAll(), you practically overwrite the current list of messages with the new list (which has been processed already based on a given memory management strategy, such as # of messages or # of tokens).

If the message window is set to 15 messages, I will always have at most 15 messages stored in the database for a given conversationId. Does it make sense? I'm sure we can optimize the operations, especially the saveAll() one, since it's not the best. But I guess we need to introduce identifiers for each row (right now they don't have a primary key). Possibly related to #2902. If we make the schema definition customizable, it would be possible to enable different types of implementations, such as having one row per conversationId, stored as a JSON BLOB since they are handled as a single unit anyway.

Considering also your other comment, I wonder if we need some kind of specialization of the API or perhaps new APIs. ChatMemory as designed now is tailored towards standard short-term memory, keeping only the messages identified by the specific strategy.

If we want to keep the entire history and then make decisions at query time (with standard capabilities for filtering, selecting, and so on), I think we might need a different strategy. That would probably require a separate API since it wouldn't be chat memory, but it would be chat history (two different concepts). And for that we would benefit from the existing Spring Data APIs. I'll create a separate issue to talk about that since it's strictly related to our wish to surface memory as first-class citizen in ChatClient, and when doing that we need to support actual memory (short-term) but also chat history (long-term).

About what kind of messages end up in the memory, right now the built-in advisors in ChatClient never store system messages and tool response messages. But since ChatMemory is a core API that can be used outside ChatClient (for example, directly with ChatModel as shown here), we need to support all message types.

Thank you very much for so detailed explanation, everything is now clarified!

What about DESC order - my comment was about old JdbcChatMemory , not JdbcChatMemoryRepository. So I still think it is a mistake there.

Oh yeah, good catch! I hadn't noticed that. Thanks for reporting it! That should still be there for the old implementation. Are you fixing it in your existing PR or should I create a separate task?

Thanks so much for all your contributions!

Already fixed it in my PR

Thanks so much!

ThomasVitale force-pushed the chat-memory-repository branch from e95b81e to 772be8b Compare April 25, 2025 15:59

ThomasVitale mentioned this pull request Apr 25, 2025

Introduce first-class chat memory support #2803

Closed

ThomasVitale force-pushed the chat-memory-repository branch from 772be8b to 2b96e4e Compare April 25, 2025 16:25

markpollack reviewed Apr 25, 2025

View reviewed changes

ThomasVitale force-pushed the chat-memory-repository branch from bd11883 to 59f8df8 Compare April 25, 2025 19:46

ThomasVitale force-pushed the chat-memory-repository branch from 59f8df8 to 918b788 Compare April 25, 2025 19:51

This was referenced Apr 25, 2025

Improve JDBC Chat Memory #2662

Closed

Fixed message order for JDBC Chat Memory #2781

Closed

fix(JdbcChatMemory): get query for MSSQL Server #2806

Closed

Create MongoDB chat memory implementation #2679

Open

ThomasVitale closed this Apr 25, 2025

ThomasVitale mentioned this pull request Apr 26, 2025

feat(redis): Add Redis-based semantic caching and chat memory implementations #2295

Open

linarkou reviewed Apr 28, 2025

View reviewed changes

This was referenced May 5, 2025

Migrate CassandraChatMemory to CassandraChatMemoryRepository #2998

Closed

Migrate Neo4jChatMemory to Neo4jChatMemoryRepository #2999

Closed


		void add(String conversationId, List<Message> messages);
		List<Message> findMessages(String conversationId);


		void clear(String conversationId);
		void deleteMessages(String conversationId);


		List<Message> get(String conversationId, int lastN);
		void saveMessages(String conversationId, List<Message> messages);

Chat Memory Enhancements #2890

Chat Memory Enhancements #2890

Uh oh!

Conversation

ThomasVitale commented Apr 25, 2025

Uh oh!

ThomasVitale commented Apr 25, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ThomasVitale commented Apr 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

linarkou commented Apr 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ThomasVitale Apr 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ThomasVitale commented Apr 25, 2025 •

edited

Loading

linarkou commented Apr 28, 2025 •

edited

Loading

ThomasVitale Apr 28, 2025 •

edited

Loading