feat: limit message context sent to llm #66
Merged
Problem
In an ideal world, we would send as much context to the LLM as possible so it has the most information available to respond to your questions/tasks. Unfortunately, cumulative token costs grow quadratically with the length of the chat, since every request has to resend all previous messages.
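To make the growth concrete, here is a quick back-of-the-envelope sketch (the numbers are hypothetical, not measured from this app):

```ts
// Rough cost model: each request resends the full history, so if every
// message averages t tokens, request i carries roughly i * t tokens.
// The total over n messages is t * n * (n + 1) / 2 — quadratic in n.
const tokensPerMessage = 100; // hypothetical average

function cumulativeTokens(messageCount: number): number {
  return (tokensPerMessage * messageCount * (messageCount + 1)) / 2;
}

console.log(cumulativeTokens(30));  // 46_500 tokens after 30 messages
console.log(cumulativeTokens(300)); // 4_515_000 — ~100x the cost for 10x the messages
```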
Solution
To keep token costs from ballooning, we can cap the number of previous messages sent to the LLM as context on each request.
Important: this does not mean the actual message history is limited on the frontend. It strictly refers to the sliding window of the past X messages sent to the LLM as context each time.
This PR sets the message context limit to 30 messages. The consequence of this change is that the model has no memory of messages older than 30 messages, so it will be unable to answer questions about, or refer to information from, more than 30 messages back.
30 messages was chosen as enough context to help the user accomplish whatever task they are working on in the moment, without resending irrelevant/stale messages every time, which would add a large cost for little value.
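For reference, the trimming itself can be as simple as a slice over the message list. A minimal sketch of the sliding window (the `Message` shape and `MAX_CONTEXT_MESSAGES` name are illustrative, not the actual identifiers in this PR):

```ts
interface Message {
  role: "user" | "assistant" | "system";
  content: string;
}

// Sliding window: only the most recent MAX_CONTEXT_MESSAGES messages are
// sent to the LLM; the full history stays untouched on the frontend.
const MAX_CONTEXT_MESSAGES = 30;

function buildLlmContext(history: Message[]): Message[] {
  // slice with a negative index returns the final N elements
  // (or fewer, if the history is shorter than the window).
  return history.slice(-MAX_CONTEXT_MESSAGES);
}
```

A real implementation would likely also pin the system prompt outside the window so it never falls out of context.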
Future
In the future, we could summarize old messages before trimming the context so the model at least has some history to refer to. Rewriting message history can get tricky, though, so this will need more thought.
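As a rough illustration of the summarize-before-trim idea (everything here is hypothetical and not part of this PR; it reuses the `Message` type and `MAX_CONTEXT_MESSAGES` constant from the sketch above, and `summarize` stands in for e.g. a cheap LLM call):

```ts
// Hypothetical: compress messages that fall outside the window into a single
// synthetic message, so the model keeps a lossy memory of older turns.
async function buildContextWithSummary(
  history: Message[],
  summarize: (messages: Message[]) => Promise<string>,
): Promise<Message[]> {
  if (history.length <= MAX_CONTEXT_MESSAGES) return history;

  const older = history.slice(0, -MAX_CONTEXT_MESSAGES);
  const recent = history.slice(-MAX_CONTEXT_MESSAGES);
  const summary = await summarize(older);

  return [
    { role: "system", content: `Summary of earlier conversation: ${summary}` },
    ...recent,
  ];
}
```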