Multiple models context management like Ollama. #40

svjack · 2024-05-04T00:44:39Z

With the help of llama-cpp-agent, I can use function calling and json-schema ability of one llama model
nearly perfectly. 😊
Given I want to use code-llm like codellama to generate function tools and use hermes-2-pro-mistral-7b to use them as
https://github.com/Maximilian-Winter/llama-cpp-agent/blob/master/examples/05_Agents/hermes_2_pro_agent.py
do.
And may use another llm by llama-cpp-python to take other tasks.
If I only have Limited gpu memory ,What's going to disturb me is the lack of model switch ability in llama-cpp-python, which also can see in
abetlen/llama-cpp-python#223

Auto model switch and the manage of gpu memory have be done by Ollama, but it lack ability of convenient function tools and json-schema output.

Or you can add a model switch ability in llama-cpp-agent, as
abetlen/llama-cpp-python#736
and
abetlen/llama-cpp-python#302
say.

How can I tackle this ? Looking forward to your reply. 😊

svjack closed this as completed May 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Multiple models context management like Ollama. #40

Multiple models context management like Ollama. #40

svjack commented May 4, 2024 •

edited

Loading

Multiple models context management like Ollama. #40

Multiple models context management like Ollama. #40

Comments

svjack commented May 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

svjack commented May 4, 2024 •

edited

Loading