-
Notifications
You must be signed in to change notification settings - Fork 146
Byok rework #611
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Byok rework #611
Conversation
7de8829
to
c15ac59
Compare
f7197b3
to
e4bb5f8
Compare
|
|
looks like the |
Why would we need more than one embedding model? if we change it we'll need to create a new vecdb, and this is the only use for embedding model right now, but if you see any way we could use different embedding models for some reason maybe it's better to make it indexmap like others for future-proofing I did not found where we used n_ctx for embedding models everywhere, but didn't want to move it outside of base model rec to make some things easier in the code, still, I'll add it to those two to not have 0 there for the known models, but it's not that important to fill that for all models |
in caps.rs
lets remove it, we want flat list without any weird stuff
actually we dont have models with multiple scratchpads (or nobody use it)
in server we can have 2 embedding models (pgu and cpu) Looks like |
Inside one provider (server caps, cloud caps, or yaml files in providers.d, it's name, then lsp makes it "provider/model_name" when it loads the caps of all providers, if you found this in the merged caps that lsp makes, then it's a bug |
You're right about one scratchpad per model, I'll change that to make model record simpler, just scratchpad_name (string) and scratchpad_value (json value) looks better. For embedding models, it's better to use same as other I think from what you talked, and we may use more than one in the future, better to make same logic as for others I guess, I'll change that. If similar_models is not needed for compatibility in self-host/enterprise then I'll get rid of it, it will make things simpler, we don't need it for cloud or byok |
one more
annoying errors (which is not error btw) |
in the
we need it for backward compat with old caps / byok style, but of course not for new internal caps format
for example I want to use openai embedder instead of gte (byok) about embedding model: it has |
* un-disable input when limit reached. * chore: add `compression_strength` to tool messages * add paused state to thread * add hook to pause auto send based on compression. * ui: let the user know that their chat is being compressed. * fix: linter issues after removing `limitReached` information call out. * fix: also use `/links` to decided if a new chat should be suggested. * refactor: remove `useTotalTokenUsage` hook. * add comments about `newChatSuggested`. * pause and unpaused using newChatSuggested. * fix(NewChatSuggested): use a hook to get the compression strength. * feat: add second condition for pausing the chat. * case: it might be posable for many messages with out compression.
…ting them to last assistant message
ebd5da4
to
fb86e2d
Compare
it was only used in telemetry, and it was replaced by IntoResponse
* wip: remove attach file checkbox. * feat: attach files button. * test: active file is no longer attached by default. * add an event for the ide to attach a file to chat. * fix: remove attached files after submit.
8afcda3
to
3f65277
Compare
- claude-3-5-sonnet-latest | ||
- claude-3-5-haiku-latest | ||
|
||
model_defaults: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ui
refact-agent/engine/src/caps/mod.rs
Outdated
#[derive(Debug, Serialize, Deserialize, Clone, Default)] | ||
pub struct CodeAssistantCaps { | ||
#[serde(deserialize_with = "normalize_string")] | ||
pub cloud_name: String, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
// "refact", "refact_self_hosted"
…eople don't confuse it with instruct
…se if stream was DONE
This reverts commit 860578a.
This reverts commit 860578a.
* add metadata in caps * wip: add coin count to history item * add cost info to caps select * wip: replace message with coins in menu. * wip: handle out of coins error. * Byok rework (#611) * ui: add better ui for caps cost. * wip: history item, calculate cost. * remove usage editing in lsp * wip: calculate total price in history. * refactor: display coin usage * chore: update caps stubs. * chore(coins): remove old agent usage logic * refactor(startPollingForUser): rename useAgentUsage. * feat: add warning when balance starts to get low. * fix: add metadata to caps after moving caps * fix: re-add metadata to caps. * fix: caps type guard. * fix: <p> being a descendant of <p> * fix typo: ballance -> balance * fix: typo `ballance` * small change to trigger actions. * fix: caps pricing maybe undefined when using byok * fix type check for pricing being undefined when using byok. * fix: cost * usage counter should be two decimal points. * update calculate total cost. * missed toFixed on total cost. * round total cost. --------- Co-authored-by: Kirill Starkov <[email protected]> Co-authored-by: Humberto Yusta Gómez <[email protected]>
Let's don't merge it yet