llama : add retrieval example #5692

@ggerganov

Description

Since we now support embedding models in llama.cpp, we should add a simple example to demonstrate retrieval functionality. Here is how it should work:

  • load a set of text files (provided via the command line)
  • split the text into chunks of user-configurable size, each chunk ending on a configurable stop string
  • embed all chunks using an embedding model (BERT / SBERT)
  • receive input from the command line, embed it, and display the top N most relevant chunks based on cosine similarity between the input and chunk embeddings
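The chunking step described above can be sketched roughly as follows. This is a hypothetical helper, not part of llama.cpp: it splits text into chunks of a user-configurable minimum size, extending each chunk to the next occurrence of a configurable stop string so chunks end on a natural boundary.

```python
def chunk_text(text: str, chunk_size: int = 64, separator: str = ".") -> list[str]:
    """Split `text` into chunks of at least `chunk_size` characters,
    each extended to end on the next occurrence of `separator`."""
    chunks = []
    start = 0
    while start < len(text):
        end = start + chunk_size
        # extend the chunk to the next stop string so it ends on a boundary
        sep_pos = text.find(separator, end)
        if sep_pos == -1:
            # no further separator: the remainder becomes the last chunk
            chunks.append(text[start:].strip())
            break
        chunks.append(text[start:sep_pos + len(separator)].strip())
        start = sep_pos + len(separator)
    return chunks
```

For example, `chunk_text("Aaa. Bbb. Ccc. Ddd.", chunk_size=5, separator=".")` yields two chunks, each ending on the stop string.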
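The final ranking step — cosine similarity between the query embedding and the chunk embeddings — can be illustrated with a small sketch. The function names and the use of plain Python lists here are illustrative assumptions; in the actual example the embeddings would come from the loaded BERT/SBERT model.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_n_chunks(query_emb: list[float],
                 chunk_embs: list[list[float]],
                 chunks: list[str],
                 n: int = 3) -> list[tuple[str, float]]:
    """Return the n chunks most similar to the query, with their scores."""
    scored = sorted(
        zip(chunks, (cosine_similarity(query_emb, e) for e in chunk_embs)),
        key=lambda pair: pair[1],
        reverse=True,
    )
    return scored[:n]
```

Identical vectors score 1.0 and orthogonal vectors score 0.0, so sorting by this score in descending order surfaces the most relevant chunks first.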
