ollama/server
royjhan b9f5e16c80
Introduce /api/embed endpoint supporting batch embedding (#5127)
* Initial Batch Embedding

* Revert "Initial Batch Embedding"

This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29.

* Initial Draft

* mock up notes

* api/embed draft

* add server function

* check normalization

* clean up

* normalization

* playing around with truncate stuff

* Truncation

* Truncation

* move normalization to go

* Integration Test Template

* Truncation Integration Tests

* Clean up

* use float32

* move normalize

* move normalize test

* refactoring

* integration float32

* input handling and handler testing

* Refactoring of legacy and new

* clear comments

* merge conflicts

* touches

* embedding type 64

* merge conflicts

* fix hanging on single string

* refactoring

* test values

* set context length

* clean up

* testing clean up

* testing clean up

* remove function closure

* Revert "remove function closure"

This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787.

* remove function closure

* remove redundant error check

* clean up

* more clean up

* clean up
2024-07-15 12:14:24 -07:00
..
auth.go Revert "use post token" 2024-05-11 22:19:14 -07:00
download.go server: skip blob verification for already verified blobs 2024-06-05 16:39:11 -07:00
fixblobs.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
fixblobs_test.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
images.go update message processing 2024-07-05 13:16:58 -07:00
layer.go Merge pull request #3718 from ollama/mxyng/modelname-3 2024-05-29 12:02:07 -07:00
manifest.go remove ManifestV2 2024-07-01 10:40:54 -07:00
manifest_test.go remove ManifestV2 2024-07-01 10:40:54 -07:00
model.go Merge pull request #5051 from ollama/mxyng/capabilities 2024-07-02 14:26:07 -07:00
model_test.go err on insecure path 2024-07-01 15:55:59 -07:00
modelpath.go fix: use envconfig.ModelsDir directly (#4821) 2024-07-03 15:36:11 -07:00
modelpath_test.go add OLLAMA_MODELS to envconfig (#5029) 2024-06-13 12:52:03 -07:00
prompt.go fix system prompt (#5662) 2024-07-12 21:04:44 -07:00
prompt_test.go fix system prompt (#5662) 2024-07-12 21:04:44 -07:00
routes.go Introduce /api/embed endpoint supporting batch embedding (#5127) 2024-07-15 12:14:24 -07:00
routes_create_test.go revert embedded templates to use prompt/response 2024-07-11 14:49:35 -07:00
routes_delete_test.go add OLLAMA_MODELS to envconfig (#5029) 2024-06-13 12:52:03 -07:00
routes_list_test.go add OLLAMA_MODELS to envconfig (#5029) 2024-06-13 12:52:03 -07:00
routes_test.go Introduce /api/embed endpoint supporting batch embedding (#5127) 2024-07-15 12:14:24 -07:00
sched.go sched: only error when over-allocating system memory (#5626) 2024-07-11 00:53:12 -07:00
sched_test.go Introduce /api/embed endpoint supporting batch embedding (#5127) 2024-07-15 12:14:24 -07:00
upload.go lint 2024-06-04 11:13:30 -07:00