ollama/integration
Jesse Gross 7121dfa309 runner.go: Retry decoding after defragmentation if needed
Fragmentation of the KV cache can occur due to cache shifting or
different sequences getting processed. Decode uses a heuristic to
decide if it should defrag. However, this heuristic isn't 100%
accurate, so decoding can sometimes fail by surprise.

For these cases, if decode indicates that there is no KV cache space,
we should defrag and then try again.
2024-11-20 12:49:24 -08:00
..
basic_test.go Give unicode test more time to run (#7437) 2024-10-31 13:35:31 -07:00
concurrency_test.go Give unicode test more time to run (#7437) 2024-10-31 13:35:31 -07:00
context_test.go runner.go: Retry decoding after defragmentation if needed 2024-11-20 12:49:24 -08:00
embed_test.go integration: harden embedding test (#7306) 2024-10-22 15:25:22 -07:00
llm_image_test.go Add basic mllama integration tests (#7455) 2024-10-31 17:25:48 -07:00
llm_test.go fix concurrency test 2024-08-05 16:36:16 -07:00
max_queue_test.go fix concurrency test 2024-08-05 16:36:16 -07:00
README.md Revamp go based integration tests 2024-03-23 14:24:18 +01:00
utils_test.go Re-introduce the llama package (#5034) 2024-10-08 08:53:54 -07:00

Integration Tests

This directory contains integration tests to exercise Ollama end-to-end to verify behavior

By default, these tests are disabled so go test ./... will exercise only unit tests. To run integration tests you must pass the integration tag. go test -tags=integration ./...

The integration tests have 2 modes of operating.

  1. By default, they will start the server on a random port, run the tests, and then shutdown the server.
  2. If OLLAMA_TEST_EXISTING is set to a non-empty string, the tests will run against an existing running server, which can be remote