Michael Chiang 2906cbab11 Update development.md		2023-06-27 14:07:31 -04:00
desktop	update port in client	2023-06-27 14:00:25 -04:00
docs	Update development.md	2023-06-27 14:07:31 -04:00
models	adding orca models	2023-06-27 11:41:54 -04:00
.gitignore	add templates to prompt command	2023-06-26 13:41:16 -04:00
build.py	move back to root	2023-06-27 12:12:44 -04:00
LICENSE	`proto` -> `ollama`	2023-06-26 15:57:13 -04:00
model_prompts.json	fix build	2023-06-27 12:32:21 -04:00
ollama.py	Change default port	2023-06-27 13:59:43 -04:00
README.md	fix `README.md` typo	2023-06-27 13:52:32 -04:00
requirements.txt	move back to root	2023-06-27 12:12:44 -04:00
template.py	fix build	2023-06-27 12:32:21 -04:00

Ollama

Install

pip install ollama

Quickstart

import ollama
ollama.generate("./llama-7b-ggml.bin", "hi")

Generate a completion

ollama.generate("./llama-7b-ggml.bin", "hi")
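The first argument in the call above is a path to a local model file, so it can help to check that the file exists before generating. The following is a minimal sketch, not part of the package: `safe_generate` is a hypothetical helper, and it assumes only that the generate function takes `(model_path, prompt)` as shown above. The function is passed in as a parameter so the sketch makes no assumption about what the package returns.

```python
from pathlib import Path
from typing import Callable


def safe_generate(
    generate: Callable[[str, str], object],
    model_path: str,
    prompt: str,
) -> object:
    """Call generate(model_path, prompt) only if model_path is an existing file.

    `generate` is whatever callable produces the completion (assumed here to
    be something like ollama.generate); this helper is purely illustrative.
    """
    if not Path(model_path).is_file():
        # Fail early with a clear message instead of deep inside the model loader.
        raise FileNotFoundError(f"model file not found: {model_path}")
    return generate(model_path, prompt)
```

Under those assumptions, usage would look like `safe_generate(ollama.generate, "./llama-7b-ggml.bin", "hi")`.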

Load a model for generation

ollama.load("model")

List available local models

models = ollama.models()

Serve the Ollama HTTP server

Download a model

ollama.pull("huggingface.co/thebloke/llama-7b-ggml")

Import a model from a file

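# `import` is a reserved word in Python, so `ollama.import(...)` does not parse.
# Assuming the function really is named `import`, it can be reached via getattr:
getattr(ollama, "import")("./path/to/model")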
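One caveat about the name: `import` is a Python keyword, so an attribute with that name cannot be reached with plain dot syntax. The sketch below illustrates this with a stand-in object; `fake_ollama` is hypothetical and is not the real package, whose actual function name may differ.

```python
import keyword
from types import SimpleNamespace

# Stand-in for the module: a namespace with an attribute literally named "import".
fake_ollama = SimpleNamespace()
setattr(fake_ollama, "import", lambda path: f"imported {path}")

# "import" is reserved, which is why the dot form cannot be parsed.
assert keyword.iskeyword("import")

# Compiling the dot-syntax call fails before anything runs.
try:
    compile('fake_ollama.import("./path/to/model")', "<example>", "exec")
    dot_syntax_ok = True
except SyntaxError:
    dot_syntax_ok = False

# getattr with a string name works for any attribute, reserved word or not.
result = getattr(fake_ollama, "import")("./path/to/model")
```

A non-reserved name (for example a trailing underscore, as PEP 8 suggests for keyword clashes) would avoid the `getattr` workaround.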

Search for compatible models that Ollama can run

ollama.search("llama-7b")

An `ollama` CLI is planned for running models on servers, in containers, or in local development environments.

ollama generate huggingface.co/thebloke/llama-7b-ggml "hi"
> Downloading [================>          ] 66.67% (2/3) 30.2MB/s
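The progress line above combines a bar, a percentage, a completed/total counter, and a transfer rate. As a rough sketch of how a line in that shape could be rendered (not the CLI's actual implementation), the hypothetical `render_progress` below derives the percentage from the counter. Its name, its parameters, and the bar width of 27 are all assumptions.

```python
def render_progress(done: int, total: int, rate_mb_s: float, width: int = 27) -> str:
    """Render a one-line download progress string.

    done/total is the completed-parts counter, rate_mb_s the transfer rate
    in MB/s, and width the number of characters between the brackets.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    done = max(0, min(done, total))  # clamp so the bar never overflows
    filled = done * width // total   # integer math avoids float rounding
    if filled < width:
        # Unfinished: '=' for completed cells, '>' as the head, spaces for the rest.
        bar = "=" * filled + ">" + " " * (width - filled - 1)
    else:
        bar = "=" * width
    percent = 100 * done / total
    return f"Downloading [{bar}] {percent:.2f}% ({done}/{total}) {rate_mb_s:.1f}MB/s"


line = render_progress(2, 3, 30.2)
```

With `done=2` and `total=3`, the line ends in `66.67% (2/3) 30.2MB/s`, matching the figures in the example output. The exact bar fill may differ from the CLI's.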