# Ollama
Create, run, and share large language models (LLMs). Ollama bundles a model’s weights, configuration, prompts, and more into self-contained packages that can run on any machine.
Note: Ollama is in early preview. Please report any issues you find.
## Download
- Download for macOS on Apple Silicon (Intel coming soon)
- Download for Windows and Linux (coming soon)
- Build from source
## Quickstart
To run and chat with Llama 2, the new model by Meta:
```
ollama run llama2
```
## Model library
Ollama includes a library of open-source, pre-trained models. More models are coming soon. You should have at least 8 GB of RAM to run the 3B models, 16 GB to run the 7B models, and 32 GB to run the 13B models.
| Model | Parameters | Size | Download |
| ----- | ---------- | ---- | -------- |
| Llama2 | 7B | 3.8GB | `ollama pull llama2` |
| Llama2 13B | 13B | 7.3GB | `ollama pull llama2:13b` |
| Orca Mini | 3B | 1.9GB | `ollama pull orca` |
| Vicuna | 7B | 3.8GB | `ollama pull vicuna` |
| Nous-Hermes | 13B | 7.3GB | `ollama pull nous-hermes` |
| Wizard Vicuna Uncensored | 13B | 7.3GB | `ollama pull wizard-vicuna` |
## Examples
### Run a model
```
ollama run llama2
>>> hi
Hello! How can I help you today?
```
### Create a custom character model
Pull a base model:
```
ollama pull orca
```
Create a `Modelfile`:
```
FROM orca
PROMPT """
### System:
You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.

### User:
{{ .Prompt }}

### Response:
"""
```
Next, create and run the model:
```
ollama create mario -f ./Modelfile
ollama run mario
>>> hi
Hello! It's your friend Mario.
```
For more info on `Modelfile` syntax, see the documentation in the `docs/` directory.
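The `{{ .Prompt }}` placeholder in the `PROMPT` block above follows Go's `text/template` syntax. As a rough, standalone sketch of how such a template expands (illustrative only, not Ollama's actual rendering code):

```go
package main

import (
	"os"
	"text/template"
)

// promptTemplate mirrors the PROMPT block from the Modelfile above.
const promptTemplate = `### System:
You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.

### User:
{{ .Prompt }}

### Response:
`

func main() {
	tmpl := template.Must(template.New("prompt").Parse(promptTemplate))
	// The struct field name must match the {{ .Prompt }} placeholder.
	data := struct{ Prompt string }{Prompt: "hi"}
	if err := tmpl.Execute(os.Stdout, data); err != nil {
		panic(err)
	}
}
```

Running this prints the full prompt with the user's input substituted where `{{ .Prompt }}` appears, which is the shape of text the model ultimately sees.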
### Pull a model from the registry
```
ollama pull nous-hermes
```
## Building
```
go build .
```
To run it, start the server:

```
./ollama serve &
```
Finally, run a model!
```
./ollama run llama2
```
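The server can also be driven over HTTP instead of the CLI. The sketch below assumes a `POST /api/generate` endpoint on `localhost:11434` that streams newline-delimited JSON with `response` and `done` fields; these details are based on later Ollama releases and may differ in this early preview, so treat the route, port, and field names as assumptions.

```go
package main

import (
	"bufio"
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

func main() {
	// Request body: model name plus the user prompt.
	body, err := json.Marshal(map[string]string{
		"model":  "llama2",
		"prompt": "hi",
	})
	if err != nil {
		panic(err)
	}

	// Assumed endpoint and port; verify against the running server.
	resp, err := http.Post("http://localhost:11434/api/generate",
		"application/json", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// Responses stream back one JSON object per line.
	scanner := bufio.NewScanner(resp.Body)
	for scanner.Scan() {
		var chunk struct {
			Response string `json:"response"`
			Done     bool   `json:"done"`
		}
		if err := json.Unmarshal(scanner.Bytes(), &chunk); err != nil {
			continue
		}
		fmt.Print(chunk.Response)
		if chunk.Done {
			break
		}
	}
}
```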