No description

Find a file

Jeffrey Morgan 4dd296e155 build app in publish script		2023-07-12 19:16:39 -07:00
api	check api status	2023-07-11 13:42:05 -07:00
app	app: trim server lines before logging	2023-07-11 16:43:19 -07:00
cmd	pull fixes	2023-07-12 09:55:07 -07:00
docs	add publish script	2023-07-07 12:59:45 -04:00
examples/python	examples: add basic python example	2023-07-08 17:40:05 -04:00
llama	fix eof error in generate	2023-07-12 09:36:16 -07:00
scripts	build app in publish script	2023-07-12 19:16:39 -07:00
server	pull fixes	2023-07-12 09:55:07 -07:00
web	web: disable signup button while submitting	2023-07-12 17:32:27 -07:00
.dockerignore	update `Dockerfile`	2023-07-06 16:34:44 -04:00
.gitignore	fix compilation issue in Dockerfile, remove from `README.md` until ready	2023-07-11 19:51:08 -07:00
.prettierrc.json	move .prettierrc.json to root	2023-07-02 17:34:46 -04:00
Dockerfile	fix compilation issue in Dockerfile, remove from `README.md` until ready	2023-07-11 19:51:08 -07:00
ggml-metal.metal	look for ggml-metal in the same directory as the binary	2023-07-11 15:58:56 -07:00
go.mod	no errgroup	2023-07-11 14:58:10 -07:00
go.sum	no errgroup	2023-07-11 14:58:10 -07:00
LICENSE	`proto` -> `ollama`	2023-06-26 15:57:13 -04:00
main.go	add llama.cpp go bindings	2023-07-06 16:34:44 -04:00
models.json	update vicuna model	2023-07-12 09:42:26 -07:00
README.md	update `README.md` API reference	2023-07-12 19:16:28 -07:00

Ollama

Run large language models with llama.cpp.

Note: certain models that can be run with Ollama are intended for research and/or non-commercial use only.

Install

You can also build the binary from source.

Run a fast and simple model.

ollama run orca

Have a conversation.

ollama run vicuna "Why is the sky blue?"

Get a helping hand.

ollama run orca "Write an email to my boss."

Send the contents of a document and ask questions about it.

ollama run nous-hermes "$(cat input.txt)", please summarize this story

Venture into the unknown.

ollama run nous-hermes "Once upon a time"

ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

go build .

To run it start the server:

./ollama server &

Finally, run a model!

./ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

Download a model

curl -X POST http://localhost:11343/api/pull -d '{"model": "orca"}'

Complete a prompt

curl -X POST http://localhost:11434/api/generate -d '{"model": "orca", "prompt": "hello!"}'