ollama/README.md

97 lines
1.7 KiB
Markdown
Raw Normal View History

2023-07-05 15:37:33 -04:00
![ollama](https://github.com/jmorganca/ollama/assets/251292/961f99bb-251a-4eec-897d-1ba99997ad0f)
2023-06-27 12:08:52 -04:00
# Ollama
2023-06-22 12:45:31 -04:00
2023-07-05 15:37:33 -04:00
Run large language models with `llama.cpp`.
> Note: certain models that can be run with this project are intended for research and/or non-commercial use only.
### Features
2023-06-27 17:13:07 -04:00
2023-07-05 15:37:33 -04:00
- Download and run popular large language models
- Switch between multiple models on the fly
- Hardware acceleration where available (Metal, CUDA)
2023-07-05 16:32:49 -04:00
- Fast inference server written in Go, powered by [llama.cpp](https://github.com/ggerganov/llama.cpp)
2023-07-05 15:37:33 -04:00
- REST API to use with your application (python, typescript SDKs coming soon)
2023-06-28 09:57:36 -04:00
## Install
2023-06-22 12:45:31 -04:00
2023-07-05 15:37:33 -04:00
- Download for macOS
- Download for Windows (coming soon)
- Docker: `docker run -p 8080:8080 ollama/ollama`
You can also build the [binary from source](#building).
## Quickstart
Run the model that started it all.
2023-06-22 12:45:31 -04:00
```
2023-07-05 15:37:33 -04:00
ollama run llama
2023-06-22 12:45:31 -04:00
```
2023-07-05 15:37:33 -04:00
## Example models
### 💬 Chat
Have a conversation.
2023-06-30 12:31:00 -04:00
```
2023-07-05 15:37:33 -04:00
ollama run vicuna "Why is the sky blue?"
2023-06-30 12:31:00 -04:00
```
2023-07-05 15:37:33 -04:00
### 🗺️ Instructions
2023-06-25 13:08:03 -04:00
2023-07-05 15:37:33 -04:00
Ask questions. Get answers.
2023-06-29 18:25:02 -04:00
```
2023-07-05 15:37:33 -04:00
ollama run orca "Write an email to my boss."
2023-06-25 13:08:03 -04:00
```
2023-07-05 15:37:33 -04:00
### 👩‍💻 Code completion
Sometimes you just need a little help writing code.
2023-06-27 12:08:52 -04:00
2023-06-29 18:25:02 -04:00
```
2023-07-05 15:37:33 -04:00
ollama run replit "Give me react code to render a button"
2023-06-29 18:25:02 -04:00
```
2023-06-28 09:57:36 -04:00
2023-07-05 15:37:33 -04:00
### 📖 Storytelling
Venture into the unknown.
2023-06-29 18:25:02 -04:00
```
2023-07-05 15:37:33 -04:00
ollama run storyteller "Once upon a time"
2023-06-28 09:57:36 -04:00
```
2023-07-03 16:32:48 -04:00
## Building
```
go generate ./...
go build .
```
2023-07-05 15:37:33 -04:00
To run it start the server:
2023-06-27 13:46:46 -04:00
2023-07-05 15:37:33 -04:00
```
./ollama server &
```
Finally, run a model!
```
./ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin
```
## API Reference
### `POST /completion`
Complete a prompt
```
2023-07-05 19:19:01 -04:00
curl --unix-socket ~/.ollama/ollama.sock http://localhost/api/generate \
-X POST \
-d '{"model": "/path/to/model", "prompt": "Once upon a time", "stream": true}'
2023-07-05 15:37:33 -04:00
```