ollama/README.md

<div align="center">
  <picture>
    <source media="(prefers-color-scheme: dark)" height="200px" srcset="https://github.com/jmorganca/ollama/assets/3325447/56ea1849-1284-4645-8970-956de6e51c3c">
    <img alt="logo" height="200px" src="https://github.com/jmorganca/ollama/assets/3325447/0d0b44e2-8f4a-4e99-9b52-a5c1c741c8f7">
  </picture>
</div>

# Ollama

[![Discord](https://dcbadge.vercel.app/api/server/ollama?style=flat&compact=true)](https://discord.gg/ollama)

> Note: Ollama is in early preview. Please report any issues you find.

Create, run, and share portable large language models (LLMs). Ollama bundles a model’s weights, configuration, prompts, and more into self-contained packages that can run on any machine.

### Portable Large Language Models (LLMs)

Package models as a series of layers in a portable, easy to manage format.

#### The idea behind Ollama

- Universal model format that can run anywhere: desktop, cloud servers & other devices.
- Encapsulate everything a model needs to operate – weights, configuration, and data – into a single package.
- Build custom models from base models like Meta's [Llama 2](https://ai.meta.com/llama/)
- Share large models without having to transmit large amounts of data.

<picture>
  <source media="(prefers-color-scheme: dark)" height="480" srcset="https://github.com/jmorganca/ollama/assets/251292/2e05cf23-e3c6-403e-9910-3d622801f4b8">
  <img alt="logo" height="480" src="https://github.com/jmorganca/ollama/assets/251292/2e05cf23-e3c6-403e-9910-3d622801f4b8">
</picture>

This format is inspired by the [image spec](https://github.com/opencontainers/image-spec) originally introduced by Docker for Linux containers. Ollama extends this format to package large language models.

## Download

- [Download](https://ollama.ai/download) for macOS on Apple Silicon (Intel coming soon)
- Download for Windows and Linux (coming soon)
- Build [from source](#building)

## Quickstart

To run and chat with [Llama 2](https://ai.meta.com/llama), the new model by Meta:

```
ollama run llama2
```

## Model library

Ollama includes a library of open-source, pre-trained models. More models are coming soon. You should have at least 8 GB of RAM to run the 3B models, 16 GB to run the 7B models, and 32 GB to run the 13B models.

| Model                    | Parameters | Size  | Download                    |
| ------------------------ | ---------- | ----- | --------------------------- |
| Llama2                   | 7B         | 3.8GB | `ollama pull llama2`        |
| Llama2 13B               | 13B        | 7.3GB | `ollama pull llama2:13b`    |
| Orca Mini                | 3B         | 1.9GB | `ollama pull orca`          |
| Vicuna                   | 7B         | 3.8GB | `ollama pull vicuna`        |
| Nous-Hermes              | 13B        | 7.3GB | `ollama pull nous-hermes`   |
| Wizard Vicuna Uncensored | 13B        | 7.3GB | `ollama pull wizard-vicuna` |

## Examples

### Run a model

```
ollama run llama2
>>> hi
Hello! How can I help you today?
```

### Create a custom character model

Pull a base model:

```
ollama pull llama2
```

Create a `Modelfile`:

```
FROM llama2

# set the temperature to 1 [higher is more creative, lower is more coherent]
PARAMETER temperature 1

# set the system prompt
SYSTEM """
You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.
"""
```

Next, create and run the model:

```
ollama create mario -f ./Modelfile
ollama run mario
>>> hi
Hello! It's your friend Mario.
```

For more examples, see the [examples](./examples) directory.

### Pull a model from the registry

```
ollama pull orca
```

## Building

```
go build .
```

To run it start the server:

```
./ollama serve &
```

Finally, run a model!

```
./ollama run llama2
```
-												Update README.md

add logo
											
										
										
											2023-07-18 19:45:38 +00:00
+								<div align="center">
 								  <picture>
-												Update icon (#139)


											
										
										
											2023-07-20 15:55:20 +00:00
+								    <source media="(prefers-color-scheme: dark)" height="200px" srcset="https://github.com/jmorganca/ollama/assets/3325447/56ea1849-1284-4645-8970-956de6e51c3c">
 								    <img alt="logo" height="200px" src="https://github.com/jmorganca/ollama/assets/3325447/0d0b44e2-8f4a-4e99-9b52-a5c1c741c8f7">
-												Update README.md

add logo
											
										
										
											2023-07-18 19:45:38 +00:00
+								  </picture>
 								</div>
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
-												move to contained directory

											
										
										
											2023-06-27 16:08:52 +00:00
+								# Ollama
-												initial commit

											
										
										
											2023-06-22 16:45:31 +00:00
-												fix discord link in `README.md`

											
										
										
											2023-07-19 19:31:48 +00:00
+								[![Discord](https://dcbadge.vercel.app/api/server/ollama?style=flat&compact=true)](https://discord.gg/ollama)
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								> Note: Ollama is in early preview. Please report any issues you find.
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
-												documentation on the model format

											
										
										
											2023-07-20 15:33:28 +00:00
+								Create, run, and share portable large language models (LLMs). Ollama bundles a model’s weights, configuration, prompts, and more into self-contained packages that can run on any machine.
 								### Portable Large Language Models (LLMs)
 								Package models as a series of layers in a portable, easy to manage format.
 								#### The idea behind Ollama
 								- Universal model format that can run anywhere: desktop, cloud servers & other devices.
 								- Encapsulate everything a model needs to operate – weights, configuration, and data – into a single package.
 								- Build custom models from base models like Meta's [Llama 2](https://ai.meta.com/llama/)
 								- Share large models without having to transmit large amounts of data.
 								<picture>
 								  <source media="(prefers-color-scheme: dark)" height="480" srcset="https://github.com/jmorganca/ollama/assets/251292/2e05cf23-e3c6-403e-9910-3d622801f4b8">
 								  <img alt="logo" height="480" src="https://github.com/jmorganca/ollama/assets/251292/2e05cf23-e3c6-403e-9910-3d622801f4b8">
 								</picture>
 								This format is inspired by the [image spec](https://github.com/opencontainers/image-spec) originally introduced by Docker for Linux containers. Ollama extends this format to package large language models.
-												move download to the top of `README.md`

											
										
										
											2023-07-18 20:31:25 +00:00
+								## Download
 								- [Download](https://ollama.ai/download) for macOS on Apple Silicon (Intel coming soon)
 								- Download for Windows and Linux (coming soon)
 								- Build [from source](#building)
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
+								## Quickstart
 								To run and chat with [Llama 2](https://ai.meta.com/llama), the new model by Meta:
 								```
 								ollama run llama2
 								```
 								## Model library
 								Ollama includes a library of open-source, pre-trained models. More models are coming soon. You should have at least 8 GB of RAM to run the 3B models, 16 GB to run the 7B models, and 32 GB to run the 13B models.
 								| Model                    | Parameters | Size  | Download                    |
 								| ------------------------ | ---------- | ----- | --------------------------- |
 								| Llama2                   | 7B         | 3.8GB | `ollama pull llama2`        |
 								| Llama2 13B               | 13B        | 7.3GB | `ollama pull llama2:13b`    |
 								| Orca Mini                | 3B         | 1.9GB | `ollama pull orca`          |
 								| Vicuna                   | 7B         | 3.8GB | `ollama pull vicuna`        |
 								| Nous-Hermes              | 13B        | 7.3GB | `ollama pull nous-hermes`   |
 								| Wizard Vicuna Uncensored | 13B        | 7.3GB | `ollama pull wizard-vicuna` |
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								## Examples
-												Add download link to readme

											
										
										
											2023-06-27 21:13:07 +00:00
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
+								### Run a model
-												better `README.md` install instructions

											
										
										
											2023-06-30 16:39:25 +00:00
-												initial commit

											
										
										
											2023-06-22 16:45:31 +00:00
+								```
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								ollama run llama2
 								>>> hi
 								Hello! How can I help you today?
-												initial commit

											
										
										
											2023-06-22 16:45:31 +00:00
+								```
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
+								### Create a custom character model
 								Pull a base model:
 								```
-												new `Modelfile` syntax

											
										
										
											2023-07-20 09:21:51 +00:00
+								ollama pull llama2
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
+								```
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								Create a `Modelfile`:
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
-												add `docker` instruction

											
										
										
											2023-06-30 16:31:00 +00:00
+								```
-												new `Modelfile` syntax

											
										
										
											2023-07-20 09:21:51 +00:00
+								FROM llama2
-												set temperature on `README.md` example

											
										
										
											2023-07-20 15:17:09 +00:00
 								# set the temperature to 1 [higher is more creative, lower is more coherent]
 								PARAMETER temperature 1
 								# set the system prompt
-												new `Modelfile` syntax

											
										
										
											2023-07-20 09:21:51 +00:00
+								SYSTEM """
-												fix typo

											
										
										
											2023-07-18 20:32:06 +00:00
+								You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								"""
-												simplify `README.md`

											
										
										
											2023-06-29 22:25:02 +00:00
+								```
-												take all args as one prompt

- parse all run arguments into one prompt
- do not echo prompt back on one-shot
- example of summarizing a document

											
										
										
											2023-07-07 20:14:58 +00:00
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								Next, create and run the model:
-												take all args as one prompt

- parse all run arguments into one prompt
- do not echo prompt back on one-shot
- example of summarizing a document

											
										
										
											2023-07-07 20:14:58 +00:00
 								```
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								ollama create mario -f ./Modelfile
 								ollama run mario
 								>>> hi
 								Hello! It's your friend Mario.
-												take all args as one prompt

- parse all run arguments into one prompt
- do not echo prompt back on one-shot
- example of summarizing a document

											
										
										
											2023-07-07 20:14:58 +00:00
+								```
-												fix broken link in `README.md`

											
										
										
											2023-07-20 09:15:11 +00:00
+								For more examples, see the [examples](./examples) directory.
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
 								### Pull a model from the registry
-												add advanced usage to readme

											
										
										
											2023-07-06 20:21:01 +00:00
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
+								```
-												new `Modelfile` syntax

											
										
										
											2023-07-20 09:21:51 +00:00
+								ollama pull orca
-												add discord link, remove repeated text

											
										
										
											2023-07-19 19:28:50 +00:00
+								```
-												reorganize `README.md` files

											
										
										
											2023-06-28 13:57:36 +00:00
-												add llama.cpp go bindings

											
										
										
											2023-07-03 20:32:48 +00:00
+								## Building
 								```
-												vendor llama.cpp

											
										
										
											2023-07-11 16:50:02 +00:00
+								go build .
-												add llama.cpp go bindings

											
										
										
											2023-07-03 20:32:48 +00:00
+								```
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
+								To run it start the server:
-												add development doc

											
										
										
											2023-06-27 17:46:46 +00:00
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
+								```
-												Update README.md

I needed to do this to run the project
											
										
										
											2023-07-19 14:14:44 +00:00
+								./ollama serve &
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
+								```
 								Finally, run a model!
 								```
-												update `README.md` with new syntax

											
										
										
											2023-07-18 20:22:33 +00:00
+								./ollama run llama2
-												updated readme

											
										
										
											2023-07-05 19:37:33 +00:00
+								```