<div align="center">
  <picture>
    <source media="(prefers-color-scheme: dark)" height="200px" srcset="https://github.com/jmorganca/ollama/assets/3325447/318048d2-b2dd-459c-925a-ac8449d5f02c">
    <img alt="logo" height="200px" src="https://github.com/jmorganca/ollama/assets/3325447/c7d6e15f-7f4d-4776-b568-c084afa297c2">
  </picture>
</div>
# Ollama
[![Discord](https://dcbadge.vercel.app/api/server/ollama?style=flat&compact=true)](https://discord.gg/ollama)
Create, run, and share large language models (LLMs). Ollama bundles a model's weights, configuration, prompts, and more into self-contained packages that can run on any machine.
> Note: Ollama is in early preview. Please report any issues you find.
## Download
- [Download](https://ollama.ai/download) for macOS on Apple Silicon (Intel coming soon)
- Download for Windows and Linux (coming soon)
- Build [from source](#building)
## Quickstart
To run and chat with [Llama 2](https://ai.meta.com/llama), the new model by Meta:
```
ollama run llama2
```
## Model library
Ollama includes a library of open-source, pre-trained models. More models are coming soon. You should have at least 8 GB of RAM to run the 3B models, 16 GB to run the 7B models, and 32 GB to run the 13B models.
| Model | Parameters | Size | Download |
| ------------------------ | ---------- | ----- | --------------------------- |
| Llama2 | 7B | 3.8GB | `ollama pull llama2` |
| Llama2 13B | 13B | 7.3GB | `ollama pull llama2:13b` |
| Orca Mini | 3B | 1.9GB | `ollama pull orca` |
| Vicuna | 7B | 3.8GB | `ollama pull vicuna` |
| Nous-Hermes | 13B | 7.3GB | `ollama pull nous-hermes` |
| Wizard Vicuna Uncensored | 13B | 7.3GB | `ollama pull wizard-vicuna` |
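
For example, to download the 13B variant of Llama 2 from the table and chat with it, following the same pattern as the quickstart:

```
ollama pull llama2:13b
ollama run llama2:13b
```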
## Examples
### Run a model
```
ollama run llama2
>>> hi
Hello! How can I help you today?
```
### Create a custom character model
Pull a base model:
```
ollama pull orca
```
Create a `Modelfile`:
```
FROM orca
PROMPT """
### System:
You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.
### User:
{{ .Prompt }}

### Response:
"""
```
Next, create and run the model:
```
ollama create mario -f ./Modelfile
ollama run mario
>>> hi
Hello! It's your friend Mario.
```
For more info on `Modelfile` syntax, see [this doc](./docs/modelfile).
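
Beyond `FROM` and `PROMPT`, that doc also covers generation settings. As a hedged sketch (the `PARAMETER` instruction, its names, and its value ranges are assumptions to verify against the doc), a variant of the base model with a higher sampling temperature might look like:

```
# Assumption: PARAMETER and the temperature setting are taken from the
# Modelfile doc linked above; check it for the exact supported options.
FROM orca
PARAMETER temperature 0.8
```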
### Pull a model from the registry
```
ollama pull nous-hermes
```
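
Once pulled, the model can be run just like the others above:

```
ollama run nous-hermes
```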
## Building
```
go build .
```
To run it, start the server:
```
./ollama serve &
```
Finally, run a model!
```
./ollama run llama2
```
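
The running server can also be reached over HTTP. As a minimal sketch, assuming the server listens on its default address of `localhost:11434` and exposes a `/api/generate` endpoint (verify both against your build):

```
curl -X POST http://localhost:11434/api/generate -d '{
  "model": "llama2",
  "prompt": "Why is the sky blue?"
}'
```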