ollama/README.md

# Ollama

Run ai models locally.

> _Note: this project is a work in progress. The features below are still in development_

**Features**

- Run models locally on macOS (Windows, Linux and other platforms coming soon)
- Ollama uses the fastest loader available for your platform and model (e.g. llama.cpp, Core ML and other loaders coming soon)
- Import models from local files
- Find and download models on Hugging Face and other sources (coming soon)
- Support for running and switching between multiple models at a time (coming soon)
- Native desktop experience (coming soon)
- Built-in memory (coming soon)

## Install

```
pip install ollama
```

## Install From Source

```
git clone git@github.com:jmorganca/ollama ollama
cd ollama
pip install -r requirements.txt
pip install -e .
```

## Quickstart

```
% ollama run huggingface.co/TheBloke/orca_mini_3B-GGML
Pulling huggingface.co/TheBloke/orca_mini_3B-GGML...
Downloading [================           ] 66.67% 11.8MiB/s

...
...
...

> Hello

Hello, how may I help you?
```

## Python SDK

### Example

```python
import ollama
ollama.generate("orca-mini-3b", "hi")
```

### `ollama.generate(model, message)`

Generate a completion

```python
ollama.generate("./llama-7b-ggml.bin", "hi")
```

### `ollama.models()`

List available local models

```python
models = ollama.models()
```

### `ollama.serve()`

Serve the ollama http server

```
ollama.serve()
```

### `ollama.add(filepath)`

Add a model by importing from a file

```python
ollama.add("./path/to/model")
```

### `ollama.load(model)`

Manually a model for generation

```python
ollama.load("model")
```

### `ollama.unload(model)`

Unload a model

```python
ollama.unload("model")
```

### `ollama.pull(model)`

Download a model

```python
ollama.pull("huggingface.co/thebloke/llama-7b-ggml")
```

## Coming Soon

### `ollama.search("query")`

Search for compatible models that Ollama can run

```python
ollama.search("llama-7b")
```

## Documentation

- [Development](docs/development.md)
move to contained directory 2023-06-27 12:08:52 -04:00			`# Ollama`
initial commit 2023-06-22 12:45:31 -04:00
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`Run ai models locally.`
Add download link to readme 2023-06-27 17:13:07 -04:00
fix `README.md` formatting 2023-06-28 10:19:33 -04:00			`> _Note: this project is a work in progress. The features below are still in development_`
Add download link to readme 2023-06-27 17:13:07 -04:00
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`Features`
Add download link to readme 2023-06-27 17:13:07 -04:00
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`- Run models locally on macOS (Windows, Linux and other platforms coming soon)`
correct spelling for Core ML 2023-06-28 10:19:07 -04:00			`- Ollama uses the fastest loader available for your platform and model (e.g. llama.cpp, Core ML and other loaders coming soon)`
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`- Import models from local files`
			`- Find and download models on Hugging Face and other sources (coming soon)`
			`- Support for running and switching between multiple models at a time (coming soon)`
			`- Native desktop experience (coming soon)`
			`- Built-in memory (coming soon)`

			`## Install`
initial commit 2023-06-22 12:45:31 -04:00
			```
move to contained directory 2023-06-27 12:08:52 -04:00			`pip install ollama`
initial commit 2023-06-22 12:45:31 -04:00			```

update development.md 2023-06-28 09:51:04 -07:00			`## Install From Source`

			```
			`git clone git@github.com:jmorganca/ollama ollama`
			`cd ollama`
			`pip install -r requirements.txt`
			`pip install -e .`
			```

reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`## Quickstart`
reorganize directories 2023-06-25 13:08:03 -04:00
			```
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`% ollama run huggingface.co/TheBloke/orca_mini_3B-GGML`
			`Pulling huggingface.co/TheBloke/orca_mini_3B-GGML...`
loading bar customizations 2023-06-28 16:04:53 -04:00			`Downloading [================ ] 66.67% 11.8MiB/s`
reorganize directories 2023-06-25 13:08:03 -04:00
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`...`
			`...`
			`...`
move to contained directory 2023-06-27 12:08:52 -04:00
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`> Hello`

			`Hello, how may I help you?`
			```

			`## Python SDK`

			`### Example`
move to contained directory 2023-06-27 12:08:52 -04:00
			```python
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`import ollama`
correct spelling for Core ML 2023-06-28 10:19:07 -04:00			`ollama.generate("orca-mini-3b", "hi")`
reorganize directories 2023-06-25 13:08:03 -04:00			```

reorganize `README.md` files 2023-06-28 09:57:36 -04:00			### `ollama.generate(model, message)`
reorganize directories 2023-06-25 13:08:03 -04:00
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`Generate a completion`
reorganize directories 2023-06-25 13:08:03 -04:00
move to contained directory 2023-06-27 12:08:52 -04:00			```python
reorganize `README.md` files 2023-06-28 09:57:36 -04:00			`ollama.generate("./llama-7b-ggml.bin", "hi")`
reorganize directories 2023-06-25 13:08:03 -04:00			```

small `README.md` tweaks 2023-06-27 12:51:36 -04:00			### `ollama.models()`
reorganize directories 2023-06-25 13:08:03 -04:00
small `README.md` tweaks 2023-06-27 12:44:12 -04:00			`List available local models`
move to contained directory 2023-06-27 12:08:52 -04:00
tweak `README.md` 2023-06-28 10:57:18 -04:00			```python
move to contained directory 2023-06-27 12:08:52 -04:00			`models = ollama.models()`
reorganize directories 2023-06-25 13:08:03 -04:00			```

small `README.md` tweaks 2023-06-27 12:51:36 -04:00			### `ollama.serve()`
fix `README.md` 2023-06-25 13:10:15 -04:00
move to contained directory 2023-06-27 12:08:52 -04:00			`Serve the ollama http server`
reorganize directories 2023-06-25 13:08:03 -04:00
tweak `README.md` 2023-06-28 10:57:18 -04:00			```
			`ollama.serve()`
			```

add function 2023-06-27 17:36:02 -04:00			### `ollama.add(filepath)`
reorganize directories 2023-06-25 13:08:03 -04:00
add function 2023-06-27 17:36:02 -04:00			`Add a model by importing from a file`
move to contained directory 2023-06-27 12:08:52 -04:00
			```python
add function 2023-06-27 17:36:02 -04:00			`ollama.add("./path/to/model")`
reorganize directories 2023-06-25 13:08:03 -04:00			```

reorganize `README.md` files 2023-06-28 09:57:36 -04:00			### `ollama.load(model)`

			`Manually a model for generation`

			```python
			`ollama.load("model")`
			```

			### `ollama.unload(model)`

			`Unload a model`

			```python
			`ollama.unload("model")`
			```

add function 2023-06-27 17:36:02 -04:00			### `ollama.pull(model)`

			`Download a model`
move to contained directory 2023-06-27 12:08:52 -04:00
			```python
add function 2023-06-27 17:36:02 -04:00			`ollama.pull("huggingface.co/thebloke/llama-7b-ggml")`
move to contained directory 2023-06-27 12:08:52 -04:00			```
reorganize directories 2023-06-25 13:08:03 -04:00
fix spelling 2023-06-28 14:39:43 -04:00			`## Coming Soon`
pull from remote 2023-06-28 12:13:13 -04:00
small `README.md` tweaks 2023-06-27 12:51:36 -04:00			### `ollama.search("query")`
add light documentation for `/models` 2023-06-25 14:29:26 -04:00
move to contained directory 2023-06-27 12:08:52 -04:00			`Search for compatible models that Ollama can run`
add light documentation for `/models` 2023-06-25 14:29:26 -04:00
move to contained directory 2023-06-27 12:08:52 -04:00			```python
			`ollama.search("llama-7b")`
			```
reorganize directories 2023-06-25 13:08:03 -04:00
add development doc 2023-06-27 13:46:46 -04:00			`## Documentation`

			`- [Development](docs/development.md)`