docs: Update README

commit 247a16de66
Author: Andrei Betlen
Date:   2024-01-30 12:23:07 -05:00
Parent: 13b7ced7da


Documentation is available at [https://llama-cpp-python.readthedocs.io/en/latest](https://llama-cpp-python.readthedocs.io/en/latest).
## Installation
`llama-cpp-python` can be installed directly from PyPI as a source distribution by running:
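```bash
pip install llama-cpp-python
```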
This will build `llama.cpp` from source using CMake and your system's C compiler, and install the library alongside this Python package.
If you run into issues during installation, add the `--verbose` flag to the `pip install` command to see the full CMake build log.
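For example:
```bash
pip install llama-cpp-python --verbose
```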
### Installation with Specific Hardware Acceleration (BLAS, CUDA, Metal, etc.)
The default pip install behaviour is to build `llama.cpp` for CPU only on Linux and Windows, and to use Metal on macOS.
#### Vulkan
To install with Vulkan support, set the `LLAMA_VULKAN=on` CMake flag via the `CMAKE_ARGS` environment variable before installing:
```bash
CMAKE_ARGS="-DLLAMA_VULKAN=on" pip install llama-cpp-python
```
#### Kompute
To install with Kompute support, set the `LLAMA_KOMPUTE=on` CMake flag via the `CMAKE_ARGS` environment variable before installing:
```bash
CMAKE_ARGS="-DLLAMA_KOMPUTE=on" pip install llama-cpp-python
```
#### SYCL
To install with SYCL support, set the `LLAMA_SYCL=on` CMake flag via the `CMAKE_ARGS` environment variable before installing:
```bash
CMAKE_ARGS="-DLLAMA_SYCL=on" pip install llama-cpp-python
```
### Windows Notes
If you run into issues where the build complains it can't find `nmake` or `CMAKE_C_COMPILER`, you can extract w64devkit as [mentioned in the llama.cpp repo](https://github.com/ggerganov/llama.cpp#openblas) and add those paths manually to `CMAKE_ARGS` before running `pip install`:
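```ps
# Assumes w64devkit was extracted to C:/w64devkit; adjust the paths to match your setup.
$env:CMAKE_GENERATOR = "MinGW Makefiles"
$env:CMAKE_ARGS = "-DLLAMA_OPENBLAS=on -DCMAKE_C_COMPILER=C:/w64devkit/bin/gcc.exe -DCMAKE_CXX_COMPILER=C:/w64devkit/bin/g++.exe"
```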
### Function Calling
The high-level API also provides a simple interface for function calling.
Note that the only model that supports full function calling at this time is "functionary".
The gguf-converted files for this model can be found here: [functionary-7b-v1](https://huggingface.co/abetlen/functionary-7b-v1-GGUF)
```python
>>> from llama_cpp import Llama
>>> llm = Llama(model_path="path/to/functionary/llama-model.gguf", chat_format="functionary")
```
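A sketch of how a tool can then be passed to `create_chat_completion` (the `UserDetail` schema below is illustrative, not part of the library):
```python
>>> # Illustrative schema; any JSON-schema function definition can be used here.
>>> response = llm.create_chat_completion(
...     messages=[{"role": "user", "content": "Extract: Jason is 25 years old."}],
...     tools=[{
...         "type": "function",
...         "function": {
...             "name": "UserDetail",
...             "parameters": {
...                 "type": "object",
...                 "properties": {
...                     "name": {"type": "string"},
...                     "age": {"type": "integer"},
...                 },
...                 "required": ["name", "age"],
...             },
...         },
...     }],
...     tool_choice={"type": "function", "function": {"name": "UserDetail"}},
... )
```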
### Multi-modal Models
`llama-cpp-python` supports the llava1.5 family of multi-modal models, which allow the language model to read information from both text and images.
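A minimal sketch, assuming local paths for the llava model and its CLIP projector (`Llava15ChatHandler` comes from `llama_cpp.llama_chat_format`):
```python
>>> from llama_cpp import Llama
>>> from llama_cpp.llama_chat_format import Llava15ChatHandler
>>> chat_handler = Llava15ChatHandler(clip_model_path="path/to/llava/mmproj.bin")
>>> llm = Llama(
...     model_path="./path/to/llava/llama-model.gguf",
...     chat_handler=chat_handler,
...     n_ctx=2048,  # larger context window to accommodate the image embedding
...     logits_all=True,  # needed to make llava work
... )
```
Images can then be passed as `image_url` content parts in `create_chat_completion` messages.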
### Adjusting the Context Window
The context window determines the maximum number of tokens that can be processed at once. For instance, if you want to work with larger contexts, you can expand the context window by setting the `n_ctx` parameter when initializing the `Llama` object:
```python
llm = Llama(model_path="./models/7B/llama-model.gguf", n_ctx=2048)
```
## OpenAI Compatible Web Server
`llama-cpp-python` offers a web server which aims to act as a drop-in replacement for the OpenAI API.
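To get started, install the server extras and point it at a GGUF model:
```bash
pip install llama-cpp-python[server]
python3 -m llama_cpp.server --model models/7B/llama-model.gguf
```
Then navigate to http://localhost:8000/docs to see the OpenAPI documentation.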
## Docker image
A Docker image is available on [GHCR](https://ghcr.io/abetlen/llama-cpp-python). To run the server:
```bash
docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/llama-model.gguf ghcr.io/abetlen/llama-cpp-python:latest
```
[Docker on termux (requires root)](https://gist.github.com/FreddieOliveira/efe850df7ff3951cb62d74bd770dce27) is currently the only known way to run this on phones; see the [termux support issue](https://github.com/abetlen/llama-cpp-python/issues/389).
## Low-level API
The low-level API is a direct `ctypes` binding to the C API provided by `llama.cpp`. Below is a short example demonstrating how to use the low-level API to tokenize a prompt:
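The sketch below mirrors the example in the project's docs; low-level signatures change between `llama.cpp` versions, so treat it as illustrative rather than definitive:
```python
>>> import llama_cpp
>>> import ctypes
>>> llama_cpp.llama_backend_init(False)  # must be called once at the start of the program
>>> params = llama_cpp.llama_context_default_params()
>>> # char * parameters take bytes
>>> model = llama_cpp.llama_load_model_from_file(b"./models/7b/llama-model.gguf", params)
>>> ctx = llama_cpp.llama_new_context_with_model(model, params)
>>> max_tokens = params.n_ctx
>>> # array parameters take ctypes arrays
>>> tokens = (llama_cpp.llama_token * int(max_tokens))()
>>> n_tokens = llama_cpp.llama_tokenize(ctx, b"Q: Name the planets in the solar system? A: ", tokens, max_tokens, llama_cpp.c_bool(True))
>>> llama_cpp.llama_free(ctx)
```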
Check out the [examples folder](examples/low_level_api) for more examples of using the low-level API.
## Documentation
Documentation is available via [https://llama-cpp-python.readthedocs.io/](https://llama-cpp-python.readthedocs.io/).