docs: Add information re: auto chat formats. Closes #1236
This commit is contained in:
parent
f062a7f51d
commit
97aa3a153d
2 changed files with 13 additions and 2 deletions
11
README.md
11
README.md
|
@ -286,7 +286,16 @@ By default [`from_pretrained`](https://llama-cpp-python.readthedocs.io/en/latest
|
|||
|
||||
The high-level API also provides a simple interface for chat completion.
|
||||
|
||||
Note that `chat_format` option must be set for the particular model you are using.
|
||||
Chat completion requires that the model know how to format the messages into a single prompt.
|
||||
The `Llama` class does this using pre-registered chat formats (ie. `chatml`, `llama-2`, `gemma`, etc) or by providing a custom chat handler object.
|
||||
|
||||
The model will will format the messages into a single prompt using the following order of precedence:
|
||||
- Use the `chat_handler` if provided
|
||||
- Use the `chat_format` if provided
|
||||
- Use the `tokenizer.chat_template` from the `gguf` model's metadata (should work for most new models, older models may not have this)
|
||||
- else, fallback to the `llama-2` chat format
|
||||
|
||||
Set `verbose=True` to see the selected chat format.
|
||||
|
||||
```python
|
||||
>>> from llama_cpp import Llama
|
||||
|
|
|
@ -410,7 +410,7 @@ class Llama:
|
|||
bos_token = self._model.token_get_text(bos_token_id)
|
||||
|
||||
if self.verbose:
|
||||
print(f"Using chat template: {template}", file=sys.stderr)
|
||||
print(f"Using gguf chat template: {template}", file=sys.stderr)
|
||||
print(f"Using chat eos_token: {eos_token}", file=sys.stderr)
|
||||
print(f"Using chat bos_token: {bos_token}", file=sys.stderr)
|
||||
|
||||
|
@ -420,6 +420,8 @@ class Llama:
|
|||
|
||||
if self.chat_format is None and self.chat_handler is None:
|
||||
self.chat_format = "llama-2"
|
||||
if self.verbose:
|
||||
print(f"Using fallback chat format: {chat_format}", file=sys.stderr)
|
||||
|
||||
@property
|
||||
def ctx(self) -> llama_cpp.llama_context_p:
|
||||
|
|
Loading…
Add table
Reference in a new issue