ef22e478db | See #990. This change makes the logits_to_logprobs function equivalent to the version in the llama.cpp repository. It uses NumPy, so it is much faster than the previous version. | |
---|---|---|
server | ||
__init__.py | ||
_utils.py | ||
llama.py | ||
llama_chat_format.py | ||
llama_cpp.py | ||
llama_grammar.py | ||
llama_types.py | ||
llava_cpp.py | ||
py.typed |
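
The commit message at the top of this listing says that logits_to_logprobs now uses NumPy. As a rough illustration of what a vectorized logits-to-log-probabilities conversion can look like, here is a minimal sketch of a numerically stable log-softmax. It is not the code from llama.py or from llama.cpp; the function name `logits_to_logprobs_sketch` and its `axis` parameter are hypothetical and chosen only for this example.

```python
import numpy as np


def logits_to_logprobs_sketch(logits, axis=-1):
    """Convert raw logits to log-probabilities with a stable log-softmax.

    Hypothetical illustration only; not the project's actual implementation.
    """
    logits = np.asarray(logits, dtype=np.float64)
    # Subtract the per-row maximum so that np.exp never overflows.
    max_logits = np.max(logits, axis=axis, keepdims=True)
    shifted = logits - max_logits
    # log(sum(exp(x))) computed on the shifted values.
    log_sum_exp = np.log(np.sum(np.exp(shifted), axis=axis, keepdims=True))
    # log_softmax(x) = (x - max) - log(sum(exp(x - max)))
    return shifted - log_sum_exp


if __name__ == "__main__":
    logprobs = logits_to_logprobs_sketch([2.0, 1.0, 0.1])
    print(logprobs)
    # Exponentiating the log-probabilities should give values that sum to 1.
    print(np.exp(logprobs).sum())
```

Because the whole computation is expressed as array operations, it works on a single vector of logits or on a batch (one row per token) without a Python-level loop, which is the usual reason a NumPy version outperforms a pure-Python one.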