| 39978ccaf5 This also fixes a crash when loading the 70B Llama 2 model on macOS with Metal and `n_gpu_layers=1` | | |
|---|---|---|
| server | | |
| `__init__.py` | | |
| `llama.py` | | |
| `llama_cpp.py` | | |
| `llama_types.py` | | |