server | Add mul_mat_q option | 2023-08-08 14:35:06 -04:00
__init__.py | Black formatting | 2023-03-24 14:59:29 -04:00
llama.py | make n_gpu_layers=-1 offload all layers | 2023-08-13 11:21:28 +08:00
llama_cpp.py | Update llama.cpp | 2023-08-14 22:33:30 -04:00
llama_grammar.py | Fix typos in llama_grammar | 2023-08-17 21:00:44 +09:00
py.typed | Add py.typed | 2023-08-11 09:58:48 +02:00
utils.py | Suppress llama.cpp output when loading model. | 2023-07-28 14:45:18 -04:00