llama.cpp

History

c0sogi a240aa6b25 Fix typos in llama_grammar		2023-08-17 21:00:44 +09:00
..
server	Add mul_mat_q option	2023-08-08 14:35:06 -04:00
__init__.py	Black formatting	2023-03-24 14:59:29 -04:00
llama.py	make n_gpu_layers=-1 offload all layers	2023-08-13 11:21:28 +08:00
llama_cpp.py	Update llama.cpp	2023-08-14 22:33:30 -04:00
llama_grammar.py	Fix typos in llama_grammar	2023-08-17 21:00:44 +09:00
llama_types.py	bugfix: fix compatibility bug with openai api on last token	2023-07-08 00:06:11 -04:00
py.typed	Add py.typed	2023-08-11 09:58:48 +02:00
utils.py	Suppress llama.cpp output when loading model.	2023-07-28 14:45:18 -04:00