llama.cpp

History

Andrei Betlen ef23d1e545 Update llama.cpp		2023-08-25 14:35:53 -04:00
..
server	Add mul_mat_q option	2023-08-08 14:35:06 -04:00
__init__.py	Black formatting	2023-03-24 14:59:29 -04:00
llama.py	Use _with_model variants for tokenization	2023-08-25 13:43:16 -04:00
llama_cpp.py	Update llama.cpp	2023-08-25 14:35:53 -04:00
llama_grammar.py	Fix typos in llama_grammar	2023-08-17 21:00:44 +09:00
llama_types.py	bugfix: fix compatibility bug with openai api on last token	2023-07-08 00:06:11 -04:00
py.typed	Add py.typed	2023-08-11 09:58:48 +02:00
utils.py	Suppress llama.cpp output when loading model.	2023-07-28 14:45:18 -04:00