Michael Yang
e9f6df7dca
use slices.DeleteFunc
2023-09-05 09:56:59 -07:00
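The commit above swaps a hand-rolled filtering loop for slices.DeleteFunc, added to the Go standard library in 1.21. A minimal sketch of the pattern; the model names and predicate are illustrative, not taken from the codebase:

    package main

    import (
        "fmt"
        "slices"
        "strings"
    )

    func main() {
        names := []string{"llama2:7b", "llama2:13b", "orca-mini:3b"}
        // Drop every entry matching the predicate instead of copying into a new slice by hand.
        names = slices.DeleteFunc(names, func(n string) bool {
            return strings.HasPrefix(n, "orca-mini")
        })
        fmt.Println(names) // [llama2:7b llama2:13b]
    }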
Bruce MacDonald
42998d797d
subprocess llama.cpp server (#401)
* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm
2023-08-30 16:35:03 -04:00
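This change moves from in-process bindings to running llama.cpp as a child process. A rough sketch of the mechanics listed in the bullet points above, using only the standard library; the binary name, flags, port, and temp-dir layout are assumptions, not the actual runner code:

    package main

    import (
        "context"
        "log"
        "os"
        "os/exec"
        "path/filepath"
    )

    // One way to fetch and build the native libraries at development time is a
    // go:generate directive; the command below is only a placeholder.
    //go:generate echo "build llama.cpp runner here"

    func startRunner(ctx context.Context, modelPath string) (*exec.Cmd, error) {
        // Run the server binary out of a temp dir for the lifetime of the app.
        dir, err := os.MkdirTemp("", "llm")
        if err != nil {
            return nil, err
        }

        // Tying the subprocess to a context means it is killed when the request
        // or the app shuts down; thread count is left for llama.cpp to decide.
        cmd := exec.CommandContext(ctx, filepath.Join(dir, "llama-server"),
            "--model", modelPath, "--port", "8081")
        cmd.Stdout = os.Stdout
        cmd.Stderr = os.Stderr
        return cmd, cmd.Start()
    }

    func main() {
        ctx, cancel := context.WithCancel(context.Background())
        defer cancel()
        cmd, err := startRunner(ctx, "model.gguf")
        if err != nil {
            log.Fatal(err)
        }
        _ = cmd.Wait() // the runner exits when the context is cancelled or the app stops
    }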
Michael Yang
d791df75dd
check memory requirements before loading
2023-08-10 09:23:11 -07:00
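A sketch of the idea behind this commit: compare the model's size on disk against available memory before attempting a load. The availableMemory helper and the 1.2 overhead factor are hypothetical; the real check is platform-specific and uses the project's own heuristics.

    package main

    import (
        "fmt"
        "os"
    )

    // availableMemory is a hypothetical stand-in for a platform-specific query
    // (e.g. sysinfo on Linux or host_statistics on macOS).
    func availableMemory() uint64 { return 16 << 30 }

    func checkMemory(modelPath string) error {
        info, err := os.Stat(modelPath)
        if err != nil {
            return err
        }
        // Loading needs the weights plus working buffers; the 1.2 margin here
        // is illustrative only.
        need := uint64(float64(info.Size()) * 1.2)
        if need > availableMemory() {
            return fmt.Errorf("model requires %d bytes but only %d available", need, availableMemory())
        }
        return nil
    }

    func main() {
        if err := checkMemory("model.gguf"); err != nil {
            fmt.Println(err)
        }
    }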
Bruce MacDonald
a6f6d18f83
embed text document in modelfile
2023-08-08 11:27:17 -04:00
Bruce MacDonald
1c5a8770ee
read runner parameter options from map
- read runner options from a map to see what was specified explicitly, so explicit zero values can overwrite defaults
2023-08-01 13:38:19 -04:00
Bruce MacDonald
daa0d1de7a
allow specifying zero values in modelfile
2023-08-01 13:37:50 -04:00
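The two commits above address the same problem: with a plain options struct you cannot tell a field that was never set apart from one explicitly set to zero, so zero values in a Modelfile were silently replaced by defaults. Reading the options from a map first makes the distinction visible. A simplified sketch; the option names are illustrative:

    package main

    import "fmt"

    type Options struct {
        NumGPU      int
        Temperature float64
    }

    // applyOptions only touches fields that actually appear in the map, so an
    // explicit zero (e.g. num_gpu 0) survives instead of being treated as unset.
    func applyOptions(opts *Options, m map[string]any) {
        if v, ok := m["num_gpu"]; ok {
            opts.NumGPU = v.(int)
        }
        if v, ok := m["temperature"]; ok {
            opts.Temperature = v.(float64)
        }
    }

    func main() {
        opts := Options{NumGPU: 1, Temperature: 0.8} // defaults
        applyOptions(&opts, map[string]any{"num_gpu": 0})
        fmt.Printf("%+v\n", opts) // {NumGPU:0 Temperature:0.8}
    }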
Michael Yang
8609db77ea
use gin-contrib/cors middleware
2023-07-22 09:39:08 -07:00
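github.com/gin-contrib/cors is the stock CORS middleware for gin. A minimal example of wiring it in; the allowed origin and route handler are placeholders, not the server's real configuration:

    package main

    import (
        "net/http"

        "github.com/gin-contrib/cors"
        "github.com/gin-gonic/gin"
    )

    func main() {
        r := gin.Default()

        // Register the CORS middleware before any routes.
        config := cors.DefaultConfig()
        config.AllowOrigins = []string{"http://localhost:3000"} // placeholder origin
        r.Use(cors.New(config))

        r.GET("/api/tags", func(c *gin.Context) {
            c.JSON(http.StatusOK, gin.H{"models": []string{}})
        })
        r.Run(":11434")
    }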
Patrick Devine
e4d7f3e287
vendor in progress bar and change to bytes instead of bibytes (#130)
2023-07-19 17:24:03 -07:00
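"Bytes instead of bibytes" means reporting sizes in decimal units (kB, MB, GB, powers of 1000) rather than binary units (KiB, MiB, GiB, powers of 1024). A small illustration of the decimal formatting; the helper below is a generic sketch, not the vendored progress bar's code:

    package main

    import "fmt"

    // humanBytes formats n using decimal (SI) units.
    func humanBytes(n int64) string {
        const unit = 1000
        if n < unit {
            return fmt.Sprintf("%d B", n)
        }
        div, exp := int64(unit), 0
        for m := n / unit; m >= unit; m /= unit {
            div *= unit
            exp++
        }
        return fmt.Sprintf("%.1f %cB", float64(n)/float64(div), "kMGTPE"[exp])
    }

    func main() {
        // 3826831360 bytes is "3.8 GB" in decimal but only about 3.6 GiB in binary units.
        fmt.Println(humanBytes(3826831360))
    }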
Michael Yang
84200dcde6
use readline
2023-07-19 13:34:56 -07:00
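This switches the interactive prompt to a readline-style line editor. A minimal REPL loop using github.com/chzyer/readline, named here as an assumption about which readline library is in use:

    package main

    import (
        "fmt"
        "io"
        "log"

        "github.com/chzyer/readline"
    )

    func main() {
        rl, err := readline.New(">>> ")
        if err != nil {
            log.Fatal(err)
        }
        defer rl.Close()

        for {
            line, err := rl.Readline()
            if err != nil { // io.EOF on Ctrl-D, readline.ErrInterrupt on Ctrl-C
                if err == io.EOF || err == readline.ErrInterrupt {
                    return
                }
                log.Fatal(err)
            }
            fmt.Println("you said:", line)
        }
    }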
Patrick Devine
5bea29f610
add new list command (#97)
2023-07-18 09:09:45 -07:00
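Adding a list subcommand to a cobra-based CLI (which is how the ollama command is structured) amounts to registering another cobra.Command on the root. A stripped-down sketch; the printed output is a placeholder for the real call that queries the server for installed models:

    package main

    import (
        "fmt"
        "os"

        "github.com/spf13/cobra"
    )

    func main() {
        root := &cobra.Command{Use: "ollama"}

        listCmd := &cobra.Command{
            Use:   "list",
            Short: "List models",
            RunE: func(cmd *cobra.Command, args []string) error {
                // Placeholder: the real command asks the server for its models.
                fmt.Println("NAME\tSIZE\tMODIFIED")
                return nil
            },
        }
        root.AddCommand(listCmd)

        if err := root.Execute(); err != nil {
            os.Exit(1)
        }
    }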
Michael Yang
28a136e9a3
modelfile params
2023-07-17 12:35:03 -07:00
Michael Yang
a806b03f62
no errgroup
2023-07-11 14:58:10 -07:00
Michael Yang
fd4792ec56
call llama.cpp directly from go
2023-07-11 11:59:18 -07:00
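Calling llama.cpp directly from Go means cgo: the native code is compiled into the binary and invoked through the "C" pseudo-package. The snippet below only shows the cgo mechanism with a stand-in C function; it is not the actual llama.cpp API:

    package main

    /*
    #include <stdlib.h>

    // Stand-in for a llama.cpp entry point; the real bindings declare and link
    // against the library's C API instead.
    static int token_count(const char* s) {
        int n = 0;
        for (; *s; s++) if (*s == ' ') n++;
        return n + 1;
    }
    */
    import "C"

    import (
        "fmt"
        "unsafe"
    )

    func main() {
        cs := C.CString("why is the sky blue")
        defer C.free(unsafe.Pointer(cs))
        fmt.Println(int(C.token_count(cs))) // 5
    }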
Michael Yang
c4b9e84945
progress
2023-07-06 17:07:40 -07:00
Michael Yang
3d6009aae3
run prompts
2023-07-06 17:07:40 -07:00
Bruce MacDonald
7cf5905063
display pull progress
2023-07-06 16:34:44 -04:00
Michael Yang
68e6b4550c
use prompt templates
2023-07-06 16:34:44 -04:00
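Prompt templates here are ordinary Go text/template strings rendered with the user's input. A small example of the mechanism; the template text and field names are illustrative, not a real model's template:

    package main

    import (
        "log"
        "os"
        "text/template"
    )

    func main() {
        // A chat-style prompt template; real model templates vary.
        const prompt = `{{ if .System }}### System:
    {{ .System }}
    {{ end }}### User:
    {{ .Prompt }}
    ### Response:
    `
        tmpl := template.Must(template.New("prompt").Parse(prompt))

        err := tmpl.Execute(os.Stdout, map[string]string{
            "System": "You are a helpful assistant.",
            "Prompt": "Why is the sky blue?",
        })
        if err != nil {
            log.Fatal(err)
        }
    }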
Jeffrey Morgan
fd962a36e5
client updates
2023-07-06 16:34:44 -04:00
Jeffrey Morgan
6093a88c1a
add llama.cpp go bindings
2023-07-06 16:34:44 -04:00
Jeffrey Morgan
76cb60d496
wip go engine
Co-authored-by: Patrick Devine <pdevine@sonic.net>
2023-07-06 16:34:44 -04:00