ollama

Author	SHA1	Message	Date
Michael Yang	8acb233668	use strings.Builder	2024-05-01 10:01:09 -07:00
Michael Yang	119589fcb3	rename parser to model/file	2024-05-01 09:53:50 -07:00
Michael Yang	5ea844964e	cmd: import regexp	2024-05-01 09:53:45 -07:00
Michael Yang	bd8eed57fc	fix parser name	2024-05-01 09:52:54 -07:00
Michael Yang	9cf0f2e973	use parser.Format instead of templating modelfile	2024-05-01 09:52:54 -07:00
Michael Yang	176ad3aa6e	parser: add commands format	2024-05-01 09:52:54 -07:00
Michael Yang	4d08363580	comments	2024-05-01 09:52:54 -07:00
Michael Yang	8907bf51d2	fix multiline	2024-05-01 09:52:54 -07:00
Michael Yang	abe614c705	tests	2024-05-01 09:52:54 -07:00
Michael Yang	238715037d	linting	2024-05-01 09:52:54 -07:00
Michael Yang	c0a00f68ae	refactor modelfile parser	2024-05-01 09:52:54 -07:00
Jeffrey Morgan	f0c454ab57	gpu: add 512MiB to darwin minimum, metal doesn't have partial offloading overhead (#4068)	2024-05-01 11:46:03 -04:00
Blake Mizerany	b9f74ff3d6	types/model: reintroduce Digest (#4065)	2024-04-30 16:38:03 -07:00
jmorganca	fcf4d60eee	llm: add back check for empty token cache	2024-04-30 17:38:44 -04:00
jmorganca	e33d5c2dbc	update llama.cpp commit to `952d03d`	2024-04-30 17:31:20 -04:00
Jeffrey Morgan	18d9a7e1f1	update llama.cpp submodule to `f364eb6` (#4060)	2024-04-30 17:25:39 -04:00
Michael	8488388cbd	Update README.md	2024-04-30 15:45:56 -04:00
Blake Mizerany	588901f449	types/model: reduce Name.Filepath allocs from 5 to 2 (#4039)	2024-04-30 11:09:19 -07:00
Bruce MacDonald	0a7fdbe533	prompt to display and add local ollama keys to account (#3717) - return descriptive error messages when unauthorized to create blob or push a model - display the local public key associated with the request that was denied	2024-04-30 11:02:08 -07:00
Christian Frantzen	5950c176ca	Update langchainpy.md (#4037) Updated the code a bit	2024-04-29 23:19:06 -04:00
Daniel Hiltgen	23d23409a0	Update llama.cpp (#4036) * Bump llama.cpp to b2761 * Adjust types for bump	2024-04-29 23:18:48 -04:00
Patrick Devine	9009bedf13	better checking for OLLAMA_HOST variable (#3661)	2024-04-29 19:14:07 -04:00
Daniel Hiltgen	d4ac57e240	Merge pull request #4035 from dhiltgen/fix_relative_paths Fix relative path lookup	2024-04-29 16:08:06 -07:00
Daniel Hiltgen	7b59d1770f	Fix relative path lookup	2024-04-29 16:00:08 -07:00
Jeffrey Morgan	95ead8ffba	Restart server on failure when running Windows app (#3985) * app: restart server on failure * fix linter * address comments * refactor log directory creation to be where logs are written * check all log dir creation errors	2024-04-29 10:07:52 -04:00
Jeffrey Morgan	7aa08a77ca	llm: dont cap context window limit to training context window (#3988)	2024-04-29 10:07:30 -04:00
Blake Mizerany	7e432cdfac	types/model: remove old comment (#4020)	2024-04-28 20:52:26 -07:00
Jeffrey Morgan	586672f490	fix copying model to itself (#4019)	2024-04-28 23:47:49 -04:00
Daniel Hiltgen	b03408de74	Merge pull request #3972 from hmartinez82/win_arm64 Add support for building on Windows ARM64	2024-04-28 14:52:58 -07:00
Daniel Hiltgen	1e6a28bf5b	Merge pull request #4009 from dhiltgen/cpu_concurrency Fix concurrency for CPU mode	2024-04-28 14:20:27 -07:00
Daniel Hiltgen	d6e3b64582	Fix concurrency for CPU mode Prior refactoring passes accidentally removed the logic to bypass VRAM checks for CPU loads. This adds that back, along with test coverage. This also fixes loaded map access in the unit test to be behind the mutex which was likely the cause of various flakes in the tests.	2024-04-28 13:42:39 -07:00
Blake Mizerany	114c932a8e	types/model: allow _ as starter character in Name parts (#3991)	2024-04-27 21:24:52 -07:00
Jeffrey Morgan	7f7103de06	mac: update setup command to `llama3` (#3986)	2024-04-27 22:52:10 -04:00
Blake Mizerany	c631a9c726	types/model: relax name length constraint from 2 to 1 (#3984)	2024-04-27 17:58:41 -07:00
Blake Mizerany	8fd9e56804	types/structs: drop unused structs package (#3981)	2024-04-27 14:06:11 -07:00
Hernan Martinez	8a65717f55	Do not build AVX runners on ARM64	2024-04-26 23:55:32 -06:00
Hernan Martinez	6d3152a98a	Use architecture specific folders in installer script	2024-04-26 23:35:16 -06:00
Hernan Martinez	b438d485f1	Use architecture specific folders in the generate script	2024-04-26 23:34:12 -06:00
Hernan Martinez	204349b17b	Use architecture specific folders in the build script	2024-04-26 23:26:03 -06:00
Hernan Martinez	86e67fc4a9	Add import declaration for windows,arm64 to llm.go	2024-04-26 23:23:53 -06:00
Blake Mizerany	2bed62926e	types/model: remove Digest (for now) (#3970) The Digest type needs more thought and is not necessary at the moment.	2024-04-26 21:14:28 -07:00
Jeffrey Morgan	aad8d128a0	also look at cwd as a root for windows runners (#3959)	2024-04-26 19:14:08 -04:00
Daniel Hiltgen	ec1acbb867	Merge pull request #3968 from dhiltgen/win_generate Fine grain control over windows generate steps	2024-04-26 16:03:38 -07:00
Daniel Hiltgen	e4859c4563	Fine grain control over windows generate steps This will speed up CI which already tries to only build static for unit tests	2024-04-26 15:49:46 -07:00
Nataly Merezhuk	8e30eb26bd	Updates the setup command to use llama3. (#3962)	2024-04-26 18:41:01 -04:00
Daniel Hiltgen	0b5c589ca2	Merge pull request #3966 from dhiltgen/bump Fix target in gen_windows.ps1	2024-04-26 15:36:53 -07:00
Michael Yang	65fadddc85	Merge pull request #3964 from ollama/mxyng/weights fix gemma, command-r layer weights	2024-04-26 15:23:33 -07:00
Daniel Hiltgen	ed5fb088c4	Fix target in gen_windows.ps1	2024-04-26 15:10:42 -07:00
Michael Yang	f81f308118	fix gemma, command-r layer weights	2024-04-26 15:00:55 -07:00
Blake Mizerany	b1390a7b37	types/model: export ParseNameBare and Merge (#3957) These are useful outside this package.	2024-04-26 14:58:07 -07:00

2541 commits