Daniel Hiltgen
|
29e90cc13b
|
Implement new Go based Desktop app
This focuses on Windows first, but coudl be used for Mac
and possibly linux in the future.
|
2024-02-15 05:56:45 +00:00 |
|
Daniel Hiltgen
|
e02ecfb6c8
|
Merge pull request #2116 from dhiltgen/cc_50_80
Add support for CUDA 5.0 cards
|
2024-01-27 10:28:38 -08:00 |
|
Jeffrey Morgan
|
a64570dcae
|
Fix clearing kv cache between requests with the same prompt (#2186)
* Fix clearing kv cache between requests with the same prompt
* fix powershell script
|
2024-01-25 13:46:20 -08:00 |
|
Daniel Hiltgen
|
a447a083f2
|
Add compute capability 5.0, 7.5, and 8.0
|
2024-01-20 14:24:05 -08:00 |
|
Jeffrey Morgan
|
4c54f0ddeb
|
sign dylibs on macOS (#2101)
|
2024-01-19 19:24:11 -05:00 |
|
Jeffrey Morgan
|
dc88cc3981
|
use gzip for runner embedding (#2067)
|
2024-01-19 13:23:03 -05:00 |
|
Daniel Hiltgen
|
1b249748ab
|
Add multiple CPU variants for Intel Mac
This also refines the build process for the ext_server build.
|
2024-01-17 15:08:54 -08:00 |
|
Daniel Hiltgen
|
39928a42e8
|
Always dynamically load the llm server library
This switches darwin to dynamic loading, and refactors the code now that no
static linking of the library is used on any platform
|
2024-01-11 08:42:47 -08:00 |
|
Bruce MacDonald
|
3367b5f3df
|
remove unused generate patches (#1810)
|
2024-01-05 11:25:45 -05:00 |
|
Daniel Hiltgen
|
77d96da94b
|
Code shuffle to clean up the llm dir
|
2024-01-04 12:12:05 -08:00 |
|