..
ext_server
IBM granite/granitemoe architecture support ( #6760 )
2024-10-17 11:59:52 -07:00
generate
llm: Remove GGML_CUDA_NO_PEER_COPY for ROCm ( #7174 )
2024-10-12 09:56:49 -07:00
llama.cpp@ 3f1ae2e32c
IBM granite/granitemoe architecture support ( #6760 )
2024-10-17 11:59:52 -07:00
patches
IBM granite/granitemoe architecture support ( #6760 )
2024-10-17 11:59:52 -07:00
filetype.go
Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL ( #4322 )
2024-05-23 13:21:49 -07:00
ggla.go
image processing for llama3.2 ( #6963 )
2024-10-18 16:12:35 -07:00
ggml.go
image processing for llama3.2 ( #6963 )
2024-10-18 16:12:35 -07:00
ggml_test.go
llm: speed up gguf decoding by a lot ( #5246 )
2024-06-24 21:47:52 -07:00
gguf.go
image processing for llama3.2 ( #6963 )
2024-10-18 16:12:35 -07:00
llm_darwin.go
Optimize container images for startup ( #6547 )
2024-09-12 12:10:30 -07:00
llm_linux.go
Optimize container images for startup ( #6547 )
2024-09-12 12:10:30 -07:00
llm_windows.go
runner: Set windows above normal priority ( #6905 )
2024-09-21 16:54:49 -07:00
memory.go
image processing for llama3.2 ( #6963 )
2024-10-18 16:12:35 -07:00
memory_test.go
Rename gpu package discover ( #7143 )
2024-10-16 17:45:00 -07:00
server.go
runner.go: Better handle return NULL values from llama.cpp
2024-10-28 18:12:29 -07:00
status.go
Catch one more error log
2024-08-05 09:28:07 -07:00