Jeffrey Morgan
5cba29b9d6
JSON mode: add `"format" as an api parameter ( #1051 )
...
* add `"format": "json"` as an API parameter
---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
2023-11-09 16:44:02 -08:00
Bruce MacDonald
5b39503bcd
document specifying multiple stop params ( #1061 )
2023-11-09 13:16:26 -08:00
Matt Williams
dd3dc47ddb
Merge pull request #992 from aashish2057/aashish2057/langchainjs_doc_update
2023-11-09 05:08:31 -08:00
Bruce MacDonald
a49d6acc1e
add a complete /generate options example ( #1035 )
2023-11-08 16:44:36 -08:00
Bruce MacDonald
ec2a31e9b3
support raw generation requests ( #952 )
...
- add the optional `raw` generate request parameter to bypass prompt formatting and response context
-add raw request to docs
2023-11-08 14:05:02 -08:00
Matt Williams
1d155caba3
docs: clarify where the models are stored in the faq
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-11-06 14:38:49 -08:00
aashish2057
b13586cc72
update langchainjs doc
2023-11-03 18:45:19 -05:00
Bruce MacDonald
6109bebba6
reformat api docs for more examples ( #972 )
2023-11-03 10:57:00 -04:00
Matt Williams
f21bd6210d
docs: clarify and clean up API docs
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-31 13:11:33 -07:00
Dirk Loss
874bb31986
Fix conversion command for gptneox ( #948 )
2023-10-30 14:34:29 -04:00
Jeffrey Morgan
c0dcea1398
Update faq.md
2023-10-27 18:29:00 -07:00
Bruce MacDonald
5c3491f425
allow for a configurable ollama model storage directory ( #897 )
...
* allow for a configurable ollama models directory
- set OLLAMA_MODELS in the environment that ollama is running in to change where model files are stored
- update docs
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
Co-Authored-By: Jay Nakrani <dhananjaynakrani@gmail.com>
Co-Authored-By: Akhil Acharya <akhilcacharya@gmail.com>
Co-Authored-By: Sasha Devol <sasha.devol@protonmail.com>
2023-10-27 10:19:59 -04:00
Michael Yang
92119de9d8
update linux.md
2023-10-25 14:57:50 -07:00
Michael Yang
53b0ba8d43
Merge pull request #893 from jmorganca/mxyng/update-faq
...
update faq
2023-10-24 16:02:35 -07:00
Michael Yang
db342691f9
Update docs/faq.md
...
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
2023-10-24 13:59:33 -07:00
Bruce MacDonald
cecf83141e
Linux uninstall instructions ( #894 )
2023-10-24 14:07:05 -04:00
Michael Yang
a5a2adf1ec
update faq
2023-10-24 10:54:16 -07:00
Jeffrey Morgan
914428351a
Update import.md
2023-10-23 17:44:53 -07:00
Jeffrey Morgan
9afea9e3b9
Update import.md
...
Separate GGUF and PyTorch guides
2023-10-23 17:42:17 -07:00
Jeffrey Morgan
6b213216d5
Update import.md
2023-10-19 12:17:36 -04:00
Alexander F. Rødseth
b7e137323a
Fix a typo ( #818 )
2023-10-17 09:00:15 -04:00
Bruce MacDonald
a0c3e989de
deprecate modelfile embed command ( #759 )
2023-10-16 11:07:37 -04:00
Jeffrey Morgan
f9b2f999ac
update readme with docker
setup and link to import.md
2023-10-15 02:23:03 -04:00
Jeffrey Morgan
c416087339
import.md
: formatting and spelling
2023-10-15 01:39:46 -04:00
Jeffrey Morgan
6002cebd2c
import.md
: convert and quantize docs
2023-10-15 00:11:51 -04:00
Jeffrey Morgan
212bdc541c
import.md
: model architectures spelling
2023-10-15 00:07:58 -04:00
Jeffrey Morgan
dca6686273
add steps for creating a Modelfile and more example commands to import.md
2023-10-15 00:05:50 -04:00
Matt Williams
b2974a7095
applied mikes comments
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-14 08:29:24 -07:00
Matt Williams
3c975f898f
update doc to refer to docker image
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-12 15:57:50 -07:00
Matt Williams
9245c8a1df
add how to quantize doc
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-12 15:34:57 -07:00
Bruce MacDonald
274d5a5fdf
optional parameter to not stream response ( #639 )
...
* update streaming request accept header
* add optional stream param to request bodies
2023-10-11 12:54:27 -04:00
Costa Alexoglou
f7f5169c94
Update api.md ( #741 )
...
Avoid triple ticks in visual editor and also copied in clipboard.
2023-10-09 16:01:46 -04:00
James Braza
6f2ce74231
Got rif of all caps to show it can be lower case
2023-10-02 13:54:27 -07:00
James Braza
6edcc5c79f
Using code highlighting syntax around Modelfile
2023-10-02 13:46:05 -07:00
Jiayu Liu
4fc10acce9
add some missing code directives in docs ( #664 )
2023-10-01 11:51:01 -07:00
Jay Nakrani
1d0ebe67e8
Document response stream chunk delimiter. ( #632 )
...
Document response stream chunk delimiter.
2023-09-29 21:45:52 -07:00
Aaron Coffey
6ae33d8141
Update modelfile.md to reflect the usage of num_gpu. ( #629 )
2023-09-28 10:21:21 -04:00
Jeffrey Morgan
c5664c1fef
Update faq.md
2023-09-27 13:49:43 -07:00
Bruce MacDonald
ed20837f9a
Update modelfile.md
2023-09-27 10:38:10 -04:00
James Braza
1db2a61dd0
Added num_predict to the options table ( #614 )
2023-09-27 10:26:08 -04:00
Jeffrey Morgan
5306b0269d
Update linux.md
2023-09-25 16:10:32 -07:00
Jeffrey Morgan
0fb5268496
Update linux.md
2023-09-25 10:06:23 -07:00
Jeffrey Morgan
ee3032ad89
improvements to docs/linux.md
2023-09-24 21:50:07 -07:00
Jeffrey Morgan
5b7a27281d
improvements to docs/linux.md
2023-09-24 21:38:23 -07:00
Jeffrey Morgan
d2a784e33e
add docs/linux.md
2023-09-24 21:34:44 -07:00
Michael Yang
6c6a31a1e8
embed libraries using cmake
2023-09-20 14:41:57 -07:00
Bruce MacDonald
fc6ec356fc
remove libcuda.so
2023-09-20 20:36:14 +01:00
Bruce MacDonald
1255bc9b45
only package 11.8 runner
2023-09-20 20:00:41 +01:00
Bruce MacDonald
4e8be787c7
pack in cuda libs
2023-09-20 17:40:42 +01:00
Bruce MacDonald
2540c9181c
support for packaging in multiple cuda runners ( #509 )
...
* enable packaging multiple cuda versions
* use nvcc cuda version if available
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
2023-09-14 15:08:13 -04:00