Commit 518dd98

Add valid KV cache data types
1 parent acb17f6 commit 518dd98

1 file changed: 1 addition, 1 deletion
examples/server.ps1

Lines changed: 1 addition & 1 deletion
@@ -23,7 +23,7 @@ Specifies the number of layers offloaded into the GPU.
 Specifies the models context length it was trained on.
 
 .PARAMETER kvCacheDataType
-Specifies the KV cache data type.
+Specifies the KV cache data type (options: f32, f16, q8_0, q4_0, q4_1, iq4_nl, q5_0, or q5_1).
 
 .PARAMETER verbose
 Increases the verbosity of the llama.cpp server.
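
This commit only changes the help text. As a hedged sketch, the same list of values could also be enforced at parameter binding time with PowerShell's built-in ValidateSet attribute, so that a value outside the list is rejected before the server is started. The [String] type and the absence of a default value are assumptions; the actual param block of examples/server.ps1 is not shown in this diff.

```powershell
# Hypothetical param block fragment, not taken from examples/server.ps1.
param(
    # Accept only the KV cache data types listed in the help text above.
    [ValidateSet("f32", "f16", "q8_0", "q4_0", "q4_1", "iq4_nl", "q5_0", "q5_1")]
    [String]
    $kvCacheDataType
)
```

With a declaration like this, PowerShell would reject an unlisted value such as `-kvCacheDataType q3_0` with a parameter validation error and would offer the listed values for tab completion.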