We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent fdc2bd4 commit d39fc55Copy full SHA for d39fc55
CHANGELOG.md
@@ -4,6 +4,14 @@ All notable changes to this project will be documented in this file.
4
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
5
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
6
7
+## [1.17.0] - 2024-06-04
8
+
9
+### Added
10
+- [Server] Add kvCacheDataType option
11
+- [Server] Automatically enable q4_0 quantized KV cache with Flash Attention
12
+- [Server] Automatically enable Flash Attention on GPUS with at least Pascal architecture
13
+- [Build] Enable parallel building with CMake utilizing all CPU threads
14
15
## [1.16.0] - 2024-05-30
16
17
### Added
0 commit comments