You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: charts/deepgram-self-hosted/README.md
+6-3Lines changed: 6 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -180,6 +180,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin
180
180
| api.additionalAnnotations | object | `nil` | Additional annotations to add to the API deployment |
181
181
| api.additionalLabels | object | `{}` | Additional labels to add to API resources |
182
182
| api.affinity | object | `{}` | [Affinity and anti-affinity](https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#affinity-and-anti-affinity) to apply for API pods. |
183
+
| api.containerSecurityContext | object | `{}` | [Container-level security context](https://kubernetes.io/docs/tasks/configure-pod-container/security-context/#set-the-security-context-for-a-container) for API containers. |
183
184
| api.driverPool | object | `` | driverPool configures the backend pool of speech engines (generically referred to as "drivers" here). The API will load-balance among drivers in the standard pool; if one standard driver fails, the next one will be tried. |
184
185
| api.driverPool.standard | object | `` | standard is the main driver pool to use. |
185
186
| api.driverPool.standard.maxResponseSize | string | `"1073741824"` | Maximum response to deserialize from Driver (in bytes). Default is 1GB, expressed in bytes. |
@@ -201,7 +202,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin
201
202
| api.resolver.maxTTL | int | `nil` | maxTTL sets the DNS TTL value if specifying a custom DNS nameserver. |
202
203
| api.resolver.nameservers | list | `[]` | nameservers allows for specifying custom domain name server(s). A valid list item's format is "{IP} {PORT} {PROTOCOL (tcp or udp)}", e.g. `"127.0.0.1 53 udp"`. |
203
204
| api.resources | object |``| Configure resource limits per API container. See [Deepgram's documentation](https://developers.deepgram.com/docs/self-hosted-deployment-environments#api) for more details. |
204
-
| api.securityContext | object | `{}` | [Security context](https://kubernetes.io/docs/tasks/configure-pod-container/security-context/) for API pods. |
205
+
| api.securityContext | object | `{}` | [Pod-level security context](https://kubernetes.io/docs/tasks/configure-pod-container/security-context/#set-the-security-context-for-a-pod) for API pods. |
205
206
| api.server | object | `` | Configure how the API will listen for your requests |
206
207
| api.server.callbackConnTimeout | string | `"1s"` | callbackConnTimeout configures how long to wait for a connection to a callback URL. See [Deepgram's callback documentation](https://developers.deepgram.com/docs/callback) for more details. The value should be a humantime duration. |
207
208
| api.server.callbackTimeout | string |`"10s"`| callbackTimeout configures how long to waitfor a response from a callback URL. See [Deepgram's callback documentation](https://developers.deepgram.com/docs/callback) for more details. The value should be a humantime duration. |
@@ -229,6 +230,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin
229
230
| engine.chunking.speechToText.streaming.minDuration | float | `nil` | minDuration is the minimum audio duration for a STT chunk size for a streaming request |
230
231
| engine.chunking.speechToText.streaming.step | float | `1` | step defines how often to return interim results, in seconds. This value may be lowered to increase the frequency of interim results. However, this also causes a significant decrease in the number of concurrent streams supported by a single GPU. Please contact your Deepgram Account representative for more details. |
231
232
| engine.concurrencyLimit.activeRequests | int | `nil` | activeRequests limits the number of active requests handled by a single Engine container. If additional requests beyond the limit are sent, the API container forming the request will try a different Engine pod. If no Engine pods are able to accept the request, the API will return a 429 HTTP response to the client. The `nil` default means no limit will be set. |
| engine.features.streamingNer | bool | `false` | Enables format entity tags on streaming audio *if* a valid NER model is available. |
233
235
| engine.halfPrecision.state | string | `"auto"` | Engine will automatically enable half precision operations if your GPU supports them. You can explicitly enable or disable this behavior with the state parameter which supports `"enable"`, `"disabled"`, and `"auto"`. |
234
236
| engine.image.path | string | `"quay.io/deepgram/self-hosted-engine"` | path configures the image path to use for creating Engine containers. You may change this from the public Quay image path if you have imported Deepgram images into a private container registry. |
@@ -259,7 +261,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin
259
261
| engine.resources | object | `` | Configure resource limits per Engine container. See [Deepgram's documentation](https://developers.deepgram.com/docs/self-hosted-deployment-environments#engine) for more details. |
260
262
| engine.resources.limits.gpu | int |`1`| gpu maps to the nvidia.com/gpu resource parameter |
261
263
| engine.resources.requests.gpu | int |`1`| gpu maps to the nvidia.com/gpu resource parameter |
| engine.server | object |``| Configure Engine containers to listen for requests from API containers. |
264
266
| engine.server.host | string |`"0.0.0.0"`| host is the IP address to listen on forinference requests. You will want to listen on all interfaces to interact with other podsin the cluster. |
265
267
| engine.server.port | int |`8080`| port to listen on for inference requests |
@@ -287,6 +289,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin
287
289
| licenseProxy.additionalAnnotations | object |`nil`| Additional annotations to add to the LicenseProxy deployment |
288
290
| licenseProxy.additionalLabels | object |`{}`| Additional labels to add to License Proxy resources |
289
291
| licenseProxy.affinity | object |`{}`| [Affinity and anti-affinity](https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#affinity-and-anti-affinity) to apply for License Proxy pods. |
| licenseProxy.deploySecondReplica | bool |`false`| If the License Proxy is deployed, one replica should be sufficient to support many API/Engine pods. Highly available environments may wish to deploy a second replica to ensure uptime, which can be toggled with this option. |
291
294
| licenseProxy.enabled | bool |`false`| The License Proxy is optional, but highly recommended to be deployed in production to enable highly available environments. |
292
295
| licenseProxy.image.path | string |`"quay.io/deepgram/self-hosted-license-proxy"`| path configures the image path to use for creating License Proxy containers. You may change this from the public Quay image path if you have imported Deepgram images into a private container registry. |
@@ -297,7 +300,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin
297
300
| licenseProxy.namePrefix | string |`"deepgram-license-proxy"`| namePrefix is the prefix to apply to the name of all K8s objects associated with the Deepgram License Proxy containers. |
| licenseProxy.resources | object |``| Configure resource limits per License Proxy container. See [Deepgram's documentation](https://developers.deepgram.com/docs/license-proxy#system-requirements) for more details. |
300
-
| licenseProxy.securityContext | object | `{}` | [Security context](https://kubernetes.io/docs/tasks/configure-pod-container/security-context/) for API pods. |
| licenseProxy.server | object | `` | Configure how the license proxy will listen for licensing requests. |
302
305
| licenseProxy.server.baseUrl | string | `"/"` | baseUrl is the prefix for incoming license verification requests. |
303
306
| licenseProxy.server.host | string | `"0.0.0.0"` | host is the IP address to listen on. You will want to listen on all interfaces to interact with other pods in the cluster. |
0 commit comments