Skip to content
Open
Show file tree
Hide file tree
Changes from 7 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions charts/azimuth-chat/azimuth-ui.schema.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -24,6 +24,7 @@ sortOrder:
- /azimuth-llm/ui/appSettings/model_instruction
- /azimuth-llm/ui/appSettings/page_title
- /azimuth-llm/api/image/version
- /azimuth-llm/api/gpus
- /azimuth-llm/ui/appSettings/llm_params/temperature
- /azimuth-llm/ui/appSettings/llm_params/max_tokens
- /azimuth-llm/ui/appSettings/llm_params/frequency_penalty
Expand Down
8 changes: 8 additions & 0 deletions charts/azimuth-chat/values.schema.json
Original file line number Diff line number Diff line change
Expand Up @@ -43,6 +43,14 @@
"default": "v0.10.2"
}
}
},
"gpus": {
"type": "integer",
"title": "GPU Count",
"description": "The number of GPUs to allocate to the model.",
"default": 1,
"minimum": 0,
"maximum": 8
}
}
},
Expand Down
4 changes: 4 additions & 0 deletions charts/azimuth-llm/templates/api/deployment.yml
Original file line number Diff line number Diff line change
Expand Up @@ -47,6 +47,10 @@ spec:
{{- if .Values.api.extraArgs -}}
{{- .Values.api.extraArgs | toYaml | nindent 10 }}
{{- end -}}
{{- if gt (.Values.api.gpus | int) 1 }}
- --tensor-parallel-size
- {{ .Values.api.gpus | quote }}
{{- end }}
{{- if .Values.huggingface.secretName -}}
envFrom:
- secretRef:
Expand Down
8 changes: 8 additions & 0 deletions charts/azimuth-llm/values.schema.json
Original file line number Diff line number Diff line change
Expand Up @@ -40,6 +40,14 @@
"default": "v0.10.2"
}
}
},
"gpus": {
"type": "integer",
"title": "GPU Count",
"description": "The number of GPUs to allocate to the model.",
"default": 1,
"minimum": 0,
"maximum": 8
}
}
},
Expand Down