Skip to content

Commit 69c5bff

Browse files
committed
update workflow
1 parent 8e4f26a commit 69c5bff

File tree

1 file changed

+19
-0
lines changed

1 file changed

+19
-0
lines changed

README.md

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -55,26 +55,45 @@ Code
5555
- After stress test move from stage to prod
5656
- Comment on the commit with cml for stress test results
5757
- in actions change the workflow comment to PR
58+
- good one - ecr image May 17, 2025, 00:03:15 (UTC+05.5) - take this and move forward.
59+
60+
Note: If we build a gpu image from github runner then its throwing below error, so i have prebuilt image
61+
62+
```
63+
/opt/conda/lib/python3.11/site-packages/torch/cuda/__init__.py:734: UserWarning: Can't initialize NVML
64+
warnings.warn("Can't initialize NVML")
65+
```
66+
if no thing works then use the model-onnx-server image
67+
### Pending for Deployment 3
68+
- lambda
5869

5970
Docs
6071
- Architecture diagram
6172
- Screenshots of deployment and video
6273

6374
Explanation: Deployment 01
6475
Architecture diagram
76+
dvc setup and pull - dataset
6577
start from train and store feature
6678
then go with optuna taking the lowest loss model and saving it - hyper paramter optimization
79+
models usage
80+
onnx model generation
81+
torch script model for usage
82+
mar for torch serve usage
83+
accuracy txt for verification
6784
then transfering the outputs to s3-dev
6885

6986
Now kubernetes
7087
setting up of kserve, knative, argocd, promethus, grafana
7188
show the screenshots on load test
7289

7390
Now deployments workflow
91+
show the ecr repo and docker files
7492
on PR generates a model in dev
7593
compare and post comment in PR
7694

7795
Now kubernetes deployment - github actions
96+
show the ecr repo and docker all 3 files
7897
explain secrets setup - github workflow
7998
on push to main branch
8099
train and push to stage

0 commit comments

Comments
 (0)