Warmup batching #112

@jkiczka-nvidia

Description

@jkiczka-nvidia

Description

batching=False doesn't work with warmup. An additional dimension is always added at the beginning of the request, making it impossible to warm up a sample without a batch dimension, regardless of whether batch_size=0 or batch_size=1 is set in ModelWarmup().

To reproduce

# server
import numpy as np
from pytriton.model_config import ModelConfig, Tensor
from pytriton.model_config.common import ModelWarmup, WarmupInput
from pytriton.triton import Triton


def _infer_fn(requests):
    # With batching=False this should receive the raw (2, 3) sample,
    # but during warmup an extra leading dimension shows up.
    print(requests[0].data['input1'])
    print(requests[0].data['input1'].shape)
    return [{"out": np.array([1.0], dtype=np.float32)}]

with Triton() as triton:
    warmup = ModelWarmup(
        name="warmup",
        batch_size=1,  # setting to 0 or 1
        inputs={
            "input1": WarmupInput(
                dtype=np.float32,
                shape=(2, 3),
                zero_data=True,
            ),
        },
        count=1,
    )

    triton.bind(
        model_name="MyModel",
        infer_func=_infer_fn,
        inputs=[
            Tensor(name="input1", dtype=np.float32, shape=(2, 3)),
        ],
        outputs=[
            Tensor(name="out", dtype=np.float32, shape=(-1,)),
        ],
        config=ModelConfig(
            batching=False,
            model_warmup=[warmup],
        ),
    )
    triton.serve()


# output for batch_size=0:
# []
# (0, 2, 3)

# output for batch_size=1:
# [[[0. 0. 0.]
#   [0. 0. 0.]]]
# (1, 2, 3)
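Until this is fixed, one possible workaround (a sketch, not part of the original report) is to strip the spurious leading axis inside the inference function when batching=False. The helper name below is hypothetical; note that with batch_size=0 the warmup tensor is empty, so the sample data cannot be recovered at all, which is why only the size-1 case is handled:

```python
import numpy as np


def strip_leading_batch_dim(arr: np.ndarray, expected_ndim: int) -> np.ndarray:
    """Drop a spurious size-1 leading batch axis added during warmup.

    Hypothetical workaround helper: if the array has one more dimension
    than the model declares and that extra axis has size 1, squeeze it;
    otherwise return the array unchanged.
    """
    if arr.ndim == expected_ndim + 1 and arr.shape[0] == 1:
        return np.squeeze(arr, axis=0)
    return arr


# Warmup with batch_size=1 delivers shape (1, 2, 3) instead of (2, 3):
warmed = np.zeros((1, 2, 3), dtype=np.float32)
print(strip_leading_batch_dim(warmed, expected_ndim=2).shape)  # (2, 3)

# A regular non-batched request with the declared shape passes through:
sample = np.zeros((2, 3), dtype=np.float32)
print(strip_leading_batch_dim(sample, expected_ndim=2).shape)  # (2, 3)
```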

Environment

pytriton version: 0.6.0

Labels

non-stale