Open
Description
Request
Can you add support for GroundingDINO model conversion to TensorRT?
Why GroundingDINO?
Popular open-vocabulary object detection model
Text + image input for flexible detection
Good accuracy but slow inference in PyTorch
Would benefit greatly from TensorRT optimization
Current Status
✅ Works in MMDetection
❌ Not supported in mmdetection-to-tensorrt
Technical Notes
Takes two inputs (image + text tokens)
Includes a BERT text encoder
Uses cross-modal attention layers between image and text features
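To make the dual-input point concrete, here is a rough sketch of the input description a converter might need, since a TensorRT optimization profile takes a min/opt/max shape per input. Everything in it is an assumption for illustration: the input names (`image`, `input_ids`, `attention_mask`), the shape values, and the `InputSpec` / `check_inputs` helpers are hypothetical and are not taken from mmdetection-to-tensorrt or from the GroundingDINO config.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

Shape = Tuple[int, ...]


@dataclass(frozen=True)
class InputSpec:
    """Hypothetical min/opt/max shape range for one engine input."""
    min_shape: Shape
    opt_shape: Shape
    max_shape: Shape

    def accepts(self, shape: Shape) -> bool:
        # A shape is accepted if it has the same rank and every
        # dimension lies within [min, max] for that axis.
        if len(shape) != len(self.min_shape):
            return False
        return all(
            lo <= dim <= hi
            for lo, dim, hi in zip(self.min_shape, shape, self.max_shape)
        )


# Illustrative values only (assumed, not read from any real config).
# The image input varies in height/width; the text inputs vary in
# token count, which is what makes the model "dual input".
GROUNDING_DINO_INPUTS: Dict[str, InputSpec] = {
    "image": InputSpec((1, 3, 320, 320), (1, 3, 800, 1344), (1, 3, 1344, 1344)),
    "input_ids": InputSpec((1, 1), (1, 32), (1, 256)),
    "attention_mask": InputSpec((1, 1), (1, 32), (1, 256)),
}


def check_inputs(shapes: Dict[str, Shape]) -> Dict[str, bool]:
    """Report, per expected input, whether a shape was given and fits its range."""
    return {
        name: name in shapes and spec.accepts(shapes[name])
        for name, spec in GROUNDING_DINO_INPUTS.items()
    }


if __name__ == "__main__":
    print(check_inputs({
        "image": (1, 3, 800, 1344),
        "input_ids": (1, 16),
        "attention_mask": (1, 16),
    }))
```

The point is only that the text-side inputs need their own dynamic shape ranges alongside the image, which a single-input detector conversion would not have to handle.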
Question
Are there any plans to support GroundingDINO? I'm interested in using this model with TensorRT optimization.