Vision Transformer (ViT) trained for object detection
image to detect objects in
The webhook to call when inference is done, by default you will get the output in the response of your inference request
© 2023 Deep Infra. All rights reserved.