hustvl/yolos-base cover image

hustvl/yolos-base

Vision Transformer (ViT) trained for object detection

Vision Transformer (ViT) trained for object detection

Public
$0.0005 / sec

HTTP/cURL API

You can use cURL or any other http client to run inferences:

curl -X POST \
    -H "Authorization: bearer $DEEPINFRA_TOKEN"  \
    -F image=@my_image.jpg  \
    'https://api.deepinfra.com/v1/inference/hustvl/yolos-base'

which will give you back something similar to:

{
  "results": [
    {
      "score": 0.9939407110214233,
      "label": "remote",
      "box": {
        "xmin": 46,
        "ymin": 72,
        "xmax": 181,
        "ymax": 119
      }
    },
    {
      "score": 0.983637809753418,
      "label": "cat",
      "box": {
        "xmin": 12,
        "ymin": 54,
        "xmax": 319,
        "ymax": 470
      }
    }
  ],
  "request_id": null,
  "inference_status": {
    "status": "unknown",
    "runtime_ms": 0,
    "cost": 0.0,
    "tokens_generated": 0,
    "tokens_input": 0
  }
}

Input fields

imagestring

image to detect objects in


webhookfile

The webhook to call when inference is done, by default you will get the output in the response of your inference request

Input Schema

Output Schema