bert-large-cased

A transformer-based language model pre-trained on a large corpus of English data using a masked language modeling objective. It was introduced in a research paper by Google researchers and achieved state-of-the-art results on various natural language processing tasks. The model is cased, meaning it differentiates between English and english, and has a configuration of 24 layers, 1024 hidden dimensions, 16 attention heads, and 336M parameters.

Public

$0.0005 / sec

demoversions

HTTP/cURL API

You can use cURL or any other http client to run inferences:

curl -X POST \
    -d '{"input": "Where is my [MASK]?"}'  \
    -H "Authorization: bearer $DEEPINFRA_TOKEN"  \
    -H 'Content-Type: application/json'  \
    'https://api.deepinfra.com/v1/inference/bert-large-cased'

which will give you back something similar to:

{
  "results": [
    {
      "sequence": "where is my father?",
      "score": 0.08898820728063583,
      "token": 2269,
      "token_str": "father"
    },
    {
      "sequence": "where is my mother?",
      "score": 0.07864926755428314,
      "token": 2388,
      "token_str": "mother"
    }
  ],
  "request_id": null,
  "inference_status": {
    "status": "unknown",
    "runtime_ms": 0,
    "cost": 0.0,
    "tokens_generated": 0,
    "tokens_input": 0
  }
}

Input fields

`input`string

text prompt, should include exactly one [MASK] token

`webhook`file

The webhook to call when inference is done, by default you will get the output in the response of your inference request

Input Schema

Output Schema

Latest Models

Phind/

Phind-CodeLlama-34B-v2

openai/

whisper-tiny

bigcode/

starcoder2-15b

Gryphe/

MythoMax-L2-13b

openchat/

openchat_3.5

Featured Models

mistralai/

Mistral-7B-Instruct-v0.2

deepinfra/

airoboros-70b

meta-llama/

Llama-2-7b-chat-hf

llava-hf/

llava-1.5-7b-hf

cognitivecomputations/

dolphin-2.6-mixtral-8x7b

openchat/

openchat_3.5

Company

Pricing

Docs

Compare

DeepStart

About

Privacy

Terms