BAAI/bge-base-en-v1.5

BGE embedding is a general Embedding Model. It is pre-trained using retromae and trained on large-scale pair data using contrastive learning. Note that the goal of pre-training is to reconstruct the text, and the pre-trained model cannot be used for similarity calculation directly, it needs to be fine-tuned

Public

$0.005 / Mtoken

512

Project Paper License

demoversions

OpenAI-compatible HTTP API

DeepInfra supports the OpenAI embeddings API. The following creates an embedding vector representing the input text

curl "https://api.deepinfra.com/v1/openai/embeddings" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $DEEPINFRA_TOKEN" \
  -d '{
    "input": "The food was delicious and the waiter...",
    "model": "BAAI/bge-base-en-v1.5",
    "encoding_format": "float"
  }'

which will return something similar to

{
  "object":"list",
  "data":[
    {
      "object": "embedding",
      "index":0,
      "embedding":[
        -0.010480394586920738,
        -0.0026091758627444506
        ...
        0.031979579478502274,
        0.02021978422999382
      ]
    }
  ],
  "model": "BAAI/bge-base-en-v1.5",
  "usage": {
    "prompt_tokens":12,
    "total_tokens":12
  }
}

Input fields

`model`string

model name

`input`array

sequences to embed

`encoding_format`string

format used when encoding

Default value: "float"

Allowed values: float

Input Schema

Output Schema

Latest Models

openchat/

openchat_3.5

bigcode/

starcoder2-15b

Gryphe/

MythoMax-L2-13b

openai/

whisper-tiny

Phind/

Phind-CodeLlama-34B-v2

Featured Models

Qwen/

Qwen2.5-72B-Instruct

openai/

whisper-large-v3

nvidia/

Llama-3.1-Nemotron-70B-Instruct

openai/

whisper-large-v3-turbo

black-forest-labs/

FLUX-1-dev

deepinfra/

tts

Company

Pricing

Docs

Compare

DeepStart

About

Careers

Privacy

Terms

BAAI/bge-base-en-v1.5

OpenAI-compatible HTTP API

Input fields

modelstring

inputarray

encoding_formatstring

Input Schema

Output Schema

`model`string

`input`array

`encoding_format`string