
Qwen/Qwen3-Embedding-4B

The Qwen3 Embedding model series is the latest proprietary model series of the Qwen family, designed specifically for text embedding and ranking tasks. Building on the dense foundational models of the Qwen3 series, it provides a range of text embedding and reranking models in three sizes (0.6B, 4B, and 8B).

Public

$0.005 / Mtoken

32,768


OpenAI-compatible HTTP API

DeepInfra supports the OpenAI embeddings API. The following request creates an embedding vector representing the input text:

curl "https://api.deepinfra.com/v1/openai/embeddings" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $DEEPINFRA_TOKEN" \
  -d '{
    "input": "The food was delicious and the waiter...",
    "model": "Qwen/Qwen3-Embedding-4B",
    "encoding_format": "float"
  }'

The request returns a response similar to:

{
  "object": "list",
  "data": [
    {
      "object": "embedding",
      "index": 0,
      "embedding": [
        -0.010480394586920738,
        -0.0026091758627444506,
        ...
        0.031979579478502274,
        0.02021978422999382
      ]
    }
  ],
  "model": "Qwen/Qwen3-Embedding-4B",
  "usage": {
    "prompt_tokens": 12,
    "total_tokens": 12
  }
}
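
For readers who prefer a script over curl, the same request and response handling can be sketched in Python using only the standard library. The function names `build_request`, `extract_embeddings`, and `embed` are illustrative and not part of any DeepInfra SDK; the endpoint, headers, and body fields are taken from the curl command above, and the response layout is assumed to match the sample shown.

```python
import json
import urllib.request

API_URL = "https://api.deepinfra.com/v1/openai/embeddings"
MODEL = "Qwen/Qwen3-Embedding-4B"


def build_request(text: str, token: str) -> urllib.request.Request:
    """Build the same POST request as the curl command (hypothetical helper)."""
    payload = {"input": text, "model": MODEL, "encoding_format": "float"}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def extract_embeddings(response: dict) -> list:
    """Return the embedding vectors, ordered by each item's `index` field."""
    items = sorted(response["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def embed(text: str, token: str) -> list:
    """Send the request and return the first embedding vector.

    Performs a real network call; assumes the response has the shape
    shown in the sample above.
    """
    with urllib.request.urlopen(build_request(text, token)) as resp:
        body = json.load(resp)
    return extract_embeddings(body)[0]
```

Calling `embed("The food was delicious and the waiter...", os.environ["DEEPINFRA_TOKEN"])` (after `import os`) would send the request with the same token variable the curl example uses. Sorting by `index` is a defensive choice so that vectors line up with their inputs if several are returned.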


Input fields

`model` (string)

`input` (array)

`encoding_format` (string)

`dimensions` (integer)
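
As a rough illustration of how the four input fields fit together, the sketch below assembles a request body with `input` given as an array of strings. The helper name `build_payload` and the value `256` are hypothetical. The page lists `dimensions` only as an integer field, so treating it as the requested output vector size (as in the OpenAI embeddings API) is an assumption here, not something the page confirms.

```python
import json
from typing import Optional, Sequence


def build_payload(
    texts: Sequence[str],
    model: str = "Qwen/Qwen3-Embedding-4B",
    encoding_format: str = "float",
    dimensions: Optional[int] = None,
) -> str:
    """Serialize a request body from the listed input fields (hypothetical helper)."""
    if isinstance(texts, str) or len(texts) == 0:
        raise ValueError("texts must be a non-empty sequence of strings")
    body = {
        "model": model,
        "input": list(texts),
        "encoding_format": encoding_format,
    }
    # Assumption: `dimensions` is optional, so it is omitted unless set.
    if dimensions is not None:
        if dimensions <= 0:
            raise ValueError("dimensions must be a positive integer")
        body["dimensions"] = dimensions
    return json.dumps(body)


# Two inputs in one request; 256 is an illustrative value only.
payload = build_payload(["first text", "second text"], dimensions=256)
```

The resulting string can be passed as the `-d` argument of the curl command or as the request body in any HTTP client.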
