
Browse deepinfra models: all categories and models you can try out and use directly on deepinfra.

Category: embeddings

Embeddings are a crucial concept in AI and NLP: they represent discrete, categorical data as continuous vectors in a shared vector space. They are particularly useful for text data, capturing the semantic meaning, syntax, and context of words and phrases. By replacing sparse one-hot encoded vectors with dense vectors, embeddings help machine learning models learn meaningful relationships between categories.
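To make the one-hot versus dense contrast concrete, here is a minimal sketch using a toy three-word vocabulary and a hand-written 2-dimensional lookup table; real embedding tables are learned during training and have hundreds of dimensions.

```python
# Toy vocabulary mapping words to row indices (illustrative, not a real model).
vocab = {"cat": 0, "dog": 1, "car": 2}

def one_hot(word):
    """Sparse encoding: a vector of zeros with a single 1 (length == vocab size)."""
    v = [0.0] * len(vocab)
    v[vocab[word]] = 1.0
    return v

# Dense embedding table: each row is a low-dimensional vector for one word.
# In a trained model these values are learned; here they are made up.
embedding_table = [
    [0.9, 0.1],  # cat
    [0.8, 0.2],  # dog
    [0.1, 0.9],  # car
]

def embed(word):
    """Dense encoding: look up the word's row in the embedding table."""
    return embedding_table[vocab[word]]

print(one_hot("cat"))  # [1.0, 0.0, 0.0] -- sparse; every word equally distant
print(embed("cat"))    # [0.9, 0.1] -- dense; related words get nearby rows
```

Note that "cat" and "dog" sit close together in the dense table while every pair of one-hot vectors is equally far apart, which is exactly the relationship-learning advantage the paragraph above describes.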

Embeddings are not limited to text data; they can also be used for other types of categorical data such as images, audio, and video. Using embeddings for these data types captures relationships between different objects or sounds, improving machine learning models' performance.

Incorporating embeddings into AI models can significantly enhance their accuracy, making them an essential tool for data scientists and AI practitioners. By understanding embeddings, you can unlock new possibilities in your AI projects and create more sophisticated and accurate models.
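The "meaningful relationships" described above are typically measured with cosine similarity between embedding vectors. The sketch below uses tiny hand-made 4-dimensional vectors purely for illustration; embeddings from a real model behave the same way at higher dimension.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings" (made up for illustration): semantically similar items
# are given vectors pointing in similar directions.
cat = [0.9, 0.1, 0.0, 0.2]
kitten = [0.85, 0.15, 0.05, 0.25]
car = [0.1, 0.9, 0.8, 0.0]

print(cosine_similarity(cat, kitten))  # close to 1.0: related concepts
print(cosine_similarity(cat, car))     # much lower: unrelated concepts
```

This is the core operation behind semantic search: embed a query, embed the documents, and rank documents by cosine similarity to the query.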

thenlper/gte-large
  • max input: 512 tokens
  • $0.010 / Mtoken
  • embeddings

The GTE models are trained by Alibaba DAMO Academy. They are mainly based on the BERT framework and currently come in three sizes: GTE-large, GTE-base, and GTE-small. The models are trained on a large-scale corpus of relevance text pairs covering a wide range of domains and scenarios, which allows them to be applied to various downstream text-embedding tasks, including information retrieval, semantic textual similarity, and text reranking.
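As a sketch of how a hosted model like gte-large might be queried: the code below builds a request payload in the OpenAI-compatible style and only sends it if an API token is present in the environment. The endpoint URL, the `DEEPINFRA_TOKEN` variable name, and the response shape are assumptions for illustration; consult the deepinfra documentation for the actual API details.

```python
import json
import os
import urllib.request

# Assumed OpenAI-compatible embeddings endpoint (not taken from this page).
API_URL = "https://api.deepinfra.com/v1/openai/embeddings"

def build_request(texts, model="thenlper/gte-large"):
    """Build the JSON payload for an embeddings request (OpenAI-style schema)."""
    return {"model": model, "input": texts}

payload = build_request(["How do I rerank search results?"])
print(json.dumps(payload))

token = os.environ.get("DEEPINFRA_TOKEN")  # assumed env var name
if token:
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        # Assumed OpenAI-style response: {"data": [{"embedding": [...]}, ...]}
        vectors = [item["embedding"] for item in json.load(resp)["data"]]
```

Remember the model card above lists a 512-token maximum input, so longer documents would need to be truncated or chunked before embedding.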