Embeddings
Create vector embeddings for text to power semantic search and similarity
Embeddings
TL;DR
POST to /v1/embeddings with model: "assisters-embed-v1" and input (text or array). Returns 1024-dimension vectors. Supports 100+ languages. Perfect for semantic search, RAG, and clustering. $0.01 per million tokens.
Create vector representations of text for semantic search, clustering, recommendations, and similarity matching.
Endpoint
POST https://api.assisters.dev/v1/embeddingsRequest Body
stringrequiredThe embedding model to use. See available models.
Example: assisters-embed-v1
string | arrayrequiredThe text to embed. Can be a single string or an array of up to 100 strings.
stringdefault: floatThe format for the embedding values. Options: float, base64
Request Examples
Single Text
from openai import OpenAI
client = OpenAI(
api_key="ask_your_api_key",
base_url="https://api.assisters.dev/v1"
)
response = client.embeddings.create(
model="assisters-embed-v1",
input="The quick brown fox jumps over the lazy dog"
)
embedding = response.data[0].embedding
print(f"Dimensions: {len(embedding)}")import OpenAI from 'openai';
const client = new OpenAI({
apiKey: 'ask_your_api_key',
baseURL: 'https://api.assisters.dev/v1'
});
const response = await client.embeddings.create({
model: 'assisters-embed-v1',
input: 'The quick brown fox jumps over the lazy dog'
});
const embedding = response.data[0].embedding;
console.log(`Dimensions: ${embedding.length}`);curl https://api.assisters.dev/v1/embeddings \
-H "Authorization: Bearer ask_your_api_key" \
-H "Content-Type: application/json" \
-d '{
"model": "assisters-embed-v1",
"input": "The quick brown fox jumps over the lazy dog"
}'Batch Embeddings
response = client.embeddings.create(
model="assisters-embed-v1",
input=[
"First document to embed",
"Second document to embed",
"Third document to embed"
]
)
for i, data in enumerate(response.data):
print(f"Document {i}: {len(data.embedding)} dimensions")Semantic Search Example
import numpy as np
from openai import OpenAI
client = OpenAI(
api_key="ask_your_api_key",
base_url="https://api.assisters.dev/v1"
)
# Your documents
documents = [
"Python is a programming language",
"JavaScript runs in the browser",
"Machine learning uses algorithms",
"Cats are furry pets"
]
# Embed all documents
doc_response = client.embeddings.create(
model="assisters-embed-v1",
input=documents
)
doc_embeddings = [d.embedding for d in doc_response.data]
# Embed the query
query = "What programming languages are there?"
query_response = client.embeddings.create(
model="assisters-embed-v1",
input=query
)
query_embedding = query_response.data[0].embedding
# Calculate cosine similarity
def cosine_similarity(a, b):
return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
# Find most similar documents
similarities = [cosine_similarity(query_embedding, doc) for doc in doc_embeddings]
ranked = sorted(zip(documents, similarities), key=lambda x: x[1], reverse=True)
for doc, score in ranked:
print(f"{score:.4f}: {doc}")Response
{
"object": "list",
"data": [
{
"object": "embedding",
"index": 0,
"embedding": [
0.0023064255,
-0.009327292,
0.015797347,
...
]
}
],
"model": "assisters-embed-v1",
"usage": {
"prompt_tokens": 10,
"total_tokens": 10
},
"cost": {
"total_usd": 0.000001,
"price_per_million_tokens": 0.1
}
}Response Fields
objectstringAlways list
costobjectCost breakdown: total_usd and price_per_million_tokens
dataarrayArray of embedding objects, one per input text:
object: Alwaysembeddingindex: Position in the input arrayembedding: Array of floats representing the vector
modelstringThe model used to generate embeddings
usageobjectToken usage for billing:
prompt_tokens: Tokens in the inputtotal_tokens: Same as prompt_tokens for embeddings
Available Models
| Model | Dimensions | Max Tokens | Price |
|---|---|---|---|
assisters-embed-v1 | 1024 | 8192 | $0.01/M |
Model Details
See detailed specifications for embedding models
Use Cases
Best Practices
Batch Requests
Embed multiple texts in one request for better throughput
Cache Embeddings
Store embeddings in a vector database to avoid re-computation
Normalize Vectors
Most embedding models output normalized vectors, but verify for your use case
Match Query/Doc Models
Always use the same model for queries and documents
Vector Databases
Store and query embeddings efficiently with these compatible databases:
- Pinecone - Managed vector database
- Weaviate - Open-source vector search
- Qdrant - Open-source vector database
- Milvus - Cloud-native vector database
- pgvector - PostgreSQL extension
- Supabase - PostgreSQL with pgvector built-in