Assisters API
API Reference

Embeddings

Create vector embeddings for text to power semantic search and similarity

Embeddings

TL;DR

POST to /v1/embeddings with model: "assisters-embed-v1" and input (text or array). Returns 1024-dimension vectors. Supports 100+ languages. Perfect for semantic search, RAG, and clustering. $0.01 per million tokens.

Create vector representations of text for semantic search, clustering, recommendations, and similarity matching.

Endpoint

POST https://api.assisters.dev/v1/embeddings

Request Body

stringrequired

The embedding model to use. See available models.

Example: assisters-embed-v1

string | arrayrequired

The text to embed. Can be a single string or an array of up to 100 strings.

stringdefault: float

The format for the embedding values. Options: float, base64

Request Examples

Single Text

from openai import OpenAI

client = OpenAI(
    api_key="ask_your_api_key",
    base_url="https://api.assisters.dev/v1"
)

response = client.embeddings.create(
    model="assisters-embed-v1",
    input="The quick brown fox jumps over the lazy dog"
)

embedding = response.data[0].embedding
print(f"Dimensions: {len(embedding)}")
import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'ask_your_api_key',
  baseURL: 'https://api.assisters.dev/v1'
});

const response = await client.embeddings.create({
  model: 'assisters-embed-v1',
  input: 'The quick brown fox jumps over the lazy dog'
});

const embedding = response.data[0].embedding;
console.log(`Dimensions: ${embedding.length}`);
curl https://api.assisters.dev/v1/embeddings \
  -H "Authorization: Bearer ask_your_api_key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "assisters-embed-v1",
    "input": "The quick brown fox jumps over the lazy dog"
  }'

Batch Embeddings

response = client.embeddings.create(
    model="assisters-embed-v1",
    input=[
        "First document to embed",
        "Second document to embed",
        "Third document to embed"
    ]
)

for i, data in enumerate(response.data):
    print(f"Document {i}: {len(data.embedding)} dimensions")

Semantic Search Example

import numpy as np
from openai import OpenAI

client = OpenAI(
    api_key="ask_your_api_key",
    base_url="https://api.assisters.dev/v1"
)

# Your documents
documents = [
    "Python is a programming language",
    "JavaScript runs in the browser",
    "Machine learning uses algorithms",
    "Cats are furry pets"
]

# Embed all documents
doc_response = client.embeddings.create(
    model="assisters-embed-v1",
    input=documents
)
doc_embeddings = [d.embedding for d in doc_response.data]

# Embed the query
query = "What programming languages are there?"
query_response = client.embeddings.create(
    model="assisters-embed-v1",
    input=query
)
query_embedding = query_response.data[0].embedding

# Calculate cosine similarity
def cosine_similarity(a, b):
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

# Find most similar documents
similarities = [cosine_similarity(query_embedding, doc) for doc in doc_embeddings]
ranked = sorted(zip(documents, similarities), key=lambda x: x[1], reverse=True)

for doc, score in ranked:
    print(f"{score:.4f}: {doc}")

Response

{
  "object": "list",
  "data": [
    {
      "object": "embedding",
      "index": 0,
      "embedding": [
        0.0023064255,
        -0.009327292,
        0.015797347,
        ...
      ]
    }
  ],
  "model": "assisters-embed-v1",
  "usage": {
    "prompt_tokens": 10,
    "total_tokens": 10
  },
  "cost": {
    "total_usd": 0.000001,
    "price_per_million_tokens": 0.1
  }
}

Response Fields

objectstring

Always list

costobject

Cost breakdown: total_usd and price_per_million_tokens

dataarray

Array of embedding objects, one per input text:

  • object: Always embedding
  • index: Position in the input array
  • embedding: Array of floats representing the vector
modelstring

The model used to generate embeddings

usageobject

Token usage for billing:

  • prompt_tokens: Tokens in the input
  • total_tokens: Same as prompt_tokens for embeddings

Available Models

ModelDimensionsMax TokensPrice
assisters-embed-v110248192$0.01/M

Model Details

See detailed specifications for embedding models

Use Cases

Best Practices

Batch Requests

Embed multiple texts in one request for better throughput

Cache Embeddings

Store embeddings in a vector database to avoid re-computation

Normalize Vectors

Most embedding models output normalized vectors, but verify for your use case

Match Query/Doc Models

Always use the same model for queries and documents

Vector Databases

Store and query embeddings efficiently with these compatible databases:

  • Pinecone - Managed vector database
  • Weaviate - Open-source vector search
  • Qdrant - Open-source vector database
  • Milvus - Cloud-native vector database
  • pgvector - PostgreSQL extension
  • Supabase - PostgreSQL with pgvector built-in

Error Responses