Embedding

An embedding is a numerical representation of text (or an image) that lets AI systems measure how similar two pieces of content are in meaning.

Also known as: vector embedding, text embedding

An embedding is a numerical representation, typically a vector of several hundred to a few thousand numbers, that captures the semantic meaning of a piece of text or an image. Two pieces of text with similar meaning get similar embedding vectors, even if they use different words. Embeddings are the foundation of RAG systems: when you ask a question, the question is converted to an embedding, which is then compared against the stored embeddings of documents in a vector database to find the closest matches. This is why modern AI search can find relevant documents even when none of the search terms appear in the document itself.
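The comparison step can be sketched in a few lines of Python. This is a minimal illustration, not how any particular product implements it: the four-dimensional vectors are made up by hand (real embeddings come from an embedding model and are far longer), the document titles and the `top_k` helper are hypothetical, and cosine similarity is assumed as the similarity measure, which is a common but not universal choice.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hand-written toy embeddings (assumption: a real system would get these
# from an embedding model and store them in a vector database).
documents = {
    "How to reset your password": [0.9, 0.1, 0.0, 0.2],
    "Quarterly revenue report": [0.1, 0.8, 0.3, 0.0],
    "Steps for recovering account access": [0.8, 0.2, 0.1, 0.3],
}

# Toy embedding for the question "I forgot my login". Note that it shares
# no words with the document titles above.
query_embedding = [0.85, 0.15, 0.05, 0.25]


def top_k(query, docs, k=2):
    """Return the titles of the k documents most similar to the query."""
    scored = [(cosine_similarity(query, vec), title) for title, vec in docs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [title for _, title in scored[:k]]


results = top_k(query_embedding, documents)
print(results)
```

With these toy vectors, the two account-related documents score far higher than the revenue report, even though the question uses different words. A vector database performs the same kind of nearest-neighbour lookup, but with indexes that avoid comparing the query against every stored vector.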

Related terms

  • RAG (Retrieval-Augmented Generation): RAG is a technique where a language model answers based on the business's own documents instead of only its general training.
  • Vector database: A vector database is a database optimised for storing and searching embeddings, the foundation of RAG systems and AI search.
  • LLM (Large Language Model): An LLM is a large language model trained on enormous text volumes that can generate, summarise, and analyse text in a human-like way.