pgvector

pgvector turns Postgres into a vector database: it adds a vector column type, distance operators, and HNSW/IVFFlat indexes so you can run similarity search next to your relational data, with full SQL filtering and transactions — no separate vector store to operate.

pgvector is an open-source extension that gives Postgres a native vector type, distance operators, and approximate-nearest-neighbour indexes. With it, your embeddings live in the same database as your relational data — searchable with ordinary SQL, filterable with WHERE, and consistent inside the same transaction. For a large share of RAG and semantic-search workloads, that means there's no separate vector database to deploy, sync, or back up.

It is aimed at teams who already run Postgres and want vector search without adding another system to operate. You install the extension, add a vector column, build an index, and query with the distance operators; the rest of your schema, joins, and tooling keep working as they always did.

Highlights

Vector types in Postgres: vector, halfvec (half-precision), and sparsevec, with distance operators for L2 (<->), cosine (<=>), and inner product (<#>, which returns the negative inner product so that ascending order still means "closest first").
HNSW & IVFFlat indexes: HNSW for high recall and low latency, IVFFlat for faster index builds and a smaller memory footprint; both expose tuning parameters for the recall/speed trade-off.
SQL-native filtering: combine similarity search with any WHERE clause, join, or ORDER BY, with no separate metadata-filter API to learn.
Transactional & consistent: inserts and updates to vectors are ACID, just like the rest of your data.
Scales further with extensions: pgvectorscale adds a StreamingDiskANN index and Statistical Binary Quantization to push past in-memory limits while staying in Postgres.
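
To make the types and operators concrete, here is a minimal sketch. It assumes pgvector 0.7.0 or later (the release that introduced halfvec and sparsevec); the table name type_demo and its columns are hypothetical, and three dimensions are used only to keep the literals readable.

```sql
CREATE EXTENSION IF NOT EXISTS vector;

-- Hypothetical demo table (not part of any real schema).
CREATE TABLE type_demo (
    id     bigserial PRIMARY KEY,
    full_v vector(3),      -- single precision, 4 bytes per dimension
    half_v halfvec(3),     -- half precision, 2 bytes per dimension
    sparse sparsevec(5)    -- stores only the non-zero entries
);

-- sparsevec literals are {index:value,...}/dimensions, with 1-based indices.
INSERT INTO type_demo (full_v, half_v, sparse) VALUES
    ('[1,2,3]', '[1,2,3]', '{1:1,3:2,5:3}/5'),
    ('[4,5,6]', '[4,5,6]', '{2:4,4:5}/5');

SELECT id,
       full_v <-> '[3,1,2]'        AS l2_distance,
       full_v <=> '[3,1,2]'        AS cosine_distance,  -- 1 - cosine similarity
       (full_v <#> '[3,1,2]') * -1 AS inner_product     -- undo the sign flip of <#>
FROM type_demo
ORDER BY l2_distance;
```

All three operators are distances, so smaller values mean closer vectors; that is why the raw <#> result is negated before it is reported as an inner product.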

In an AI-assisted workflow

Enable the extension, store embeddings beside your rows, index them, and query with a distance operator and a normal filter:

CREATE EXTENSION IF NOT EXISTS vector;
 
ALTER TABLE docs ADD COLUMN embedding vector(1536);
CREATE INDEX ON docs USING hnsw (embedding vector_cosine_ops);
 
-- nearest neighbours to the query vector, filtered by metadata
SELECT id, content
FROM docs
WHERE product = 'billing'
ORDER BY embedding <=> $1   -- $1 is the embedded query
LIMIT 20;                   -- over-retrieve, then rerank
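
One caveat with the filtered query above: with an approximate index, pgvector applies the WHERE clause after the index scan, so a selective filter can return fewer than LIMIT rows. The sketch below shows two common mitigations. It assumes pgvector 0.8.0 or later for hnsw.iterative_scan and reuses the docs table from the example; the index name docs_product_idx and the statement name nearest_billing are illustrative.

```sql
-- An ordinary B-tree index lets the planner use the filter when it is selective.
CREATE INDEX IF NOT EXISTS docs_product_idx ON docs (product);

-- Keep scanning the HNSW index until enough rows pass the filter (pgvector 0.8.0+).
SET hnsw.iterative_scan = strict_order;

-- Same query as above, wrapped in a prepared statement so $1 has a declared type,
-- and returning a similarity score that a reranker can use.
PREPARE nearest_billing (vector) AS
SELECT id, content, 1 - (embedding <=> $1) AS cosine_similarity
FROM docs
WHERE product = 'billing'
ORDER BY embedding <=> $1
LIMIT 20;

-- EXECUTE nearest_billing('[...]');  -- pass the 1536-dimension query embedding
```

Whether the planner picks the B-tree index, the HNSW index, or a sequential scan depends on table statistics, so it is worth checking the plan with EXPLAIN on realistic data.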

TIP

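Match the operator class to the distance metric your embedding model expects: vector_cosine_ops for cosine, vector_l2_ops for Euclidean, vector_ip_ops for inner product. Use the matching operator in queries too (<=>, <->, and <#> respectively). Ranking with a metric the model was not built for silently degrades result quality, and Postgres can only use an index when the query operator matches its operator class; otherwise the query falls back to a slower exact scan. To scaffold the schema and index, see Scaffold a pgvector Schema & HNSW Index.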
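
A quick way to check the pairing is to compare query plans. This is a sketch on a hypothetical three-row table (opclass_demo and its index name are made up for illustration); on a table this small the planner would normally prefer a sequential scan, so the example discourages that for the duration of the check.

```sql
CREATE EXTENSION IF NOT EXISTS vector;

-- Hypothetical demo table.
CREATE TABLE opclass_demo (id bigserial PRIMARY KEY, embedding vector(3));
INSERT INTO opclass_demo (embedding) VALUES ('[1,2,3]'), ('[4,5,6]'), ('[7,8,9]');

-- The cosine operator class pairs with the <=> operator.
CREATE INDEX opclass_demo_cos_idx ON opclass_demo
USING hnsw (embedding vector_cosine_ops);

-- Discourage sequential scans so the plan choice is visible on a tiny table.
SET enable_seqscan = off;

-- Matching operator: the plan can use opclass_demo_cos_idx.
EXPLAIN SELECT id FROM opclass_demo ORDER BY embedding <=> '[1,2,3]' LIMIT 2;

-- Non-matching operator (<-> is L2): the cosine index cannot serve this ordering.
EXPLAIN SELECT id FROM opclass_demo ORDER BY embedding <-> '[1,2,3]' LIMIT 2;

RESET enable_seqscan;
```

If the second plan is the one you see for production queries, either change the query operator or rebuild the index with the operator class that matches it.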

Good to know

pgvector is free and open source under the permissive PostgreSQL License and ships in most managed Postgres offerings (Supabase, Neon, RDS, Cloud SQL). It's the pragmatic default when you already run Postgres and have up to a few million vectors. For billion-scale workloads, or when you need heavy out-of-the-box quantization and sharding, weigh a dedicated store; see Best Vector Database in 2026. Tune the HNSW parameters against your recall target with the Embedding Index Tuner.

Frequently asked questions

What is pgvector?

pgvector is an open-source extension that gives Postgres a native vector type, distance operators, and approximate-nearest-neighbour indexes (HNSW and IVFFlat). Your embeddings live in the same database as your relational data — searchable with ordinary SQL, filterable with WHERE, and consistent inside the same transaction — so there's no separate vector database to deploy or sync.

Is pgvector free?

Yes — free and open source under the permissive PostgreSQL License, and it ships in most managed Postgres offerings, including Supabase, Neon, RDS, and Cloud SQL.

When does pgvector stop being enough?

It's the pragmatic default when you already run Postgres and have up to a few million vectors. For billion-scale workloads or heavy out-of-the-box quantization and sharding, weigh a dedicated store; the pgvectorscale extension can also push past in-memory limits while staying in Postgres.
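
For orientation, enabling pgvectorscale looks roughly like the sketch below. It assumes the extension is already installed on the server (availability varies across managed offerings), and the table and index names are hypothetical.

```sql
-- CASCADE also installs pgvector if it is not yet enabled in this database.
CREATE EXTENSION IF NOT EXISTS vectorscale CASCADE;

-- Hypothetical table mirroring the docs example.
CREATE TABLE IF NOT EXISTS docs_large (
    id        bigserial PRIMARY KEY,
    content   text,
    embedding vector(1536)
);

-- StreamingDiskANN index; queries keep using pgvector's distance operators.
CREATE INDEX docs_large_embedding_idx ON docs_large
USING diskann (embedding vector_cosine_ops);
```

Queries are unchanged: ORDER BY embedding <=> $1 LIMIT n works the same way against a diskann index as against an HNSW one.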

Should I use HNSW or IVFFlat with pgvector?

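HNSW for high recall and low latency; IVFFlat for faster index builds and a smaller memory footprint. Both expose tuning parameters for the recall/speed trade-off. Whichever you choose, match the operator class to your embedding model's distance metric (vector_cosine_ops, vector_l2_ops, or vector_ip_ops) and use the corresponding operator in queries: the wrong metric silently degrades result quality, and an operator that does not match the index's operator class prevents Postgres from using the index.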
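
The tuning parameters mentioned above are set partly at index build time and partly per session. A minimal sketch follows, using a hypothetical tuning_demo table; the values shown are pgvector's documented defaults or arbitrary illustrations, not recommendations, and in practice you would build only one of the two indexes on a column.

```sql
CREATE EXTENSION IF NOT EXISTS vector;

-- Hypothetical demo table.
CREATE TABLE tuning_demo (id bigserial PRIMARY KEY, embedding vector(3));
INSERT INTO tuning_demo (embedding) VALUES ('[1,2,3]'), ('[4,5,6]'), ('[7,8,9]');

-- HNSW build-time parameters (defaults: m = 16, ef_construction = 64).
-- Higher values tend to improve recall at the cost of build time and memory.
CREATE INDEX tuning_demo_hnsw_idx ON tuning_demo
USING hnsw (embedding vector_l2_ops) WITH (m = 16, ef_construction = 64);

-- HNSW query-time parameter (default 40): larger values raise recall and latency.
SET hnsw.ef_search = 100;

-- IVFFlat build-time parameter: the number of lists. Build it after the table
-- holds representative data, because the lists are derived from existing rows.
CREATE INDEX tuning_demo_ivf_idx ON tuning_demo
USING ivfflat (embedding vector_l2_ops) WITH (lists = 100);

-- IVFFlat query-time parameter (default 1): how many lists each query probes.
SET ivfflat.probes = 10;
```

Use SET LOCAL inside a transaction if the setting should apply to a single query rather than the whole session.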
