A complement to pgvector for high performance, cost efficient vector search on large workloads
pgvectorscale complements pgvector, the open-source vector data extension for
PostgreSQL, and introduces the following key innovations for pgvector data:
- A new index type called StreamingDiskANN, inspired by the DiskANN algorithm,
based on research from Microsoft.
- Statistical Binary Quantization: developed by Timescale researchers, This
compression method improves on standard Binary Quantization.
On a benchmark dataset of 50 million Cohere embeddings with 768 dimensions
each, PostgreSQL with pgvector and pgvectorscale achieves 28x lower p95 latency
and 16x higher query throughput compared to Pinecone's storage optimized (s1)
index for approximate nearest neighbor queries at 99% recall, all at 75% less
cost when self-hosted on AWS EC2.
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout server:database:postgresql/pgvectorscale && cd $_ - Create Badge
Source Files
| Filename | Size | Changed |
|---|---|---|
| _multibuild | 0000000197 197 Bytes | |
| _service | 0000000134 134 Bytes | |
| pgvectorscale-0.9.0.tar.gz | 0000117788 115 KB | |
| pgvectorscale.changes | 0000000312 312 Bytes | |
| pgvectorscale.spec | 0000002871 2.8 KB | |
| vendor.tar.zst | 0033341510 31.8 MB |
Comments 0