A complement to pgvector for high performance, cost efficient vector search on large workloads

Edit Package pgvectorscale
https://github.com/timescale/pgvectorscale

pgvectorscale complements pgvector, the open-source vector data extension for
PostgreSQL, and introduces the following key innovations for pgvector data:

- A new index type called StreamingDiskANN, inspired by the DiskANN algorithm,
based on research from Microsoft.
- Statistical Binary Quantization: developed by Timescale researchers, This
compression method improves on standard Binary Quantization.

On a benchmark dataset of 50 million Cohere embeddings with 768 dimensions
each, PostgreSQL with pgvector and pgvectorscale achieves 28x lower p95 latency
and 16x higher query throughput compared to Pinecone's storage optimized (s1)
index for approximate nearest neighbor queries at 99% recall, all at 75% less
cost when self-hosted on AWS EC2.

Source Files
Filename Size Changed
_multibuild 0000000197 197 Bytes
_service 0000000134 134 Bytes
pgvectorscale-0.9.0.tar.gz 0000117788 115 KB
pgvectorscale.changes 0000000312 312 Bytes
pgvectorscale.spec 0000002871 2.8 KB
vendor.tar.zst 0033341510 31.8 MB
Comments 0
No comments available
openSUSE Build Service is sponsored by