We at https://github.com/tensorchord/VectorChord solved most of the pgvector iss...

nostrebored · 2025-11-03T15:10:17 1762182617

So you’re quantizing and using IVF — what are your recall numbers with actual use cases?

VoVAllen · 2025-11-03T17:24:09 1762190649

We do have some benchmark number at https://blog.vectorchord.ai/vector-search-over-postgresql-a-.... It varies on different dataset, but most cases it's 2x or more QPS comparing to pgvector's hnsw at same recall.

nostrebored · 2025-11-03T19:27:53 1762198073

Your graphs are measuring accuracy [1] (i'm assuming precision?), not recall? My impression is that your approach would miss surfacing potentially relevant candidates, because that is the tradeoff IVF makes for memory optimization. I'd expect that this especially struggles with high dim vectors and large datasets.

[1] https://cdn.hashnode.com/res/hashnode/image/upload/v17434120...

VoVAllen · 2025-11-03T19:38:36 1762198716

It's recall. Thanks for pointing out this, we'll update the diagram.

The core part is a quantization technique called RaBitQ. We can scan over the bit vector to have an estimation about the real distance between query and data. I'm not sure what do you mean by "miss" here. As the approximate nearest neighbor index, all the index including HNSW will miss some potential candidates.

VoVAllen · 2025-11-03T14:43:37 1762181017

And we do have user hosting 3 Billion vectors with Postgres + VectorChord with sharding. And they're using vectors to save the earth! Check https://blog.vectorchord.ai/3-billion-vectors-in-postgresql-...

tacoooooooo · 2025-11-03T15:42:02 1762184522

We actually looked into vectorchord--it looks really cool, but it's not supported by RDS so it is an additional service for us to add anyways.

inadequatespace · 2025-11-05T16:30:07 1762360207

Another extremely solid win for Cunningham’s Law.