Skip to content

How to catch potential issues with the new Faiss vector format? #14875

@kaivalnp

Description

@kaivalnp

Description

Opening this to discuss how we could catch issues (early on) with the new Faiss vector format (#14178) -- like memory leaks or bugs in Faiss itself, under more rigorous scenarios -- perhaps using long-running (or "monster" tests)

This is important because we're dealing with native code, without fine control over memory allocation and error handling -- and could result in the JVM crashing!

One such example is #14847, where we discovered an issue with attempting to index >2GB of vectors in a single field in a single segment (this issue was more on the codec side and how we passed data over to Faiss instead of the native part, but the above tests would help catch this nonetheless!)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions