NAME

Lucy::Docs::DocIDs - Characteristics of Apache Lucy document ids.

DESCRIPTION

Document ids are signed 32-bit integers

Document ids in Apache Lucy start at 1. Because 0 is never a valid doc id, it can serve as a sentinel value: an iterator returns 0 to signal that it is exhausted, which ends a loop like this one:

    while ( my $doc_id = $posting_list->next ) {
        ...
    }

Document ids are ephemeral

The document ids used by Lucy are associated with a single index snapshot. The moment an index is updated, the mapping of document ids to documents is subject to change.

Since IndexReader objects represent a point-in-time view of an index, document ids are guaranteed to remain static for the life of the reader. However, because they are not permanent, Lucy document ids cannot be used as foreign keys to locate records in external data sources. If you truly need a primary key field, you must define it and populate it yourself.
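As a minimal sketch of defining and populating your own primary key, the example below adds a string field to the schema and looks a document up by that field rather than by its Lucy doc id. The field names "sku" and "title", the sample documents, and the temporary index location are illustrative assumptions, not part of Lucy.

```perl
use strict;
use warnings;
use File::Temp qw( tempdir );
use Lucy::Plan::Schema;
use Lucy::Plan::StringType;
use Lucy::Plan::FullTextType;
use Lucy::Analysis::EasyAnalyzer;
use Lucy::Index::Indexer;
use Lucy::Search::IndexSearcher;
use Lucy::Search::TermQuery;

# Hypothetical schema: "sku" is an application-defined primary key,
# "title" is ordinary searchable text.
my $schema = Lucy::Plan::Schema->new;
$schema->spec_field( name => 'sku', type => Lucy::Plan::StringType->new );
$schema->spec_field(
    name => 'title',
    type => Lucy::Plan::FullTextType->new(
        analyzer => Lucy::Analysis::EasyAnalyzer->new( language => 'en' ),
    ),
);

# Build a throwaway index in a temporary directory.
my $index_path = tempdir( CLEANUP => 1 );
my $indexer    = Lucy::Index::Indexer->new(
    index  => $index_path,
    schema => $schema,
    create => 1,
);
$indexer->add_doc( { sku => 'sku-1001', title => 'Blue widget' } );
$indexer->add_doc( { sku => 'sku-1002', title => 'Red widget' } );
$indexer->commit;

# Locate the record by our own key, never by the ephemeral doc id.
my $searcher = Lucy::Search::IndexSearcher->new( index => $index_path );
my $hits     = $searcher->hits(
    query => Lucy::Search::TermQuery->new( field => 'sku', term => 'sku-1002' ),
);
my $hit         = $hits->next;
my $found_title = $hit->{title};
```

Because the "sku" value is stored with the document, it stays valid across index updates, so it is the value to record in an external data source.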

Furthermore, the order of document ids does not tell you anything about the sequence in which documents were added to the index.
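If insertion order matters to your application, one possible approach is to record it yourself in a sortable field and sort on that field at search time. The sketch below assumes a hypothetical zero-padded "seq" counter field (padded because string fields sort lexically) and a hypothetical "name" field; neither is provided by Lucy.

```perl
use strict;
use warnings;
use File::Temp qw( tempdir );
use Lucy::Plan::Schema;
use Lucy::Plan::StringType;
use Lucy::Index::Indexer;
use Lucy::Search::IndexSearcher;
use Lucy::Search::MatchAllQuery;
use Lucy::Search::SortRule;
use Lucy::Search::SortSpec;

# Hypothetical schema: "seq" records the order of addition and is sortable.
my $schema = Lucy::Plan::Schema->new;
$schema->spec_field(
    name => 'seq',
    type => Lucy::Plan::StringType->new( sortable => 1 ),
);
$schema->spec_field( name => 'name', type => Lucy::Plan::StringType->new );

my $index_path = tempdir( CLEANUP => 1 );
my $indexer    = Lucy::Index::Indexer->new(
    index  => $index_path,
    schema => $schema,
    create => 1,
);

# Assign each document an explicit, zero-padded sequence number.
my $seq = 0;
for my $name (qw( first second third )) {
    $indexer->add_doc( { seq => sprintf( '%010d', ++$seq ), name => $name } );
}
$indexer->commit;

# Sort by our own field instead of relying on doc id order.
my $sort_spec = Lucy::Search::SortSpec->new(
    rules => [ Lucy::Search::SortRule->new( field => 'seq' ) ],
);
my $searcher = Lucy::Search::IndexSearcher->new( index => $index_path );
my $hits     = $searcher->hits(
    query     => Lucy::Search::MatchAllQuery->new,
    sort_spec => $sort_spec,
);

my @names;
while ( my $hit = $hits->next ) {
    push @names, $hit->{name};
}
```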