Tanl Linguistic Pipeline |
Iterates through postings, but skipping those for documents which have been remapped. More...
#include <PostingList.h>
Public Member Functions | |
remap_iterator (size_type s, byte const *p, int displace, Remap &remap) | |
remap_iterator & | operator++ () |
Advance a PostingList::const_iterator. |
Iterates through postings, but skipping those for documents which have been remapped.
IXE::PostingList::remap_iterator::remap_iterator | ( | size_type | s, | |
byte const * | p, | |||
int | displace, | |||
Remap & | remap | |||
) | [inline] |
remap | maps DocIDs of replaced documents to those which replace them. |
References IXE::PostingList::const_iterator::c_, operator++(), IXE::PostingList::const_iterator::rest_, IXE::PostingList::const_iterator::size_, IXE::PostingList::const_iterator::table_, and IXE::PostingList::const_iterator::tablesz_.
PostingList::remap_iterator & IXE::PostingList::remap_iterator::operator++ | ( | ) |
Advance a PostingList::const_iterator.
A posting has the following format:
I[0x80{M}...0x80]OL{H}^O
that is: a DocID (I) followed by zero or more TermColors (M) surrounded by 0x80 bytes, followed by the number of occurrences in the document (O), followed by the byte length of the hitlist less O (L), followed by O hits, i.e. positions where the word occurs in document I. Each H is a document position, represented as delta increment with respect to the previous one. First word is at position 1.
Reimplemented from IXE::PostingList::const_iterator.
References IXE::PostingList::const_iterator::c_, IXE::PostingList::const_iterator::index(), IXE::PostingList::const_iterator::next(), and IXE::PostingList::const_iterator::rest_.
Referenced by remap_iterator().