Penguin

A type of file format.

An InvertedIndex is a series of entries, one for each item in the Lexicon?. Each entry contains the document number (and often the offset into the document) for each occurrance of the word in the original corpus.