Field that indexes int
values for efficient range filtering and sorting. Here's an example usage:
document.add(new IntField(name, 6, Field.Store.NO));
For optimal performance, re-use the
IntField
and {@link Document} instance for more than one document:
IntField field = new IntField(name, 6, Field.Store.NO); Document document = new Document(); document.add(field); for(all documents) { ... field.setIntValue(value) writer.addDocument(document); ... }
See also {@link LongField}, {@link FloatField}, {@link DoubleField}.
To perform range querying or filtering against a IntField
, use {@link NumericRangeQuery} or {@link NumericRangeFilter}. To sort according to a IntField
, use the normal numeric sort types, eg {@link org.apache.lucene.search.SortField.Type#INT}. IntField
values can also be loaded directly from {@link FieldCache}.
You may add the same field name as an IntField
to the same document more than once. Range querying and filtering will be the logical OR of all values; so a range query will hit all documents that have at least one value in the range. However sort behavior is not defined. If you need to sort, you should separately index a single-valued IntField
.
An IntField
will consume somewhat more disk space in the index than an ordinary single-valued field. However, for a typical index that includes substantial textual content per document, this increase will likely be in the noise.
Within Lucene, each numeric value is indexed as a trie structure, where each term is logically assigned to larger and larger pre-defined brackets (which are simply lower-precision representations of the value). The step size between each successive bracket is called the precisionStep
, measured in bits. Smaller precisionStep
values result in larger number of brackets, which consumes more disk space in the index but may result in faster range search performance. The default value, 4, was selected for a reasonable tradeoff of disk space consumption versus performance. You can create a custom {@link FieldType} and invoke the {@link FieldType#setNumericPrecisionStep} method if you'dlike to change the value. Note that you must also specify a congruent value when creating {@link NumericRangeQuery} or {@link NumericRangeFilter}. For low cardinality fields larger precision steps are good. If the cardinality is < 100, it is fair to use {@link Integer#MAX_VALUE}, which produces one term per value.
For more information on the internals of numeric trie indexing, including the precisionStep
configuration, see {@link NumericRangeQuery}. The format of indexed values is described in {@link NumericUtils}.
If you only need to sort by numeric value, and never run range querying/filtering, you can index using a precisionStep
of {@link Integer#MAX_VALUE}. This will minimize disk space consumed.
More advanced users can instead use {@link NumericTokenStream} directly, when indexing numbers. Thisclass is a wrapper around this token stream type for easier, more intuitive usage.
@since 2.9