A logical representation of a set of data entities.
Logically, all datasets have two generic properties: a name, and a descriptor that holds information such as the dataset's schema and its partitioning information. Concrete implementations of {@code Dataset} may supportadditional properties, mandatory or otherwise, as needed. {@code Dataset}s are not normally instantiated directly, but managed by a repository (also implementation-specific).
Implementations of {@link Dataset} are immutable.
@param < E> The type of entities stored in this {@code Dataset}. @see View @see DatasetRepository @see DatasetWriter @see DatasetReader @see PartitionStrategy @see DatasetDescriptor @see Schema
|
|
|
|