-
Notifications
You must be signed in to change notification settings - Fork 313
Closed
Description
Tasks
- specs: Add Schema
- Visitor pattern
- Binding visitor to bind an UnboundPredicate
- Expressions:
- Term trait
- Unbound trait
- Expression trait
- Bound trait
- Visitor pattern
- specs: Add TableMetadata
- Support for both V1 and V2.
- When read, set V2 fields (
schemas
,current-schema-id
,partition-specs
,default-spec-id
,last-partition-id
,sort-orders
,default-sort-order-id
,refs
) so we can just assume that it is a V2 table.
- specs: Add Manifest
- ManifestList and Manifest
- specs: Transforms
- Iceberg has a set of transforms that need to be implemented
- For the
bucket
transforms, the values need to be hashed.
- io: How to handle IO? Support for GCS, S3, ADLS, etc. Do we want something similar to the FileIO in (Py)Iceberg.
- arrow: Reading data from iceberg to arrow record batches
- arrow: Writing unpartitioned data into iceberg from arrow record batches
- arrow: Writing partitioned data into iceberg from arrow record batches
- api: Table API design
- api: Catalog API design #23
- rest catalog
Future tasks
- PartitionEvaluator: Able to prune files on the ManifestList and Manifest level by taking the partition predicate into account.
- MetricsEvaluator: Able to prune DataFiles based on the column metrics on the Manifest
JanKaul, liurenjie1024, Fokko and Samrose-Ahmed
Metadata
Metadata
Assignees
Labels
No labels