-
Notifications
You must be signed in to change notification settings - Fork 356
Closed
Labels
good first issueGood for newcomersGood for newcomers
Milestone
Description
Feature Request / Improvement
It looks like a misnamed field slipped in:
{
"status": 1,
"snapshot_id": {
"long": 898025966831056900
},
"data_sequence_number": null,
"file_sequence_number": null,
"data_file": {
"content": 0,
"file_path": "/tmp/some.db/tablev2/data/00000-0-93717a88-1cea-4e3d-a69a-00ce3d087822.parquet",
"file_format": "PARQUET",
"partition": {},
"record_count": 3,
"file_size_in_bytes": 5459,
"column_sizes": { ... },
"value_counts": { ... },
"null_value_counts": { ... },
"nan_value_counts": { ... },
"lower_bounds": { ... },
"upper_bounds": { ... },
"key_metadata": null,
"split_offsets": {
"array": [
4
]
},
"equality_ids": null,
"sort_order_id": null
}
}
This should be sequence_number
:
Luckily this still worked due to Iceberg's field-id based lookup, but would be good to get this cleaned up.
Relevant code:
iceberg-python/pyiceberg/manifest.py
Line 380 in a8d3f17
NestedField(3, "data_sequence_number", LongType(), required=False), |
amogh-jahagirdar
Metadata
Metadata
Assignees
Labels
good first issueGood for newcomersGood for newcomers