-
Notifications
You must be signed in to change notification settings - Fork 174
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[FEAT] Support Reading Iceberg Merge-on-Read Position Deletes (#2563)
When position deletes are present, I chose to disable row-group splitting since it would complicate the logic to implement and it's unlikely that people are storing a small number of very large individual files in Iceberg. If there is a clean way to do this I wouldn't be opposed to implementing it though. This PR also does a slight refactor of the `DataFileSource` enum to consolidate `AnonymousDataFile` and `CatalogDataFile` and remove parameters that aren't relevant to database sources. In a future PR, I would like to do some refactoring of our `ScanTask` struct in order to ensure some relationships between the data sources and file formats. For example a database format would not make sense to exist with a parquet source, but we implicitly allow it right now. This has already caused parts of our code to be messy and make unnecessary checks
- Loading branch information
1 parent
4701290
commit c24635e
Showing
15 changed files
with
397 additions
and
177 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -171,6 +171,7 @@ pub mod pylib { | |
&schema_infer_options, | ||
None, | ||
None, | ||
None, | ||
)? | ||
.into_iter() | ||
.map(|v| v.into()) | ||
|
Oops, something went wrong.