[FEAT][1/2] Support Iceberg renaming of columns #4060
release-drafter.yml
on: pull_request
update_release_draft
9s
label
4s
Annotations
2 errors and 1 warning
update_release_draft
Validation Failed: {"resource":"Release","code":"invalid","field":"target_commitish"}
{
name: 'HttpError',
id: '8029389280',
status: 422,
response: {
url: 'https://api.github.com/repos/Eventual-Inc/Daft/releases/142827731',
status: 422,
headers: {
'access-control-allow-origin': '*',
'access-control-expose-headers': 'ETag, Link, Location, Retry-After, X-GitHub-OTP, X-RateLimit-Limit, X-RateLimit-Remaining, X-RateLimit-Used, X-RateLimit-Resource, X-RateLimit-Reset, X-OAuth-Scopes, X-Accepted-OAuth-Scopes, X-Poll-Interval, X-GitHub-Media-Type, X-GitHub-SSO, X-GitHub-Request-Id, Deprecation, Sunset',
connection: 'close',
'content-length': '195',
'content-security-policy': "default-src 'none'",
'content-type': 'application/json; charset=utf-8',
date: 'Sat, 24 Feb 2024 08:32:33 GMT',
'referrer-policy': 'origin-when-cross-origin, strict-origin-when-cross-origin',
server: 'GitHub.com',
'strict-transport-security': 'max-age=31536000; includeSubdomains; preload',
vary: 'Accept-Encoding, Accept, X-Requested-With',
'x-accepted-github-permissions': 'contents=write',
'x-content-type-options': 'nosniff',
'x-frame-options': 'deny',
'x-github-api-version-selected': '2022-11-28',
'x-github-media-type': 'github.v3; format=json',
'x-github-request-id': '0407:3ED2:CBB4C0:1958583:65D9A9A1',
'x-ratelimit-limit': '1000',
'x-ratelimit-remaining': '994',
'x-ratelimit-reset': '1708767150',
'x-ratelimit-resource': 'core',
'x-ratelimit-used': '6',
'x-xss-protection': '0'
},
data: {
message: 'Validation Failed',
errors: [
{
resource: 'Release',
code: 'invalid',
field: 'target_commitish'
}
],
documentation_url: 'https://docs.github.com/rest/releases/releases#update-a-release'
}
},
request: {
method: 'PATCH',
url: 'https://api.github.com/repos/Eventual-Inc/Daft/releases/142827731',
headers: {
accept: 'application/vnd.github.v3+json',
'user-agent': 'probot/12.2.5 octokit-core.js/3.5.1 Node.js/16.20.2 (linux; x64)',
authorization: 'token [REDACTED]',
'content-type': 'application/json; charset=utf-8'
},
body: '{"body":"## Changes\\n\\n- [FEAT] Add str.lower() function @nsalerni (#1938)\\n\\n## ✨ New Features\\n\\n- [FEAT] Time Array @colin-ho (#1892)\\n- [FEAT] Add str.lstrip() and str.rstrip() functions @nsalerni (#1944)\\n- [FEAT] Add str.upper() function @nsalerni (#1942)\\n\\n## 📖 Documentation\\n\\n- [DOCS] Add documentation for using and developing Daft on Ray @kevinzwang (#1896)\\n- [DOCS] Update schema hints documentation @jaychia (#1935)\\n\\n## 🧰 Maintenance\\n\\n- [CHORE] Remove non-`MicroPartition` and non-`ScanOperator` paths @clarkzinzow (#1946)\\n- [CHORE] Populate previews only when show() or \\\\_\\\\_repr\\\\_\\\\_() is called @colin-ho (#1889)\\n- [CHORE] Update segment endpoint @jaychia (#1902)\\n","draft":true,"prerelease":false,"make_latest":"true","name":"v0.2.17","tag_name":"v0.2.17","target_commitish":"refs/pull/1937/merge"}',
request: {}
},
event: {
id: '8029389280',
name: 'pull_request',
payload: {
action: 'edited',
changes: {
body: {
from: '# Summary\r\n' +
'\r\n' +
'Support field_id renaming of Parquet files along the codepath:\r\n' +
'\r\n' +
'1. `IcebergScanOperator`\r\n' +
'2. Generates `ScanTasks`, each containing the `field_id_mapping: Arc<{i32: Field}>`\r\n' +
'3. Propagated to workers through the `ScanWithTask` instruction object\r\n' +
'4. Micropartitions are created with `MicroPartition::from_scan_task`\r\n' +
'5. This then calls into `read_parquet_into_micropartition`\r\n' +
' a. If statistics are available, it will create an unloaded MicroPartition by creating a new ScanTask (hydrated with statistics) and then calling `MicroPartition::new_unloaded(new_scan_task)`. \r\n' +
|
update_release_draft
HttpError: Validation Failed: {"resource":"Release","code":"invalid","field":"target_commitish"}
at /home/runner/work/_actions/release-drafter/release-drafter/v5/dist/index.js:8462:21
at processTicksAndRejections (node:internal/process/task_queues:96:5)
at async Job.doExecute (/home/runner/work/_actions/release-drafter/release-drafter/v5/dist/index.js:30793:18)
{
name: 'AggregateError',
event: {
id: '8029389280',
name: 'pull_request',
payload: {
action: 'edited',
changes: {
body: {
from: '# Summary\r\n' +
'\r\n' +
'Support field_id renaming of Parquet files along the codepath:\r\n' +
'\r\n' +
'1. `IcebergScanOperator`\r\n' +
'2. Generates `ScanTasks`, each containing the `field_id_mapping: Arc<{i32: Field}>`\r\n' +
'3. Propagated to workers through the `ScanWithTask` instruction object\r\n' +
'4. Micropartitions are created with `MicroPartition::from_scan_task`\r\n' +
'5. This then calls into `read_parquet_into_micropartition`\r\n' +
' a. If statistics are available, it will create an unloaded MicroPartition by creating a new ScanTask (hydrated with statistics) and then calling `MicroPartition::new_unloaded(new_scan_task)`. \r\n' +
' b. Otherwise, it falls back into `read_parquet_bulk`, which has been modified to correctly handle `field_id_mapping`\r\n' +
'\r\n' +
'This PR ensures that when data/statistics are read from Parquet files, we correctly apply renaming according to `field_id_mapping`.\r\n' +
'\r\n' +
'## Drive-By\r\n' +
'\r\n' +
'Refactors to clean-up MicroPartitions/ScanTasks and schemas:\r\n' +
'\r\n' +
"1. Refactored `MicroPartition::new_unloaded`: it no longer accepts a `schema` argument; instead internally it will just use the ScanTask's `.materialized_schema()`\r\n" +
'2. Refactored `read_parquet_into_micropartition` to significantly reduce code deduplication\r\n' +
'3. Removed the usage of `MicroPartition.cast_to_schema`: this utility is now actually no longer used in the ScanOperator read path. However we cannot completely remove it because it is still used in our `use_native_downloader=False` read path.\r\n' +
'\r\n' +
'## Remaining todos:\r\n' +
'\r\n' +
'- [x] Fix logic with column pruning (need to apply column pruning after applying the field ID mappings)\r\n' +
'- [x] Perform correct renaming for statistics parsing from Parquet metadata\r\n' +
'- [ ] Perform recursive renaming for Series and for Schema'
}
},
number: 1937,
organization: {
avatar_url: 'https://avatars.githubusercontent.com/u/98941975?v=4',
description: 'Eventual Computing',
events_url: 'https://api.github.com/orgs/Eventual-Inc/events',
hooks_url: 'https://api.github.com/orgs/Eventual-Inc/hooks',
id: 98941975,
issues_url: 'https://api.github.com/orgs/Eventual-Inc/issues',
login: 'Eventual-Inc',
members_url: 'https://api.github.com/orgs/Eventual-Inc/members{/member}',
node_id: 'O_kgDOBeW8Fw',
public_members_url: 'https://api.github.com/orgs/Eventual-Inc/public_members{/member}',
repos_url: 'https://api.github.com/orgs/Eventual-Inc/repos',
url: 'https://api.github.com/orgs/Eventual-Inc'
},
pull_request: {
_links: {
comments: {
href: 'https://api.github.com/repos/Eventual-Inc/Daft/issues/1937/comments'
},
commits: {
href: 'https://api.github.com/repos/Eventual-Inc/Daft/pulls/1937/commits'
},
html: { href: 'https://github.com/Eventual-Inc/Daft/pull/1937' },
issue: {
href: 'https://api.github.com/repos/Eventual-Inc/Daft/issues/1937'
},
review_comment: {
href: 'https://api.github.com/repos/Event
|
update_release_draft
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: release-drafter/release-drafter@v5. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|