2020-07-15 (GCS 2.1.4, BQ 1.1.4)
Changelog
Cloud Storage connector:
-
Added a new parameter to configure output stream pipe type:
fs.gs.outputstream.pipe.type (default: IO_STREAM_PIPE)
Valid values are
NIO_CHANNEL_PIPE
andIO_STREAM_PIPE
.Output stream now supports (when property value set to
NIO_CHANNEL_PIPE
) Java NIO Pipe that allows to reliably write in the output stream from multiple threads without "Pipe broken" exceptions.Note that when using
NIO_CHANNEL_PIPE
option maximum upload throughput can decrease by 10%. -
Throw
ClosedChannelException
inGoogleHadoopOutputStream.write
methods if stream already closed. This fixes Spark Streaming jobs checkpointing to Cloud Storage. -
Add a property to impersonate a service account:
fs.gs.auth.impersonation.service.account (not set by default)
If this property is set, an access token will be generated for this service account to access GCS. The caller who issues a request for the access token must have been granted the Service Account Token Creator role (
roles/iam.serviceAccountTokenCreator
) on the service account to impersonate. -
Add properties to impersonate a service account through user or group name:
fs.gs.auth.impersonation.service.account.for.user.<USER_NAME> (not set by default) fs.gs.auth.impersonation.service.account.for.group.<GROUP_NAME> (not set by default)
If any of these properties is set, an access token will be generated for the service account associated with specified user name or group name in order to access GCS. The caller who issues a request for the access token must have been granted the Service Account Token Creator role (
roles/iam.serviceAccountTokenCreator
) on the service account to impersonate. -
Update all dependencies to latest versions.
Big Query connector:
- Update all dependencies to latest versions.