Skip to content

Kafka Connect suite of connectors for Cloud storage (Amazon S3)

License

Notifications You must be signed in to change notification settings

sovrn/kafka-connect-storage-cloud

 
 

Repository files navigation

Kafka Connect Connector for S3

FOSSA Status

kafka-connect-storage-cloud is the repository for Confluent's Kafka Connectors designed to be used to copy data from Kafka into Amazon S3.

Kafka Connect Sink Connector for Amazon Simple Storage Service (S3)

Documentation for this connector can be found here.

Blogpost for this connector can be found here.

Development

To build a development version you'll need a recent version of Kafka as well as a set of upstream Confluent projects, which you'll have to build from their appropriate snapshot branch. See the kafka-connect-storage-common FAQ for guidance on this process.

You can build kafka-connect-storage-cloud with Maven using the standard lifecycle phases.

Contribute

License

This project is licensed under the Confluent Community License.

FOSSA Status

Sovrn-specific build

This connector does have a Maven package target that attempts to add the necessary support jars to run the connector properly inside Kafka Connect. It has been our observation that this group of jars is insufficient. Once the connector has been packaged by Maven, the following jars need to be added to the group (which is located in kafka-connect-s3/target/kafka-connect-s3-<version>-SNAPSHOT-package/share/java/kafka-connect-s3).

avro-1.9.2.jar kafka-avro-serializer-6.0.1.jar kafka-connect-avro-data-6.0.1.jar kafka-connect-storage-common-10.0.5.jar kafka-connect-storage-core-10.0.5.jar kafka-connect-storage-format-10.0.5.jar kafka-connect-storage-partitioner-10.0.5.jar kafka-schema-registry-client-6.0.1.jar kafka-schema-serializer-6.0.1.jar parquet-avro-1.11.1.jar parquet-column-1.11.1.jar parquet-common-1.11.1.jar parquet-encoding-1.11.1.jar parquet-format-structures-1.11.1.jar parquet-hadoop-1.11.1.jar

Once the extra jars (which you can get from the S3 sink connector at Confluent Hub) have been added to the directory, go up one directory (java), and create the tarball using tar cvfz patched-kafka-connect-s3.tgz kafka-connect-s3.

This tarball is then uploaded to Artifactory using jfrog rt u patched-kafka-connect-s3.tgz raw-sovrn/exchange/patched-kafka-connect-s3.tgz. Once this upload is complete, the tarball is ready to be used by ansible-kafkaconnect.

About

Kafka Connect suite of connectors for Cloud storage (Amazon S3)

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Java 98.1%
  • HTML 1.9%