FastSync
- To Snowflake: Support for IAM roles, AWS Session Tokens and to pass credentials as environment variables
Tap Kafka
- Bump
pipelinewise-tap-kafka
to 3.0.0- Add local storage of consumed messages and instant commit kafka offsets
- Add more configurable options:
consumer_timeout_ms
,session_timeout_ms
,heartbeat_interval_ms
,max_poll_interval_ms
- Add two new fixed output columns:
MESSAGE_PARTITION
andMESSAGE_OFFSET
Tap Snowflake
- Bump
pipelinewise-tap-snowflake
to 2.0.0- Discover only the required tables to avoid issues when too many tables in the database causing
SHOW COLUMNS
column to return more than the maximum 10000 rows
- Discover only the required tables to avoid issues when too many tables in the database causing
Target Snowflake
- Bump
pipelinewise-target-snowflake
to 1.6.3- Generate compressed CSV files by default. Optionally can be disabled by the
no_compression
config option
- Generate compressed CSV files by default. Optionally can be disabled by the
- Support tap/target config files with
.yaml
extension when importing config - Fixed dependency conflict in install script
- Fixed an issue when
add_metadata_columns
was not defined ininheritable_config.json
FastSync
- From MySQL: Increased default batch size to 50.000 rows when fastsync exporting data from MySQL tables
- To Snowflake: Log inserts, updates and csv file sizes in the same format to target-snowflake connector
Tap Kafka
- Bump
pipelinewise-tap-kafka
to 2.1.1- Commit offset from the state file and not from the consumed messages
Tap Snowflake
- Bump
pipelinewise-tap-snowflake
to 1.1.2- Fixed some dependency conflicts
Target Snowflake
- Bump
pipelinewise-target-snowflake
to 1.6.2- Log inserts, updates and csv file sizes in a more consumable format
Singer transformation
- Make tranformation consistent between FastSync and Singer by updating transform-field to transform without trimming.
tap-snowflake
- Remove PIPELINEWISE.COLUMNS cache table.
FastSync S3-csv to Snowflake
- Fix bug when
date_overrides
is present.
FastSync and singer target-snowflake
- Remove PIPELINEWISE.COLUMNS cache table.
FastSync Postgres
- Support reserved words as table names.
Install script
- update script to search full name plugins.
Tap Postgres
- Bump
pipelinewise-tap-postgres
to 1.5.1- Support per session wal_sender_timeout
FastSync Postgres & Mysql
- fix "'NoneType' object has no attribute 'upper'" that happens when table has no PK.
- fix "Information schema query returned too much data".
FastSync Postgres - Handle reserved words in column names in FastSync from PostgreSQL
- Bump
ansible
to 2.7.16
FastSync MySQL
- Handle reserved words in column names in FastSync from MySQL
- Fixed issue when parallelism
and parallelism_max
parameters were not used in tap YAML files
Tap Postgres
- Bump
tap-postgres
to 1.4.1- Remove unused timestamps in log
Logging refactoring:
- Structured logs in Pipelinewise, FastSync and majority of plugins.
- Include a logging config file in Pipelinewise repository and package here.
- Ability to provide a custom logging config by setting the env variable
LOGGING_CONF_FILE
to be the path to the.conf
file
Tap Jira
- Bump
tap-jira
to 2.0.0- Update key property for stream users
FastSync MySQL - Fix bug: map BINARY MySQL column to BINARY type IN SF
Transform field
- Bump
pipelinewise-transform-field
to 1.1.2- Make validation turned off by default.
- FastSync: Changed the default /tmp folder for snowflake encryption
Target Snowflake
- Bump
pipelinewise-target-snowflake
to 1.4.1- Changed the default /tmp folder for encryption
- FastSync: Support BINARY and VARBINARY column types from MySQL sources
- FastSync: Fixed an issue when
MASK-HIDDEN
type of transformations were not applied in Snowflake targets - Write temporary files to
~/.pipelinewise/tmp
directory - Add
stop_tap
command - Fixed an issue when post import Primary Keys check was not working correctly
- Fixed an issue when
discover_tap
command sometimes was failing
Tap MySQL
- Bump
pipelinewise-tap-mysql
to 1.1.3- Support to extract BINARY and VARBINARY column types
- Improved performance of reading data from MySQL binary log
- Increase default session
wait_timeout
to 28800 - Increase default session
innodb_lock_wait_timeout
to 3600
Tap S3 CSV
- Bump
pipelinewise-tap-s3-csv
to 1.0.7- Improved column type guesser
Tap Kafka
- Bump
pipelinewise-tap-kafka
to 2.0.0- Rewamp output schema, export the consumed JSON messages from Kafka topics to fixed columns
- Disable data flattening
Target Snowflake
- Bump
pipelinewise-target-snowflake
to 1.3.0- Load binary data into Snowflake
BINARY
column types - Adjust timestamps from taps automatically to the max allowed
9999-12-31 23:59:59
when it's required - Add
validate_record
optional parameter and default to False - Add
temp_dir
optional parameter to overwrite system defaults
- Load binary data into Snowflake
- FastSync: Add fastsync support from S3-CSV to Snowflake
- Add post import checks to detect tables with no primary key early
- Add optional
--connectors
to the install script to install taps and targets selectively
Tap Zendesk
- Forked singer connector to
pipelinewise-tap-zendesk==1.0.0
- Improved performance by getting data from Zendesk API in parallel
Tap Postgres
- Bump
pipelinewise-tap-postgres
to 1.3.0- Add
max_run_seconds
configurable option - Add
break_at_end_lsn
configurable option - Only send feedback when lsn_comitted has increased
- Add
Tap Snowflake
- Bump
pipelinewise-tap-snowflake
to 1.0.5- Bump
snowflake-connector-python
to 2.0.4
- Bump
Tap Kafka
- Bump
pipelinewise-tap-kafka
to 1.0.2- Add
encoding
configurable option
- Add
Target Redshift
- Bump
pipelinewise-target-redshift
to 1.1.0- Emit new state message as soon as data flushed to Redshift
- Add
flush_all_streams
option - Add
max_parallelism
option
- Save state message as soon as received from a target connector
- Fixed issue when docker executable not started on non bash enabled systems
- Exit gracefully on SIGINT (CTRL+C) and SIGTERM (kill)
- Add tap run summary table when tap run finished
- Add
--extra_log
optional parameter torun_tap
command in CLI - Add
validate
command to CLI - Optimised string formatting
- More accurate logging of number of exported rows in MySQL FastSync
- Fixed an issue when Snowflake cache table was not refreshed after FastSync comleted from MySQL to Snowflake
Tap Postgres
- Bump
pipelinewise-tap-postgres
to 1.2.0- Bump to
psycopg2
2.8.4 with auto keep-alive feature - Remove LOG_BASED stream bookmarks from state if it has been de-selected
- Convert time with timezone columns to UTC
- Updating stream to lsn position before sending STATE message
- Removed database name from stream-id
- Bump to
- FastSync: Convert time with timezone columns to UTC
Target Snowflake
- Bump
snowflake-connector-python
to 2.0.3 - Bump
pipelinewise-target-snowflake
to 1.1.6- Emit state message as soon as new data flushed and loaded into Snowflake
- Enforce autocommit and secure connection
- Optional
flush_all_streams
option - Configurable
parallelism
option - Configurable
parallelism_max
option - Fixed issue when updating bookmarks failed when no STATE message received from tap
- FastSync: Enforce autocommit and secure connection
Target Redshift
- Bump pipelinewise-target-redshift to 1.0.7
- Configurable COPY option
- Configurable parallelism option
- Grant permissions to users and groups individually
- FastSync: Grant permissions to users and groups individually
Target Postgres
- Bump pipelinewise-target-postgres to 1.0.4
- Fixed issue when permission not granted correctly on newly created tables
- Updated Tap Postgres, Tap Redshift pages with new features
- Removed
sync_period
references
Transform Field
- Bump pipelinewise-transform-field to 1.1.1
- Add MASK-HIDDEN transformation type
Tap S3-CSV
- Bump pipelinewise-tap-s3-csv to 1.0.5
- Add non-AWS S3 support
Tap Postgres
- Bump pipelinewise-tap-postgres to 1.1.6
- FastSync: Fixed issue when 24:00:00 formatted timestamps not loaded from Postgres to Snowflake
Target Redshift
- Bump pipelinewise-target-redshift to 1.0.6
- Fixed issue when AWS credentials sometimes were visible in logs
- Updated Tap S3 CSV pages
- Add contribution page
Tap Postgres
- Bump tap-postgres to 1.1.5
- Lowercase pg_replication slot name
- FastSync: Lowercase pg_replication slot name
Target Redshift
- Bump pipelinewise-target-redshift to 1.0.5
- Set varchar column length dynamically
- FastSync: Set varchar column length dynamically
Tap Oracle
- Add Tap Oracle singer connector
- Add Oracle Instant Client to docker image
- Fixed sample YAML files for multiple connectors
- Fixed typos in multiple pages
- Fixed hard_delete option
- Updated contributors
- Add Tap Oracle
- Build docker image with no pipelinewise user
- Fixed issue when arguments were not passed correctly to docker container
- Initial release