Add support for the Firehose equivalent of AdaptivePollingRecordPublisher #19

Open
bobtiernay-okta opened this issue Apr 15, 2021 · 1 comment

@bobtiernay-okta

Currently, the FirehoseProducer has a very simple linear backoff model. When rate limiting on the Firehose stream is encountered, the recovery time could be greatly improved by implementing a strategy similar to AdaptivePollingRecordPublisher's adaptRecordsToRead method:

	/**
	 * Calculates how many records to read each time through the loop based on a target throughput
	 * and the measured frequency of the loop.
	 * @param runLoopTimeNanos The total time of one pass through the loop
	 * @param numRecords The number of records of the last read operation
	 * @param recordBatchSizeBytes The total batch size of the last read operation
	 * @param maxNumberOfRecordsPerFetch The current maxNumberOfRecordsPerFetch
	 * @return the adapted maxNumberOfRecordsPerFetch
	 */
	private int adaptRecordsToRead(long runLoopTimeNanos, int numRecords, long recordBatchSizeBytes,
			int maxNumberOfRecordsPerFetch) {
		if (numRecords != 0 && runLoopTimeNanos != 0) {
			long averageRecordSizeBytes = recordBatchSizeBytes / numRecords;
			// Adjust the number of records to fetch from the shard depending on the current average
			// record size, to optimize for the 2 MB/sec per-shard read limit
			double loopFrequencyHz = 1000000000.0d / runLoopTimeNanos;
			double bytesPerRead = KINESIS_SHARD_BYTES_PER_SECOND_LIMIT / loopFrequencyHz;
			maxNumberOfRecordsPerFetch = (int) (bytesPerRead / averageRecordSizeBytes);
			// Clamp the value to at least 1 and at most DEFAULT_SHARD_GETRECORDS_MAX (10,000)
			maxNumberOfRecordsPerFetch = Math.max(1, Math.min(maxNumberOfRecordsPerFetch, ConsumerConfigConstants.DEFAULT_SHARD_GETRECORDS_MAX));

			// Set metrics
			metricsReporter.setLoopFrequencyHz(loopFrequencyHz);
			metricsReporter.setBytesPerRead(bytesPerRead);
		}
		return maxNumberOfRecordsPerFetch;
	}
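
For comparison, here is a minimal sketch of how the same throughput-based adaptation could look on the Firehose write path. Everything below is hypothetical: the class name, the adaptRecordsToWrite method, and the quota constants are illustrations and not part of the connector. The constants assume the documented AWS defaults of roughly 5 MiB/sec ingestion per delivery stream (in many regions; quotas vary by region and can be raised) and at most 500 records per PutRecordBatch call.

	// Hypothetical sketch only: neither this class nor its constants exist in the
	// connector today. The quota values are assumptions based on documented AWS
	// defaults and are region- and account-dependent.
	public class AdaptiveFirehoseBatchSizer {

		// Assumed default per-delivery-stream ingestion quota (region-dependent)
		private static final long FIREHOSE_BYTES_PER_SECOND_LIMIT = 5L * 1024 * 1024;

		// PutRecordBatch accepts at most 500 records per call
		private static final int MAX_RECORDS_PER_PUT_RECORD_BATCH = 500;

		/**
		 * Calculates how many records to send in the next batch based on a target
		 * throughput and the measured frequency of the write loop, mirroring
		 * adaptRecordsToRead above.
		 * @param runLoopTimeNanos The total time of one pass through the write loop
		 * @param numRecords The number of records in the last batch
		 * @param batchSizeBytes The total size in bytes of the last batch
		 * @param maxRecordsPerBatch The current maximum batch size
		 * @return the adapted maximum batch size
		 */
		public int adaptRecordsToWrite(long runLoopTimeNanos, int numRecords,
				long batchSizeBytes, int maxRecordsPerBatch) {
			if (numRecords != 0 && runLoopTimeNanos != 0) {
				long averageRecordSizeBytes = batchSizeBytes / numRecords;
				// How often the write loop completes, in Hz
				double loopFrequencyHz = 1_000_000_000.0d / runLoopTimeNanos;
				// Bytes we can send per iteration without exceeding the quota
				double bytesPerWrite = FIREHOSE_BYTES_PER_SECOND_LIMIT / loopFrequencyHz;
				maxRecordsPerBatch = (int) (bytesPerWrite / averageRecordSizeBytes);
				// Clamp to [1, 500], the per-call record limit of PutRecordBatch
				maxRecordsPerBatch = Math.max(1,
						Math.min(maxRecordsPerBatch, MAX_RECORDS_PER_PUT_RECORD_BATCH));
			}
			return maxRecordsPerBatch;
		}
	}

Sizing batches against the measured loop frequency in this way would let the producer converge back to full throughput quickly after throttling, instead of stepping down and up on a fixed linear schedule.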
@GS-KeithLee

Thank you for reporting the issue. We will add this to our backlog. Related: #17
