Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve checkstyle section on contributing guide #107

Merged
merged 1 commit into from
Oct 3, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file removed .readme/contributing/check_style.png
Binary file not shown.
Binary file removed .readme/contributing/reformat_code.png
Binary file not shown.
Binary file removed .readme/contributing/save_actions.png
Binary file not shown.
Binary file removed .readme/contributing/scan.png
Binary file not shown.
63 changes: 29 additions & 34 deletions CONTRIBUTING.md
Original file line number Diff line number Diff line change
@@ -1,29 +1,41 @@
# Contributing to Kstreamplify

Welcome to our contribution guide! We're thrilled that you're interested in contributing to our project. This guide will help you understand the process and expectations for contributing, so that we can work together effectively and efficiently.
Welcome to our contribution guide! We're thrilled that you're interested in contributing to our project. This guide will
help you understand the process and expectations for contributing, so that we can work together effectively and
efficiently.

## Getting Started

### Issues

Issues should be used to report problems, request a new feature, or to discuss potential changes before a PR is created. When you create a new Issue, a template will be loaded that will guide you through collecting and providing the information we need to investigate.
Issues should be used to report problems, request a new feature, or to discuss potential changes before a PR is created.
When you create a new Issue, a template will be loaded that will guide you through collecting and providing the
information we need to investigate.

If you find an existing issue that addresses the problem you're having, please add your own reproduction information to the existing issue instead of creating a new one. Adding a [reaction](https://github.blog/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) can also indicate to our maintainers that a particular problem is affecting more than just the reporter.
If you find an existing issue that addresses the problem you're having, please add your own reproduction information to
the existing issue instead of creating a new one. Adding
a [reaction](https://github.blog/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) can also indicate to
our maintainers that a particular problem is affecting more than just the reporter.

If you're unable to find an open issue addressing the problem, open a new one. Be sure to include a title and a clear description, relevant information, and a code sample or executable test case demonstrating the expected behavior that is not occurring.
If you're unable to find an open issue addressing the problem, open a new one. Be sure to include a title and a clear
description, relevant information, and a code sample or executable test case demonstrating the expected behavior that is
not occurring.

### Pull Requests

PRs are always welcome and can be a quick way to get your fix or improvement slated for the next release. In general, PRs should:
PRs are always welcome and can be a quick way to get your fix or improvement slated for the next release. In general,
PRs should:

- Only fix/add the functionality in question OR address wide-spread style issues, not both.
- Add unit or integration tests for fixed or changed functionality (if a test suite already exists).
- Address a single concern in the least number of changed lines as possible.
- Be accompanied by a complete Pull Request template (loaded automatically when a PR is created).

Be sure to use the past tense ("Added new feature...", "Fixed bug on...") and add tags to the PR ("documentation" for documentation updates, "bug" for bug fixing, etc.).
Be sure to use the past tense ("Added new feature...", "Fixed bug on...") and add tags to the PR ("documentation" for
documentation updates, "bug" for bug fixing, etc.).

For changes that address core functionality or would require breaking changes (e.g. a major release), it's best to open an Issue to discuss your proposal first. This is not required but can save time creating and reviewing changes.
For changes that address core functionality or would require breaking changes (e.g. a major release), it's best to open
an Issue to discuss your proposal first. This is not required but can save time creating and reviewing changes.

In general, we follow the ["fork-and-pull" Git workflow](https://github.com/susam/gitpr)

Expand All @@ -39,35 +51,18 @@ In general, we follow the ["fork-and-pull" Git workflow](https://github.com/susa

### Code Style

We maintain a consistent code style using Checkstyle.
We maintain a consistent code style using [Checkstyle](https://checkstyle.sourceforge.io/).

#### IntelliJ
The configuration file is defined in the `.checkstyle` folder.
To perform Checkstyle validation, run the following:

To ensure code style consistency in IntelliJ, follow these steps:
```bash
mvn checkstyle:check
```

1. Install the [CheckStyle-IDEA plugin](https://plugins.jetbrains.com/plugin/1065-checkstyle-idea).
2. Create a new CheckStyle configuration for Kstreamplify based on the code style configuration located in the `.checkstyle` folder. Configure it as follows:
Before you start contributing new code, it is recommended to:

![check_style.png](.readme%2Fcontributing%2Fcheck_style.png)
- Install the IntelliJ [CheckStyle-IDEA](https://plugins.jetbrains.com/plugin/1065-checkstyle-idea) plugin.
- Configure the plugin to use Kstreamplify's Checkstyle configuration file.

3. Enable the "Reformat code" and "Optimize imports" options in the save actions:

![save_actions.png](.readme%2Fcontributing%2Fsave_actions.png)

4. Reformat your code with the Checkstyle configuration:

![reformat_code.png](.readme%2Fcontributing%2Freformat_code.png)

5. Before committing your changes, ensure your contribution doesn't introduce any problems by running a scan:

![scan.png](.readme%2Fcontributing%2Fscan.png)

### Git Commit Messages

When contributing to the project, it's important to follow a consistent style for Git commit messages. Here are some guidelines to keep in mind:

- Use the present tense, such as "Add feature," rather than the past tense, such as "Added feature."
- Use the imperative mood, such as "Move cursor to..." rather than "Moves cursor to..."
- Limit the first line of the commit message to 72 characters or less.
- Use references to issues and pull requests after the first line as needed.
- If your commit only changes documentation, include `[ci skip]` in the commit title.
Adhering to this code style ensures consistency and helps maintain code quality throughout the project.
108 changes: 68 additions & 40 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -26,29 +26,31 @@ With Kstreamplify, you can declare your KafkaStreams class and define your topol
* [Spring Boot](#spring-boot)
* [Testing](#testing)
* [Getting Started](#getting-started)
* [Properties Injection](#properties-injection)
* [Avro Serializer and Deserializer](#avro-serializer-and-deserializer)
* [Error Handling](#error-handling)
* [Topology](#topology)
* [Production and Deserialization](#production-and-deserialization)
* [Avro Schema](#avro-schema)
* [REST Endpoints](#rest-endpoints)
* [Hooks](#hooks)
* [On Start](#on-start)
* [Interactive Queries](#interactive-queries)
* [Testing](#testing)
* [Properties Injection](#properties-injection)
* [Avro Serializer and Deserializer](#avro-serializer-and-deserializer)
* [Error Handling](#error-handling)
* [Topology](#topology)
* [Production and Deserialization](#production-and-deserialization)
* [Avro Schema](#avro-schema)
* [REST Endpoints](#rest-endpoints)
* [Hooks](#hooks)
* [On Start](#on-start)
* [Interactive Queries](#interactive-queries)
* [Testing](#testing)
* [Motivation](#motivation)
* [Contribution](#contribution)

## Features

- **Easy bootstrapping**: Kafka Streams application bootstrapping is handled for you, allowing you to focus on topology implementation.
- **Easy bootstrapping**: Kafka Streams application bootstrapping is handled for you, allowing you to focus on topology
implementation.

- **Avro Schema Serializer and Deserializer**: Common serializers and deserializers for all your Avro specific records.

- **Error Handling**: A strong error handling mechanism is provided for topology, production, and deserialization errors, and it also allows routing them into a dead letter queue (DLQ) topic.
- **Error Handling**: A strong error handling mechanism is provided for topology, production, and deserialization
errors, and it also allows routing them into a dead letter queue (DLQ) topic.

- **REST endpoints**: Some useful REST endpoints, including Kubernetes liveness and readiness probes.
- **REST endpoints**: Some useful REST endpoints, including Kubernetes liveness and readiness probes.

- **Testing**: The library eases the use of Topology Test Driver, making it easier to write your tests.

Expand Down Expand Up @@ -77,6 +79,7 @@ To include the core Kstreamplify library in your project, add the following depe
If you're using Spring Boot, you can integrate Kstreamplify with your Spring Boot application by adding the following dependency:

```xml

<dependency>
<groupId>com.michelin</groupId>
<artifactId>kstreamplify-spring-boot</artifactId>
Expand Down Expand Up @@ -140,7 +143,7 @@ Whenever you need to serialize or deserialize records with Avro schemas, you can
SerdesUtils.<MyAvroValue>getSerdesForValue()
```

or
or

```java
SerdesUtils.<MyAvroValue>getSerdesForKey()
Expand All @@ -152,23 +155,28 @@ Here's an example of using these methods in your topology:

### Error Handling

The library provides the ability to handle errors that may occur in your topology as well as during the production or deserialization of records and route them to a dead-letter queue (DLQ) topic.
The library provides the ability to handle errors that may occur in your topology as well as during the production or
deserialization of records and route them to a dead-letter queue (DLQ) topic.

To do this, the first step is to override the `dlqTopic` method and return the name of your DLQ topic:

![](.readme/gif/dlq.gif "DLQ topic gif")

#### Topology
#### Topology

Kstreamplify provides utilities to handle all the unexpected errors that can occur in your topologies and route them to a dead-letter queue (DLQ) topic automatically.
Kstreamplify provides utilities to handle all the unexpected errors that can occur in your topologies and route them to
a dead-letter queue (DLQ) topic automatically.

The principle is simple: whenever you perform transformations on stream values, you can encapsulate the result as either success or failure. Failed records will be routed to your DLQ topic, while successful records will still be up for further processing.
The principle is simple: whenever you perform transformations on stream values, you can encapsulate the result as either
success or failure. Failed records will be routed to your DLQ topic, while successful records will still be up for
further processing.

Here is a complete example of how to do this:

![](.readme/gif/full-topology.gif "Full topology example gif")

The first step is during the map values processing. The operation should return a new value of type `ProcessingResult<V, V2>`.
The first step is during the map values processing. The operation should return a new value of
type `ProcessingResult<V, V2>`.
The first templatized parameter is the type of the new value after a successful transformation.
The second templatized parameter is the type of the current value for which the transformation failed.

Expand All @@ -181,20 +189,25 @@ return ProcessingResult.success(value);
Or the following in a catch clause to mark the result as failed:

```java
return ProcessingResult.fail(ex, value, "An error occurred during the upper case map values process");
return ProcessingResult.fail(ex,value,"An error occurred during the upper case map values process");
```

The `ProcessingResult.fail()` method takes the exception, the record that failed and a custom error message.

The second step is sending the new stream of `ProcessingResult<V, V2>` to the `TopologyErrorHandler.catchErrors()` method, which will split the
The second step is sending the new stream of `ProcessingResult<V, V2>` to the `TopologyErrorHandler.catchErrors()`
method, which will split the
stream into two branches:
- The first branch will contain the `ProcessingError` and will be routed to the DLQ topic as a `KafkaError` Avro objects that contains
multiple useful information such as the topic, the partition, the offsets, the exception, and the custom error message of the failed record.

- The first branch will contain the `ProcessingError` and will be routed to the DLQ topic as a `KafkaError` Avro objects
that contains
multiple useful information such as the topic, the partition, the offsets, the exception, and the custom error message
of the failed record.
- The second branch will only contain the successful records and will be returned to continue the processing.

#### Production and Deserialization

The library provides handlers for production and deserialization errors, which can be used to route these errors to the configured DLQ topic.
The library provides handlers for production and deserialization errors, which can be used to route these errors to the
configured DLQ topic.

Here's how to use them:

Expand All @@ -209,7 +222,8 @@ kafka:

#### Avro Schema

An Avro schema needs to be deployed in a Schema Registry on top of the DLQ topic. It is available [here](https://github.com/michelin/kstreamplify/blob/main/kstreamplify-core/src/main/avro/kafka-error.avsc).
An Avro schema needs to be deployed in a Schema Registry on top of the DLQ topic. It is
available [here](https://github.com/michelin/kstreamplify/blob/main/kstreamplify-core/src/main/avro/kafka-error.avsc).

### REST endpoints

Expand All @@ -221,21 +235,26 @@ The Kstreamplify library provides several REST endpoints, which are listed below

### Hooks

Kstreamplify offers the flexibility to execute custom code through hooks. These hooks can be defined by overriding specific methods.
Kstreamplify offers the flexibility to execute custom code through hooks. These hooks can be defined by overriding
specific methods.

#### On Start

The `On Start` hook allows you to execute code right after the Kafka Streams instantiation. It provides the Kafka Streams instance as a parameter.
The `On Start` hook allows you to execute code right after the Kafka Streams instantiation. It provides the Kafka
Streams instance as a parameter.

![](.readme/gif/on-start.gif)

You can use this hook to perform any custom initialization or setup tasks for your Kafka Streams application.

### Interactive Queries

Kstreamplify is designed to make your Kafka Streams instance ready for [interactive queries](https://docs.confluent.io/platform/current/streams/developer-guide/interactive-queries.html), including support for RPC (Remote Procedure Call).
Kstreamplify is designed to make your Kafka Streams instance ready
for [interactive queries](https://docs.confluent.io/platform/current/streams/developer-guide/interactive-queries.html),
including support for RPC (Remote Procedure Call).

The `application.server` property, which should contain the host:port information, is automatically handled by Kstreamplify.
The `application.server` property, which should contain the host:port information, is automatically handled by
Kstreamplify.
The property can be loaded in three different ways.
By order of priority:

Expand All @@ -249,37 +268,46 @@ kafka:

Where `MY_APPLICATION_PORT_HOST` contains the host:port information.

- an environment variable named `MY_POD_IP`. This is particularly useful when loading host:port information from Kubernetes.
- an environment variable named `MY_POD_IP`. This is particularly useful when loading host:port information from
Kubernetes.

Here's an extract of a Kubernetes deployment which set the `MY_POD_IP` environment variable in a Kubernetes environment:

```yml
...
containers:
env:
- name: MY_POD_IP
valueFrom:
fieldRef:
fieldPath: status.podIP
- name: MY_POD_IP
valueFrom:
fieldRef:
fieldPath: status.podIP
...
```

- If neither the variable environment nor the `MY_POD_IP` environment variable is set, Kstreamplify sets `application.server` to the default value `localhost`.
- If neither the variable environment nor the `MY_POD_IP` environment variable is set, Kstreamplify
sets `application.server` to the default value `localhost`.

### Testing

For testing, you can create a test class that implements `KafkaStreamsStarterTest` and override the `topology` method. Then, apply the topology of your Kafka Streams on the given `streamsBuilders`.
For testing, you can create a test class that implements `KafkaStreamsStarterTest` and override the `topology` method.
Then, apply the topology of your Kafka Streams on the given `streamsBuilders`.

Here is an example:

![](.readme/gif/test.gif "Test class gif")

## Motivation

Developing applications with Kafka Streams can be challenging and often raises many questions for developers. It involves considerations such as efficient bootstrapping of Kafka Streams applications, handling unexpected business issues, and integrating Kubernetes probes, among others.
Developing applications with Kafka Streams can be challenging and often raises many questions for developers. It
involves considerations such as efficient bootstrapping of Kafka Streams applications, handling unexpected business
issues, and integrating Kubernetes probes, among others.

To assist developers in overcoming these challenges, we have built this library. Our aim is to provide a comprehensive solution that simplifies the development process and addresses common pain points encountered while working with Kafka Streams.
To assist developers in overcoming these challenges, we have built this library. Our aim is to provide a comprehensive
solution that simplifies the development process and addresses common pain points encountered while working with Kafka
Streams.

## Contribution

We welcome contributions from the community! Before you get started, please take a look at our [contribution guide](https://github.com/michelin/kstreamplify/blob/master/CONTRIBUTING.md) to learn about our guidelines and best practices. We appreciate your help in making Kstreamplify a better library for everyone.
We welcome contributions from the community! Before you get started, please take a look at
our [contribution guide](https://github.com/michelin/kstreamplify/blob/master/CONTRIBUTING.md) to learn about our
guidelines and best practices. We appreciate your help in making Kstreamplify a better library for everyone.