This example project demonstrates a generic IoT use case for ScyllaDB in Java. The documentation for this application and guided exercise is here.
The application allows tracking of pets health indicators and consist of three parts:
- migrate (
com.carepet.Migrate
) - creates thecarepet
keyspace and tables - collar (
com.carepet.Sensor
) - generates a pet health data and pushes it into the storage - web app (
com.carepet.server.App
) - REST API service for tracking pets health state
Prerequisites:
- JDK at least OpenJDK 8
- maven
- docker
- docker-compose
To run a local ScyllaDB cluster consisting of three nodes with
the help of docker
and docker-compose
execute:
$ docker-compose up -d
Docker-compose will spin up three nodes: carepet-scylla1
, carepet-scylla2
and carepet-scylla3
. You can access them with the docker
command.
To execute CQLSH:
$ docker exec -it carepet-scylla1 cqlsh
To execute nodetool:
$ docker exec -it carepet-scylla1 nodetool status
Shell:
$ docker exec -it carepet-scylla1 shell
You can inspect any node by means of the docker inspect
command
as follows. for example:
$ docker inspect carepet-scylla1
To get node IP address run:
$ docker inspect -f '{{range .NetworkSettings.Networks}}{{.IPAddress}}{{end}}' carepet-scylla1
To initialize database execute:
$ mvn package
$ NODE1=$(docker inspect -f '{{range .NetworkSettings.Networks}}{{.IPAddress}}{{end}}' carepet-scylla1)
$ ./bin/migrate.sh --hosts $NODE1 --datacenter datacenter1
Expected output:
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/home/sitano/.m2/repository/org/slf4j/slf4j-simple/1.7.26/slf4j-simple-1.7.26.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/home/sitano/.m2/repository/ch/qos/logback/logback-classic/1.2.3/logback-classic-1.2.3.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.SimpleLoggerFactory]
[main] INFO com.carepet.Migrate - creating keyspace...
Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.api.core.session.SessionBuilder - Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.internal.core.DefaultMavenCoordinates - DataStax Java driver for Apache Cassandra(R) (com.scylladb:java-driver-core) version 4.8.0-scylla-0
[s0-admin-0] INFO com.datastax.oss.driver.internal.core.time.Clock - Using native clock for microsecond precision
[main] INFO com.carepet.Migrate - creating table...
Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.api.core.session.SessionBuilder - Using Scylla optimized driver!!!
[s1-admin-0] INFO com.datastax.oss.driver.internal.core.time.Clock - Using native clock for microsecond precision
Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.api.core.session.SessionBuilder - Using Scylla optimized driver!!!
[s2-admin-0] INFO com.datastax.oss.driver.internal.core.time.Clock - Using native clock for microsecond precision
Keyspace: carepet; Table: measurement
Keyspace: carepet; Table: owner
Keyspace: carepet; Table: pet
Keyspace: carepet; Table: sensor
Keyspace: carepet; Table: sensor_avg
You can check the database structure with:
$ docker exec -it carepet-scylla1 cqlsh
cqlsh> DESCRIBE KEYSPACES
carepet system_schema system_auth system system_distributed system_traces
cqlsh> USE carepet;
cqlsh:carepet> DESCRIBE TABLES
pet sensor_avg gocqlx_migrate measurement owner sensor
cqlsh:carepet> DESCRIBE TABLE pet
CREATE TABLE carepet.pet (
owner_id uuid,
pet_id uuid,
chip_id text,
species text,
breed text,
color text,
gender text,
address text,
age int,
name text,
weight float,
PRIMARY KEY (owner_id, pet_id)
) WITH CLUSTERING ORDER BY (pet_id ASC)
AND bloom_filter_fp_chance = 0.01
AND caching = {'keys': 'ALL', 'rows_per_partition': 'ALL'}
AND comment = ''
AND compaction = {'class': 'SizeTieredCompactionStrategy'}
AND compression = {'sstable_compression': 'org.apache.cassandra.io.compress.LZ4Compressor'}
AND crc_check_chance = 1.0
AND dclocal_read_repair_chance = 0.1
AND default_time_to_live = 0
AND gc_grace_seconds = 864000
AND max_index_interval = 2048
AND memtable_flush_period_in_ms = 0
AND min_index_interval = 128
AND read_repair_chance = 0.0
AND speculative_retry = '99.0PERCENTILE';
cqlsh:carepet> exit
To start pet collar simulation execute the following in the separate terminal:
$ mvn package
$ NODE1=$(docker inspect -f '{{range .NetworkSettings.Networks}}{{.IPAddress}}{{end}}' carepet-scylla1)
$ ./bin/sensor.sh --hosts $NODE1 --datacenter datacenter1 --measure PT1M --buffer-interval PT1M
Expected output:
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/home/sitano/.m2/repository/org/slf4j/slf4j-simple/1.7.26/slf4j-simple-1.7.26.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/home/sitano/.m2/repository/ch/qos/logback/logback-classic/1.2.3/logback-classic-1.2.3.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.SimpleLoggerFactory]
Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.api.core.session.SessionBuilder - Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.internal.core.DefaultMavenCoordinates - DataStax Java driver for Apache Cassandra(R) (com.scylladb:java-driver-core) version 4.8.0-scylla-0
[s0-admin-0] INFO com.datastax.oss.driver.internal.core.time.Clock - Using native clock for microsecond precision
[main] INFO com.carepet.Migrate - owner = Owner{ownerId=0701da30-52f0-4ee4-911c-f9ac951bf3b1, name='OBahu5A3', address='5UtxnIxqfa'}
[main] INFO com.carepet.Migrate - pet = Pet{ownerId=0701da30-52f0-4ee4-911c-f9ac951bf3b1, petId=765ac83f-9744-450b-a4ec-9d40359edeae, chipId='', species='', breed='', color='', gender='', age=55, weight=7.937521, address='home', name='0YaLHRJq'}
[main] INFO com.carepet.Migrate - sensor = Sensor{petId=765ac83f-9744-450b-a4ec-9d40359edeae, sensorId=25ce6029-20a2-4532-a154-d82ea8da800d, type='R'}
[main] INFO com.carepet.Migrate - sensor = Sensor{petId=765ac83f-9744-450b-a4ec-9d40359edeae, sensorId=14a94142-12e8-414f-861e-15571d087c41, type='P'}
[main] INFO com.carepet.Migrate - sensor = Sensor{petId=765ac83f-9744-450b-a4ec-9d40359edeae, sensorId=630b2f96-bd0c-43c1-93a7-7750229f6da8, type='R'}
[main] INFO com.carepet.Migrate - sensor = Sensor{petId=765ac83f-9744-450b-a4ec-9d40359edeae, sensorId=06ffd4ba-21e6-4973-bda9-e0346c48334c, type='R'}
Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.api.core.session.SessionBuilder - Using Scylla optimized driver!!!
[s1-admin-0] INFO com.datastax.oss.driver.internal.core.time.Clock - Using native clock for microsecond precision
[main] INFO com.carepet.Migrate - Measure{sensorId=25ce6029-20a2-4532-a154-d82ea8da800d, ts=2020-09-11T12:47:26.807Z, value=34.0}
[main] INFO com.carepet.Migrate - Measure{sensorId=14a94142-12e8-414f-861e-15571d087c41, ts=2020-09-11T12:47:26.809Z, value=84.0}
[main] INFO com.carepet.Migrate - Measure{sensorId=630b2f96-bd0c-43c1-93a7-7750229f6da8, ts=2020-09-11T12:47:26.809Z, value=35.0}
[main] INFO com.carepet.Migrate - Measure{sensorId=06ffd4ba-21e6-4973-bda9-e0346c48334c, ts=2020-09-11T12:47:26.809Z, value=37.0}
[main] INFO com.carepet.Migrate - pushing data
...
Write down the pet Owner ID (ID is something after the #
sign without trailing spaces).
To start REST API service execute the following in the separate terminal:
$ mvn package
$ NODE1=$(docker inspect -f '{{range .NetworkSettings.Networks}}{{.IPAddress}}{{end}}' carepet-scylla1)
$ ./bin/server.sh --hosts $NODE1 --datacenter datacenter1
Expected output:
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/home/sitano/.m2/repository/org/slf4j/slf4j-simple/1.7.26/slf4j-simple-1.7.26.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/home/sitano/.m2/repository/ch/qos/logback/logback-classic/1.2.3/logback-classic-1.2.3.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.SimpleLoggerFactory]
Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.api.core.session.SessionBuilder - Using Scylla optimized driver!!!
[main] INFO com.datastax.oss.driver.internal.core.DefaultMavenCoordinates - DataStax Java driver for Apache Cassandra(R) (com.scylladb:java-driver-core) version 4.8.0-scylla-0
[s0-admin-0] INFO com.datastax.oss.driver.internal.core.time.Clock - Using native clock for microsecond precision
[main] INFO io.micronaut.runtime.Micronaut - Startup completed in 684ms. Server Running: http://localhost:8080
Now you can open http://127.0.0.1:8080/
in the browser or send an HTTP request from the CLI:
$ curl -v http://127.0.0.1:8080/
Expected output:
* Trying 127.0.0.1:8080...
* Connected to localhost (127.0.0.1) port 8080 (#0)
> GET / HTTP/1.1
> Host: localhost:8080
> User-Agent: curl/7.72.0
> Accept: */*
>
* Mark bundle as not supporting multiuse
< HTTP/1.1 404 Not Found
< Date: Fri, 11 Sep 2020 12:50:12 GMT
< content-type: application/json
< content-length: 77
< connection: close
<
* Closing connection 0
{"message":"Page Not Found","_links":{"self":{"href":"/","templated":false}}}
This is ok. If you see this JSON in the end with 404, it means everything works as expected.
To read an owner data you can use saved owner_id
as follows:
$ curl http://127.0.0.1:8080/api/owner/{owner_id}
For example:
$ curl http://127.0.0.1:8080/api/owner/0701da30-52f0-4ee4-911c-f9ac951bf3b1
Expected result:
{"name":"OBahu5A3","address":"5UtxnIxqfa","owner_id":"0701da30-52f0-4ee4-911c-f9ac951bf3b1"}
To list the owners pets use:
$ curl http://127.0.0.1:8080/api/owner/{owner_id}/pets
For example:
$ curl http://127.0.0.1:8080/api/owner/0701da30-52f0-4ee4-911c-f9ac951bf3b1/pets
Expected output:
[{"age":55,"weight":7.937521,"address":"home","name":"0YaLHRJq","owner_id":"0701da30-52f0-4ee4-911c-f9ac951bf3b1","pet_id":"765ac83f-9744-450b-a4ec-9d40359edeae"}]
To list pet's sensors use:
$ curl http://127.0.0.1:8080/api/pet/{pet_id}/sensors
For example:
$ curl http://127.0.0.1:8080/api/pet/765ac83f-9744-450b-a4ec-9d40359edeae/sensors
[{"type":"R","pet_id":"765ac83f-9744-450b-a4ec-9d40359edeae","sensor_id":"06ffd4ba-21e6-4973-bda9-e0346c48334c"},{"type":"P","pet_id":"765ac83f-9744-450b-a4ec-9d40359edeae","sensor_id":"14a94142-12e8-414f-861e-15571d087c41"},{"type":"R","pet_id":"765ac83f-9744-450b-a4ec-9d40359edeae","sensor_id":"25ce6029-20a2-4532-a154-d82ea8da800d"},{"type":"R","pet_id":"765ac83f-9744-450b-a4ec-9d40359edeae","sensor_id":"630b2f96-bd0c-43c1-93a7-7750229f6da8"}]
To review the pet's sensors data use:
$ curl http://127.0.0.1:8080/api/sensor/{sensor_id}/values?from=2006-01-02T15:04:05Z07:00&to=2006-01-02T15:04:05Z07:00
For example:
$ curl http://localhost:8080/api/sensor/06ffd4ba-21e6-4973-bda9-e0346c48334c/values\?from\="2020-09-11T00:00:00Z"\&to\="2020-09-11T23:59:59Z"
Expected output:
[37.0, ...]
To read the pet's daily average per sensor use:
$ curl http://127.0.0.1:8080/api/sensor/{sensor_id}/values/day/{date}
For example:
$ curl http://localhost:8080/api/sensor/06ffd4ba-21e6-4973-bda9-e0346c48334c/values/day/2020-09-11
Expected output:
[0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,37.0]
Package structure is as follows:
Name | Purpose |
---|---|
/src/main/java/com/carepet | java application source code |
com.carepet.model | database models, both ORM and REST |
com.carepet.Migrate | install database schema |
com.carepet.Sensor | simulate pet collar |
com.carepet.server.App | web application backend |
com.carepet.Config | database configuration |
See annotations to com.carepet.server.ModelController
. Application
uses the Micronaut
web framework to serve REST. It's possible to
expose auto-generated OpenAPI spec.
Collars are small devices that attach to pets and collect data with the help of different sensors. After the data is collected it may be delivered to the central database for the analysis and health status checking.
Collar code sits in the com.carepet.Sensor
and uses scylladb/java-driver
Java driver to connect to the database directly and publish its data.
Collar gathers sensors measurements, aggregates data in a buffer and
sends it every hour.
Overall all applications in this repository use scylladb/java-driver
for:
- Connect to the database
- Map objects (ORM) and generate DAO layer
- Build Queries
The web application REST API server resides in com.carepet.server
and uses
micronaut
that supports OpenAPI to expose its API. The API
handlers reside in com.carepet.server.ModelController
.
Most of the queries are reads.
The application is capable of caching sensor measurements data
on hourly basis. It uses lazy evaluation to manage sensor_avg
.
It can be viewed as an application-level lazy-evaluated
materialized view.
The algorithm is simple and resides in com.carepet.server.ModelController.avg()
:
- read
sensor_avg
- if no data, read
measurement
data, aggregate in memory, save - serve request
Pet --> Sensor --> ScyllaDB <-> REST API Server <-> User
Install JDK >= 8 and Maven. Create a repository. Clone it. Execute inside of your repository:
$ mvn archetype:generate -DgroupId=com.mycompany.app -DartifactId=my-app -DarchetypeArtifactId=maven-archetype-quickstart -DarchetypeVersion=1.4 -DinteractiveMode=false
Now when you have your pom module add ScyllaDB driver as a dependency with:
<dependencies>
<dependency>
<groupId>com.scylladb</groupId>
<artifactId>java-driver-core</artifactId>
<version>4.8.0-scylla-0</version>
</dependency>
</dependencies>
Now your pom.xml
will be looking something like this:
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>com.mycompany.app</groupId>
<artifactId>my-app</artifactId>
<version>1.0-SNAPSHOT</version>
<name>my-app</name>
<!-- FIXME change it to the project's website -->
<url>http://www.example.com</url>
<properties>
<project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
<maven.compiler.source>1.7</maven.compiler.source>
<maven.compiler.target>1.7</maven.compiler.target>
</properties>
<dependencies>
<dependency>
<groupId>com.scylladb</groupId>
<artifactId>java-driver-core</artifactId>
<version>4.8.0-scylla-0</version>
</dependency>
...
</dependencies>
<build>
<pluginManagement><!-- lock down plugins versions to avoid using Maven defaults (may be moved to parent pom) -->
...
</pluginManagement>
</build>
</project>
Now you are ready to connect to the database and start working. To connect to the database, do the following:
import com.datastax.oss.driver.api.core.CqlSession;
import com.datastax.oss.driver.api.core.CqlSessionBuilder;
class Example {
public static void main(String []args) {
CqlSessionBuilder builder = CqlSession.builder()
.withApplicationName(applicationName)
.withClientId(clientId);
CqlSession session = builder.build();
}
}
If you want to use authentication it can be done with:
import com.datastax.oss.driver.api.core.CqlSession;
import com.datastax.oss.driver.api.core.CqlSessionBuilder;
class Example {
public static void main(String []args) {
CqlSession session = CqlSession.builder()
.withAuthCredentials("username", "password")
.build();
}
}
Local endpoints also require specifying local datacenter:
import com.datastax.oss.driver.api.core.CqlSession;
import com.datastax.oss.driver.api.core.CqlSessionBuilder;
import java.net.InetSocketAddress;
class Example {
public static void main(String []args) {
CqlSession session = CqlSession.builder()
.addContactPoints({new InetSocketAddress("127.0.0.1", 9042)})
.withLocalDatacenter("datacenter1")
.build();
}
}
Now you can issue CQL commands with:
import com.datastax.oss.driver.api.core.CqlSession;
import com.datastax.oss.driver.api.core.CqlSessionBuilder;
import com.datastax.oss.driver.api.core.cql.PreparedStatement;import com.datastax.oss.driver.api.core.cql.ResultSet;
import com.datastax.oss.driver.api.core.cql.Row;
class Example {
public static void main(String []args) {
CqlSession session = CqlSession.builder().build();
session.execute("INSERT INTO table VALUE(1, 2, 3)");
// or
PreparedStatement statement = session.prepare("INSERT INTO table VALUE(?, ?, ?)");
session.execute(statement.bind(1, 2, 3));
// or
ResultSet s = session.execute("SELECT * FROM table");
for (Row r: s) {
// r.get()
}
}
}
You can use query builder with the help of:
<dependency>
<groupId>com.scylladb</groupId>
<artifactId>java-driver-query-builder</artifactId>
<version>4.8.0-scylla-0</version>
</dependency>
To get:
Statement stmt =
selectFrom("examples", "querybuilder_json")
.json()
.all()
.whereColumn("id")
.isEqualTo(literal(1))
.build();
To use object-data mapping (ORM) include:
<dependency>
<groupId>com.scylladb</groupId>
<artifactId>java-driver-mapper-runtime</artifactId>
<version>4.8.0-scylla-0</version>
</dependency>
Add annotation processing:
<build>
<plugins>
<plugin>
<artifactId>maven-compiler-plugin</artifactId>
<version>3.8.0</version>
<configuration>
<annotationProcessorPaths>
<path>
<groupId>com.scylladb</groupId>
<artifactId>java-driver-mapper-processor</artifactId>
<version>4.8.0-scylla-0</version>
</path>
</annotationProcessorPaths>
<compilerArgs>
<arg>-Aproject=${project.groupId}/${project.artifactId}</arg>
</compilerArgs>
</configuration>
</plugin>
</plugins>
</build>
Create a mapper:
import com.datastax.oss.driver.api.core.CqlSession;
import com.datastax.oss.driver.api.mapper.annotations.DaoFactory;
@com.datastax.oss.driver.api.mapper.annotations.Mapper
public interface Mapper {
static com.datastax.oss.driver.api.mapper.MapperBuilder<Mapper> builder(CqlSession session) {
return new MapperBuilder(session);
}
}
You can create DAO factory per item type or one for all types:
import com.datastax.oss.driver.api.mapper.annotations.Dao;
import com.datastax.oss.driver.api.mapper.annotations.Select;
import java.util.UUID;
@Dao
public interface DAO {
/** Simple selection by full primary key. */
@Select
Owner getOwner(UUID id);
}
Generate the source with:
$ mvn compile
Add DAO factory:
import com.datastax.oss.driver.api.core.CqlSession;
import com.datastax.oss.driver.api.mapper.annotations.DaoFactory;
@com.datastax.oss.driver.api.mapper.annotations.Mapper
public interface Mapper {
@DaoFactory
DAO dao();
static com.datastax.oss.driver.api.mapper.MapperBuilder<Mapper> builder(CqlSession session) {
return new MapperBuilder(session);
}
}
You are ready to go.
For more details, check out implementation.
You have a few options here:
- pack all of them into a single jar
- copy all of them into the build folder and include into classpath
Let's take a look at path 2:
<plugins>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-dependency-plugin</artifactId>
<version>3.1.2</version>
<executions>
<!-- copy dependencies -->
<execution>
<id>copy-dependencies</id>
<phase>generate-sources</phase>
<goals>
<goal>copy-dependencies</goal>
</goals>
<configuration>
<overWriteReleases>false</overWriteReleases>
<overWriteSnapshots>false</overWriteSnapshots>
<overWriteIfNewer>true</overWriteIfNewer>
</configuration>
</execution>
<!-- build class path -->
<execution>
<id>build-classpath</id>
<phase>generate-sources</phase>
<goals>
<goal>build-classpath</goal>
</goals>
<configuration>
<outputFile>${project.build.directory}/dependencies</outputFile>
</configuration>
</execution>
</executions>
</plugin>
</plugins>
This will copy all the dependencies to the target/dependency
folder and create
a dependencies
file that can be included into the -classpath
definition:
$ mvn package
$ java -classpath ./target/sample-1.0-SNAPSHOT.jar:$(cat ./target/dependencies) com.project.App