DC Data Server - Elastic/Ruby

This is a Proof of Concept data server to run the modified stock example.

Quick start

This has been developed using Elasticsearch 7.10. Install Elasticsearch and update the URL in src/common.rb. This sample is developed using blacktop/elasticsearch:7.10 docker image.
This has been developed using Ruby 2.7. However, it should work with any new ruby.
To install all required gems, in this folder.

$ bundle install

To create an index in Elastic and to index all data needed for the stocks example:

# To create the index
# caution: it deletes previous index
$ bundle exec ruby src/create_index.rb

# Import data
$ bundle exec ruby src/data_import.rb

To run the (Puma) server

$ bundle exec rackup -p 3030 rackup.ru

Making it work for you

Index Structure

You will need to define fields in the index. Though not necessary, it is recommended that you explicitly define types for all fields. While an index has no restriction on filed types, only simple types - numeric, date, and keyword - are supported as dimensions. Crossfilter allows computed values as dimensions. To achieve a similar feature, you will need to create a field and compute it while indexing.

The file src/create_index.rb may be used as a sample. If you are new to Elastic, you may consider an Index as a Table in a database.

See the following to learn more:

Change the name of the index as per your preference. You will need to use the same index name in src/conf.rb.

Supported data types:

Elastic Type	Use as Dimension	Simple Filters	Range Filters
Keyword	Yes	Yes	No
Integer	Yes	Yes	Yes
Decimal	Yes	No	Yes
Date	Yes	Yes	Yes

To create the index:

# caution: it deletes previous index
$ bundle exec ruby src/create_index.rb

Import data

Usually, your application will update data in the Elastic index.

The sample src/data_import.rb shows how to import data from csv files. It uses CSV from the Ruby standard library. There is a similar JSON library.

Before importing the data, each field will need to be converted to their specific types as defined the index mapping. Pre-computing synthetic fields is going to help by simplifying and speeding up queries. You must pre-compute synthetic fields that are going to be used as dimensions.

The sample uses Elastic bulk APIs to index. See: https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-bulk.html.

# Import data
$ bundle exec ruby src/data_import.rb

Dimensions and Groups

See file src/conf.rb.

Any of the supported filed types can be used as dimension. Equivalent to the CrossFilter default Group, when no special aggregation is configured, the number of matching rows (documents in Elastic terminology) will the value,

Elastic has quite rich support for creating aggregations. This sample uses the Term aggregation combined with Pipeline aggregations:

src/conf.rb shows a variety of aggregation types.

The example also shows how to associate multiple groups to one dimension.

Production use

Though it has missing features, the implementation is high quality.

The indexing uses bulk APIs.
At the time of query - all queries are sent as a single call using bulk API.
If a dimension is used by multiple groups, these are combined as a single set of aggregations.
The code is multi-thread safe. Entire shared data is immutable.
This code can be adapted to any other Ruby-based app servers.

Missing features

Filter types other than simple and range.
Ability to get rows (for data table, data grid). In roadmap
Composite keys are not supported for dimension.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
data		data
src		src
.gitignore		.gitignore
.prettierrc.json		.prettierrc.json
.streerc		.streerc
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
README.md		README.md
rackup.ru		rackup.ru

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DC Data Server - Elastic/Ruby

Quick start

Making it work for you

Index Structure

Import data

Dimensions and Groups

Production use

Missing features

About

Releases

Packages

Languages

kum-deepak/elastic-dc

Folders and files

Latest commit

History

Repository files navigation

DC Data Server - Elastic/Ruby

Quick start

Making it work for you

Index Structure

Import data

Dimensions and Groups

Production use

Missing features

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages