Skip to content
This repository has been archived by the owner on Mar 12, 2023. It is now read-only.
Nitin Motgi edited this page Jul 30, 2017 · 29 revisions

YARE - Yet Another Rules Engine is a powerful, horizontally scalable, lightweight inference based Business Rules Engine for big data processing with ability to run on Spark, Hadoop, Amazon EMR, Azure HDInsight and GCE. The Rules are composed through a easy to read when ... then semantics understandable by business users.

The Business Rule Engine is an add-on capability of CDAP right next to the CDAP Pipelines. It enables users to see, understand, edit and manage the business rules in their data processing pipelines without writing any code.

A Business Rule is a statement that describes a business policy or procedure to process data. Actions describes the sequence of operations that is associated with data when a business rule is fired.

YARE includes

YARE gives the power back to Business users in big data environment to make decisions on their data, enforce policies, and to be an integral part of the data ingestion process - ETL. With YARE, business users can write, manage, deploy, execute and monitor business data transformation and policy enforcements.

In order to facilitate business users YARE provides :

  • Business Rule Repository -- A dataset for storing the business rules as defined by the business users
  • Business Rule Editor -- An intuitive user interface that allows business users to define, edit and manage business rules.
  • Rule Execution Core -- Transform plugin for enforcing business rules in the data processing pipelines.

Benefits of YARE

  • Non-Programmers who want to analyze big data
    • Declarative language support, easy to under for non-programmers.
    • Easy-to-verify Rules.
    • They serve as documentation for policies and transformations.
    • Don't need to write code.
    • Use the provided user interface to write rules and manage them.
  • One-time infrastructure setup
    • Rules are plug and play.
  • Centralization of knowledge
    • Allows you to create a knowledge base that is executable in your big data environment.
    • Provides single-point of truth for all policies and transformation needs.
  • Logic and Data Separation
    • Allows frequent change requests, one can add new rules without having to modify the existing rules.
    • Data can reside on your Hadoop cluster, Cloud Storage or even traditional databases, rules for transforming them are separated.
    • Enables easy re-use of rules across Big Data and Traditional Data stores.
  • Speed and Scalability
    • Processing large dataset.
    • Scaling horizontally as you cluster or data grows.
    • Support Realtime and Batch workloads.
  • Tool Integration
    • Available as a library to integrate with JBoss, WebLogic, Spring
    • Integrate with SQL tools through CDAP provided JDBC driver with SQL clients.

How is YARE different ?

  • YARE is natively built for Spark, Hadoop and EMR
  • YARE scales horizontally as your environment for processing scales
  • YARE is not memory intensive,
  • YARE handles complex and complicated data decision logic through Wrangler DSL
  • YARE conditions are simple to write and follow JEXL syntax for expressing expressions.
  • YARE can be applied on aggregates and as well on row-by-row transformations