Job Engine Design

Purpose: The Job Engine is a coordinator which manage the tasks' life cycle and CRUD of the tasks. To be clearified, the Job Engine does not run the task itself, instead the task runs on hadoop(or whatever it depends).

Executable

Executable is a top-level interface for all kinds of jobs or tasks.

AbstractExecutable is a abstract implementation of Executable, it provides:

some getter and setter method
default implementation of Executable.execute()
life cycle method of an Executable and their default implementation

DefaultChainedExecutable is an implementation of AbstractExecutable which contains a group of Executable

ExecutableManager

ExecutableManager provide the CRUD function for an Executable

ExecutableDao

ExecutableDao provide the access of the persistent object for Executable

There are two persistent object for one Executable

ExecutablePO is to store the runnning parameters for the Executable, and once the Executable is submitted, ExecutablePO is unmodifiable.
ExecutableOutputPO is to store the running result for the Executable, for instance the current state, error log.

DefaultScheduler

DefaultScheduler is a coordinator for Executables.

There is a daemon thread call JobFetcher running periodically. It is responsible for scheduling the Executables

Note: there should always be only one instance running in the cluster. And it is configured using "kylin.server.mode" in the "kylin.properties", there are two modes "all" & "query", "all" means it will defaultly start the scheduler. So if there are multiple kylin instances, make sure there is only one instance whose "kylin.server.mode" is set to "all".

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

Design.md

Design.md

Job Engine Design

Executable

ExecutableManager

ExecutableDao

DefaultScheduler

Files

Design.md

Latest commit

History

Design.md

File metadata and controls

Job Engine Design

Executable

ExecutableManager

ExecutableDao

DefaultScheduler