From 853adfd53acbe73ed5cd4061bb9683039b4042a6 Mon Sep 17 00:00:00 2001
From: Volker Stampa <Volker.Stampa@aleph-alpha.com>
Date: Tue, 20 Feb 2024 17:32:05 +0100
Subject: [PATCH] WIP: Draft of concepts doc

---
 Concepts.md                        | 120 +++++++++++++++++++++++++++++
 assets/RecursiveSummary.drawio.svg |   4 +
 2 files changed, 124 insertions(+)
 create mode 100644 Concepts.md
 create mode 100644 assets/RecursiveSummary.drawio.svg
diff --git a/Concepts.md b/Concepts.md
new file mode 100644
index 000000000..3e0ccccb0
--- /dev/null
+++ b/Concepts.md
@@ -0,0 +1,120 @@
+# Concepts
+
+## Task
+
+At the heart of the Intelligence Layer is a `Task`. A Task is actually a pretty generic concept that just
+transforms an input-parameter to an output like a function in mathematics.
+
+```
+Task: Input -> Output
+```
+
+In Python this is expressed through an abstract class with type-parameters and the abstract method `do_run`
+where the actual transformation is implemented:
+
+```Python
+class Task(ABC, Generic[Input, Output]):
+
+    @abstractmethod
+    def do_run(self, input: Input, task_span: TaskSpan) -> Output:
+        ...
+```
+
+`Input` and `Output` are normal Python datatypes that can be serialized from and to JSON. For this the Intelligence
+Layer relies on [Pydantic](https://docs.pydantic.dev/). The types that can actually be used are defined in form
+of the type-alias [`PydanticSerializable`](src/intelligence_layer/core/tracer.py#L44).
+
+The second parameter `task_span` is used for [tracing](#Trace) which is described below.
+
+`do_run` is the method that needs to be implemented for a concrete Task. The external interface of a
+Task is its `run` method:
+
+```Python
+class Task(ABC, Generic[Input, Output]):
+    @final
+    def run(self, input: Input, tracer: Tracer, trace_id: Optional[str] = None) -> Output:
+      ...
+```
+
+Its signature differs only in the parameters regarding [tracing](#Trace).
+
+### Levels of abstraction
+
+Even though the concept is so generic the main purpose for a Task is of course to make use of an LLM for the
+transformation. Tasks are defined at different levels of abstraction. There are higher level Tasks (also called Use Cases)
+that reflect a typical user problem and there are lower level Tasks that are more about interfacing
+with an LLM on a very generic or even technical level.
+
+Examples for higher level tasks (Use Cases) are:
+
+- Answering a question based on a gievn document: `QA: (Document, Question) -> Answer`
+- Generate a summary of a given document: `Summary: Document -> Summary`
+
+Examples for lower level tasks are:
+
+- Let the model generate text based on an instruacton and some context: `Instruct: (Context, Instruction) -> Completion`
+- Chunk a text in smaller pieces at optimized boundaries (typically to make it fit into an LLM's context-size): `Chunk: Text -> [Chunk]`
+
+### Composability
+
+Tasks compose. Typically you would build higher level tasks from lower level tasks. Given a task you can draw a dependency graph
+that illustrates which sub-tasks it is using and in turn which sub-tasks they are using. This graph typically forms a hierarchy or
+more general a directed acyclic graph. The following drawing shows this graph for the Intelligence Layer's `RecursiveSummarize`
+Task:
+
+<img src="./assets/RecursiveSummary.drawio.svg">
+
+
+### Trace
+
+A Task implements a workflow. It processes its input, passes it on to sub-tasks, processes the outputs of sub-tasks
+to build its own output. This workflow can be represented in a trace. For this a Task's `run` method takes a `Tracer`
+that takes care of storing details on the steps of this workflow like the tasks that have been invoked along with their
+input and output and timing information. For this the tracing defines the following concepts:
+
+- A `Tracer` is passed to a Task's `run` method and provides methods for opening `Span`s or `TaskSpan`s.
+- A `Span` allows for grouping multiple logs and duration together as a single, logical step in the
+  workflow.
+- A `TaskSpan` allows for grouping multiple logs together, as well as the task's specific input, output,
+  and duration.
+
+Each of these concepts is implemented in form of an abstract base class and the Intelligence Layer provides
+several implementations:
+
+- The `NoOpTracer` can be used when tracing information shall not be stored at all.
+
+## Evaluation
+
+### Dataset
+
+- List of examples (`Input`)
+
+### Run
+
+- Compute `Output`s for Dataset
+
+### Evaluate
+
+- Evaluate a single run to create an results that can be compared
+- Compare multiple runs with a single evaluation (e.g. ELO)
+
+### Aggregate
+
+- Aggregate results from a single evaluation
+- Aggregate results from multiple compare-evaluations to complete comparison
+
+### Data Storage
+
+- DatasetRepository
+- RunRepository
+- EvaluationRepository
+- AggregationRepository
+
+
+explainability:
+- debug loglevel explain (full prompt vs focus (RAG)) (prompt whisper)
+- eval: unexpected result: explain for input (aggregate)
+  - run explain only on "failed"
+
+Run:
+- scheduled
diff --git a/assets/RecursiveSummary.drawio.svg b/assets/RecursiveSummary.drawio.svg
new file mode 100644
index 000000000..c062ff126
--- /dev/null
+++ b/assets/RecursiveSummary.drawio.svg
@@ -0,0 +1,4 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!-- Do not edit this file with editors other than draw.io -->
+<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN" "http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
+<svg xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" version="1.1" width="451px" height="501px" viewBox="-0.5 -0.5 451 501" content="&lt;mxfile host=&quot;Electron&quot; modified=&quot;2024-02-14T14:30:24.389Z&quot; agent=&quot;Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) draw.io/23.0.2 Chrome/120.0.6099.109 Electron/28.1.0 Safari/537.36&quot; etag=&quot;GyFu_av88SBxKN1HSNOe&quot; version=&quot;23.0.2&quot; type=&quot;device&quot;&gt;&#10;  &lt;diagram name=&quot;Page-1&quot; id=&quot;r06lxv6mwxcB28TaKnRd&quot;&gt;&#10;    &lt;mxGraphModel dx=&quot;1114&quot; dy=&quot;999&quot; grid=&quot;1&quot; gridSize=&quot;10&quot; guides=&quot;1&quot; tooltips=&quot;1&quot; connect=&quot;1&quot; arrows=&quot;1&quot; fold=&quot;1&quot; page=&quot;1&quot; pageScale=&quot;1&quot; pageWidth=&quot;850&quot; pageHeight=&quot;1100&quot; math=&quot;0&quot; shadow=&quot;0&quot;&gt;&#10;      &lt;root&gt;&#10;        &lt;mxCell id=&quot;0&quot; /&gt;&#10;        &lt;mxCell id=&quot;1&quot; parent=&quot;0&quot; /&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-7&quot; value=&quot;&quot; style=&quot;edgeStyle=orthogonalEdgeStyle;rounded=0;orthogonalLoop=1;jettySize=auto;html=1;&quot; edge=&quot;1&quot; parent=&quot;1&quot; source=&quot;DCfif1_LMO6Rl-M8gUsG-1&quot; target=&quot;DCfif1_LMO6Rl-M8gUsG-2&quot;&gt;&#10;          &lt;mxGeometry relative=&quot;1&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-1&quot; value=&quot;RecursiveSummarize&quot; style=&quot;rounded=0;whiteSpace=wrap;html=1;&quot; vertex=&quot;1&quot; parent=&quot;1&quot;&gt;&#10;          &lt;mxGeometry x=&quot;170&quot; y=&quot;30&quot; width=&quot;190&quot; height=&quot;60&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-8&quot; style=&quot;edgeStyle=orthogonalEdgeStyle;rounded=0;orthogonalLoop=1;jettySize=auto;html=1;&quot; edge=&quot;1&quot; parent=&quot;1&quot; source=&quot;DCfif1_LMO6Rl-M8gUsG-2&quot; target=&quot;DCfif1_LMO6Rl-M8gUsG-3&quot;&gt;&#10;          &lt;mxGeometry relative=&quot;1&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-9&quot; style=&quot;edgeStyle=orthogonalEdgeStyle;rounded=0;orthogonalLoop=1;jettySize=auto;html=1;&quot; edge=&quot;1&quot; parent=&quot;1&quot; source=&quot;DCfif1_LMO6Rl-M8gUsG-2&quot; target=&quot;DCfif1_LMO6Rl-M8gUsG-4&quot;&gt;&#10;          &lt;mxGeometry relative=&quot;1&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-2&quot; value=&quot;SteerableLongContextSummarize&quot; style=&quot;rounded=0;whiteSpace=wrap;html=1;&quot; vertex=&quot;1&quot; parent=&quot;1&quot;&gt;&#10;          &lt;mxGeometry x=&quot;170&quot; y=&quot;140&quot; width=&quot;190&quot; height=&quot;60&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-10&quot; value=&quot;&quot; style=&quot;edgeStyle=orthogonalEdgeStyle;rounded=0;orthogonalLoop=1;jettySize=auto;html=1;&quot; edge=&quot;1&quot; parent=&quot;1&quot; source=&quot;DCfif1_LMO6Rl-M8gUsG-3&quot; target=&quot;DCfif1_LMO6Rl-M8gUsG-5&quot;&gt;&#10;          &lt;mxGeometry relative=&quot;1&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-3&quot; value=&quot;SteerableSingleChunkSummarize&quot; style=&quot;rounded=0;whiteSpace=wrap;html=1;&quot; vertex=&quot;1&quot; parent=&quot;1&quot;&gt;&#10;          &lt;mxGeometry x=&quot;40&quot; y=&quot;250&quot; width=&quot;190&quot; height=&quot;60&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-4&quot; value=&quot;Chunk&quot; style=&quot;rounded=0;whiteSpace=wrap;html=1;&quot; vertex=&quot;1&quot; parent=&quot;1&quot;&gt;&#10;          &lt;mxGeometry x=&quot;300&quot; y=&quot;250&quot; width=&quot;190&quot; height=&quot;60&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-11&quot; value=&quot;&quot; style=&quot;edgeStyle=orthogonalEdgeStyle;rounded=0;orthogonalLoop=1;jettySize=auto;html=1;&quot; edge=&quot;1&quot; parent=&quot;1&quot; source=&quot;DCfif1_LMO6Rl-M8gUsG-5&quot; target=&quot;DCfif1_LMO6Rl-M8gUsG-6&quot;&gt;&#10;          &lt;mxGeometry relative=&quot;1&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-5&quot; value=&quot;Instruct&quot; style=&quot;rounded=0;whiteSpace=wrap;html=1;&quot; vertex=&quot;1&quot; parent=&quot;1&quot;&gt;&#10;          &lt;mxGeometry x=&quot;40&quot; y=&quot;360&quot; width=&quot;190&quot; height=&quot;60&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;        &lt;mxCell id=&quot;DCfif1_LMO6Rl-M8gUsG-6&quot; value=&quot;Complete&quot; style=&quot;rounded=0;whiteSpace=wrap;html=1;&quot; vertex=&quot;1&quot; parent=&quot;1&quot;&gt;&#10;          &lt;mxGeometry x=&quot;40&quot; y=&quot;470&quot; width=&quot;190&quot; height=&quot;60&quot; as=&quot;geometry&quot; /&gt;&#10;        &lt;/mxCell&gt;&#10;      &lt;/root&gt;&#10;    &lt;/mxGraphModel&gt;&#10;  &lt;/diagram&gt;&#10;&lt;/mxfile&gt;&#10;"><defs/><g><path d="M 225 60 L 225 103.63" fill="none" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 225 108.88 L 221.5 101.88 L 225 103.63 L 228.5 101.88 Z" fill="rgb(0, 0, 0)" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="all"/><rect x="130" y="0" width="190" height="60" fill="rgb(255, 255, 255)" stroke="rgb(0, 0, 0)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility" style="overflow: visible; text-align: left;"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 188px; height: 1px; padding-top: 30px; margin-left: 131px;"><div data-drawio-colors="color: rgb(0, 0, 0); " style="box-sizing: border-box; font-size: 0px; text-align: center;"><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: rgb(0, 0, 0); line-height: 1.2; pointer-events: all; white-space: normal; overflow-wrap: normal;">RecursiveSummarize</div></div></div></foreignObject><text x="225" y="34" fill="rgb(0, 0, 0)" font-family="Helvetica" font-size="12px" text-anchor="middle">RecursiveSummarize</text></switch></g><path d="M 225 170 L 225 195 L 95 195 L 95 213.63" fill="none" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 95 218.88 L 91.5 211.88 L 95 213.63 L 98.5 211.88 Z" fill="rgb(0, 0, 0)" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="all"/><path d="M 225 170 L 225 195 L 355 195 L 355 213.63" fill="none" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 355 218.88 L 351.5 211.88 L 355 213.63 L 358.5 211.88 Z" fill="rgb(0, 0, 0)" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="all"/><rect x="130" y="110" width="190" height="60" fill="rgb(255, 255, 255)" stroke="rgb(0, 0, 0)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility" style="overflow: visible; text-align: left;"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 188px; height: 1px; padding-top: 140px; margin-left: 131px;"><div data-drawio-colors="color: rgb(0, 0, 0); " style="box-sizing: border-box; font-size: 0px; text-align: center;"><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: rgb(0, 0, 0); line-height: 1.2; pointer-events: all; white-space: normal; overflow-wrap: normal;">SteerableLongContextSummarize</div></div></div></foreignObject><text x="225" y="144" fill="rgb(0, 0, 0)" font-family="Helvetica" font-size="12px" text-anchor="middle">SteerableLongContextSummarize</text></switch></g><path d="M 95 280 L 95 323.63" fill="none" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 95 328.88 L 91.5 321.88 L 95 323.63 L 98.5 321.88 Z" fill="rgb(0, 0, 0)" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="all"/><rect x="0" y="220" width="190" height="60" fill="rgb(255, 255, 255)" stroke="rgb(0, 0, 0)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility" style="overflow: visible; text-align: left;"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 188px; height: 1px; padding-top: 250px; margin-left: 1px;"><div data-drawio-colors="color: rgb(0, 0, 0); " style="box-sizing: border-box; font-size: 0px; text-align: center;"><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: rgb(0, 0, 0); line-height: 1.2; pointer-events: all; white-space: normal; overflow-wrap: normal;">SteerableSingleChunkSummarize</div></div></div></foreignObject><text x="95" y="254" fill="rgb(0, 0, 0)" font-family="Helvetica" font-size="12px" text-anchor="middle">SteerableSingleChunkSummarize</text></switch></g><rect x="260" y="220" width="190" height="60" fill="rgb(255, 255, 255)" stroke="rgb(0, 0, 0)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility" style="overflow: visible; text-align: left;"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 188px; height: 1px; padding-top: 250px; margin-left: 261px;"><div data-drawio-colors="color: rgb(0, 0, 0); " style="box-sizing: border-box; font-size: 0px; text-align: center;"><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: rgb(0, 0, 0); line-height: 1.2; pointer-events: all; white-space: normal; overflow-wrap: normal;">Chunk</div></div></div></foreignObject><text x="355" y="254" fill="rgb(0, 0, 0)" font-family="Helvetica" font-size="12px" text-anchor="middle">Chunk</text></switch></g><path d="M 95 390 L 95 433.63" fill="none" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 95 438.88 L 91.5 431.88 L 95 433.63 L 98.5 431.88 Z" fill="rgb(0, 0, 0)" stroke="rgb(0, 0, 0)" stroke-miterlimit="10" pointer-events="all"/><rect x="0" y="330" width="190" height="60" fill="rgb(255, 255, 255)" stroke="rgb(0, 0, 0)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility" style="overflow: visible; text-align: left;"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 188px; height: 1px; padding-top: 360px; margin-left: 1px;"><div data-drawio-colors="color: rgb(0, 0, 0); " style="box-sizing: border-box; font-size: 0px; text-align: center;"><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: rgb(0, 0, 0); line-height: 1.2; pointer-events: all; white-space: normal; overflow-wrap: normal;">Instruct</div></div></div></foreignObject><text x="95" y="364" fill="rgb(0, 0, 0)" font-family="Helvetica" font-size="12px" text-anchor="middle">Instruct</text></switch></g><rect x="0" y="440" width="190" height="60" fill="rgb(255, 255, 255)" stroke="rgb(0, 0, 0)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility" style="overflow: visible; text-align: left;"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 188px; height: 1px; padding-top: 470px; margin-left: 1px;"><div data-drawio-colors="color: rgb(0, 0, 0); " style="box-sizing: border-box; font-size: 0px; text-align: center;"><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: rgb(0, 0, 0); line-height: 1.2; pointer-events: all; white-space: normal; overflow-wrap: normal;">Complete</div></div></div></foreignObject><text x="95" y="474" fill="rgb(0, 0, 0)" font-family="Helvetica" font-size="12px" text-anchor="middle">Complete</text></switch></g></g><switch><g requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"/><a transform="translate(0,-5)" xlink:href="https://www.drawio.com/doc/faq/svg-export-text-problems" target="_blank"><text text-anchor="middle" font-size="10px" x="50%" y="100%">Text is not SVG - cannot display</text></a></switch></svg>