Add dimensionality check through a Newton runtime #590

blackgeorge-boom · 2022-04-15T18:43:55Z

This is a draft PR aiming to show a simple dimensionality check at runtime.

Closes #586.

Aims to handle the problem of how to check the dimensionality of array positions, where the index of the array is a variable (whose value is unknown at compile time).

…se of variable index array. Addresses #586.

Addresses #586.

… with instructions to run it. Addresses #586.

Addresses #586.

This can be useful for the runtime dimensionality check, where we will need to allocate memory at runtime from the examined program, in order to keep the Newton type information in the heap. Addresses #586.

blackgeorge-boom · 2022-06-06T10:52:05Z

Currently, I'm stuck at the following. We need a way to keep at runtime the physics types of the intermediate results as they're calculated on the fly. For example:

	distance	x = 5;		/* meters  */
	speed		v = 1;		/* m/s     */

	state[0] = x;
	state[1] = v;

	for (int i = 0; i < 2; i++)
	{
		state[0] = state[i] + v*dt;
	}

For i = 0 this is valid, but for i = 1 it is not.

Currently, it should be straightforward, during compilation, to serialize the physics information into a JSON file:

{
  "state" : ["distance", "speed"],
  "dt" : "time",
  "x" : "distance",
  "v": "speed",
  "v*dt" : ?
}

The last element is intentionally left empty.

And let's assume in runtime we can deserialize this information, by inserting calls to the compiled binary that will be using the runtime utilities for the deserialization (this is not trivial though).

My question is: how to keep the physics type of the intermediate result v * dt?

At compile time, everything is a virtual register, so this problem is easy, regardless of whether a live value will end up in a register or in memory.

@KomaGR If I understood correctly, you had mentioned an idea of mapping memory addresses to physics, however, what if the result lies in a register and not in memory?

KomaGR · 2022-06-06T14:19:27Z

I am a bit hesitant with the JSON format for this problem specifically. Parsing JSON is not trivial and could considerably increase the binary size of the runtime. If it's easy to integrate, it would be OK to rely on JSON format as a starting point and converge to something specific for our needs later.

@KomaGR If I understood correctly, you had mentioned an idea of mapping memory addresses to physics, however, what if the result lies in a register and not in memory?

I have to admit that I am not familiar with the different scenarios we could end up with when using various levels of optimization flags (e.g., O2/O3). We're talking about is a dynamic type system runtime for type-checking, which as a concept is not new and I believe we may find material on how to do such checks (if we decide that they are in scope for the project).

E.g., Python uses this kind of typing (although with a huge amount of overhead):

>>> a = 1
>>> b = "42"
>>> type(a)
<class 'int'>
>>> type(b)
<class 'str'>
>>> a+b
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: unsupported operand type(s) for +: 'int' and 'str'
>>> if a == 1:
...     c = 2
... else:
...     c = "0"
... 
>>> a+c
3
>>> b+c
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: can only concatenate str (not "int") to str

In our case what is I think new is that we're using dynamic typing for physical dimensions using an under-the-hood solution. I.e., without using a library (see Boost.Units).

blackgeorge-boom · 2022-06-06T14:48:14Z

I agree that we are trying to implement runtime support for dynamic typing for physics types.

In our case what is I think new is that we're using dynamic typing for physical dimensions using an under-the-hood solution. I.e., without using a library (see Boost.Units).

Yes. We should actually make sure that we have some novelty here, compared to Boost.

Addresses #586.

…ormation. Addresses #586.

…ng-compiler into issue-586 Addresses #586.

applications/newton/llvm-ir/array-element-physics/Makefile

src/newton/newton-irPass-LLVMIR-dimension-check.cpp

Addresses #586.

blackgeorge-boom · 2022-07-01T16:33:47Z

After discussing with @KomaGR, I will try to do the following:

When encountering the alloca in LLVM IR, generate a call to __newtonInsert.
Take the return value of the runtime call and associate it with the alloca.
Whenever loading from the alloca, associate the new value with the "id" value of the alloca.

KomaGR · 2022-07-01T16:56:49Z

I will try to provide runtime support for handling the dimension propagation for multiplication and division operations.

Addresses #586.

When encountering a variable definition, we insert a call to malloc in order to allocate an array-state for the variable. Then, we add a call to `newtonInsert` where we send the array as an argument for the runtime to record and we get back a unique identifier. This will be used as a future reference for our variable's dimension array. Addresses #586.

Initially, it covers simple arithmetic operations between variables of primitive types. Addresses #586.

Currently, each array access generates a separate virtual register in LLVM. Therefore, for every access we allocate a separate dimension array and get a new identifier back from the runtime call. We need to find a way to use the same identifier for the same array accesses. Addresses #586.

KomaGR · 2023-02-24T16:07:19Z

@blackgeorge-boom What is the status regarding this pull request. I know we're probably not moving forward with the runtime proposal for now. Does this pull request implement the static compile-time dimensionality check (even if not capable of catching some dimensionality errors)?

blackgeorge-boom added 5 commits April 15, 2022 19:35

Dimensionality check pass inserts calls to runtime function in the ca…

aa7effb

…se of variable index array. Addresses #586.

Dimensionality check pass dumps transformed IR code to temporary file.

cdd42b8

Addresses #586.

Added example with simple dimensionality check hook at runtime, along…

9f08cd5

… with instructions to run it. Addresses #586.

Added simple runtime check function for dimensionality check.

d18786e

Addresses #586.

Added simple example as target for dimensionality check.

d67b56e

Addresses #586.

blackgeorge-boom requested a review from KomaGR April 15, 2022 18:43

blackgeorge-boom added 3 commits May 13, 2022 18:02

Added automatic path creation for the transformed IR file in the output.

6f365cd

Addresses #586.

Instrumented the IR by adding simple call to malloc.

9cbcc33

This can be useful for the runtime dimensionality check, where we will need to allocate memory at runtime from the examined program, in order to keep the Newton type information in the heap. Addresses #586.

Merge branch 'master' into issue-586

80e2b80

blackgeorge-boom added 4 commits June 6, 2022 17:44

Added function to check if a PhysicsInfo is composite.

8226496

Addresses #586.

Refactored output file name definition for modified IR.

71fcc75

Addresses #586.

Added JSON serializing mechanism for source variable physics type inf…

746951e

…ormation. Addresses #586.

Merge branch 'issue-586' of github.com:phillipstanleymarbell/Noisy-la…

8fd2e2d

…ng-compiler into issue-586 Addresses #586.

KomaGR reviewed Jun 18, 2022

View reviewed changes

applications/newton/llvm-ir/array-element-physics/Makefile Outdated Show resolved Hide resolved

src/newton/newton-irPass-LLVMIR-dimension-check.cpp Outdated Show resolved Hide resolved

Draft runtime

b5b18fa

Addresses #586.

blackgeorge-boom and others added 10 commits July 8, 2022 13:12

Removed version from llvm-dis tool.

279754a

Addresses #586.

Converted some PhysicsType methods in camelCase.

13e23fa

Addresses #586.

Removed redundant LLVM Context variable.

de736fb

Addresses #586.

Fixed camelCase.

987e231

Addresses #586.

Added call to newton runtime initialization.

fb5a085

Addresses #586.

Add functions for products and quotients

47ba21f

Addresses #586.

Renamed argument types for instrumentation.

3003b87

Addresses #586.

Added instrumentation for the runtime function newtonCheckDimensions.

9194fcd

Initially, it covers simple arithmetic operations between variables of primitive types. Addresses #586.

KomaGR changed the base branch from master to issue-644 February 24, 2023 15:54

KomaGR added the Prosecco label Feb 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add dimensionality check through a Newton runtime #590

Add dimensionality check through a Newton runtime #590

blackgeorge-boom commented Apr 15, 2022

blackgeorge-boom commented Jun 6, 2022

KomaGR commented Jun 6, 2022

blackgeorge-boom commented Jun 6, 2022

blackgeorge-boom commented Jul 1, 2022

KomaGR commented Jul 1, 2022

KomaGR commented Feb 24, 2023

Add dimensionality check through a Newton runtime #590

Are you sure you want to change the base?

Add dimensionality check through a Newton runtime #590

Conversation

blackgeorge-boom commented Apr 15, 2022

blackgeorge-boom commented Jun 6, 2022

KomaGR commented Jun 6, 2022

blackgeorge-boom commented Jun 6, 2022

blackgeorge-boom commented Jul 1, 2022

KomaGR commented Jul 1, 2022

KomaGR commented Feb 24, 2023