From c658a069c4d3f830cd0dcdaf4391c42e9361c337 Mon Sep 17 00:00:00 2001 From: Lukas Rothenberger Date: Wed, 10 Jan 2024 09:19:57 +0100 Subject: [PATCH] doc(wiki): parallel patterns --- .../Patterns/Pipeline.md | 4 +- docs/data/Parallel_patterns.md | 8 - docs/data/Parallel_patterns/Patterns.md | 9 ++ docs/data/Parallel_patterns/doall.md | 78 ++++++++++ .../geometric_decomposition.md | 118 +++++++++++++++ docs/data/Parallel_patterns/pipeline.md | 143 ++++++++++++++++++ docs/data/Parallel_patterns/reduction.md | 71 +++++++++ docs/data/Parallel_patterns/tasks.md | 11 ++ 8 files changed, 431 insertions(+), 11 deletions(-) delete mode 100644 docs/data/Parallel_patterns.md create mode 100644 docs/data/Parallel_patterns/Patterns.md create mode 100644 docs/data/Parallel_patterns/doall.md create mode 100644 docs/data/Parallel_patterns/geometric_decomposition.md create mode 100644 docs/data/Parallel_patterns/pipeline.md create mode 100644 docs/data/Parallel_patterns/reduction.md create mode 100644 docs/data/Parallel_patterns/tasks.md diff --git a/DEPRECATED_DOCS/DEPRECATED_Pattern_Detection/Patterns/Pipeline.md b/DEPRECATED_DOCS/DEPRECATED_Pattern_Detection/Patterns/Pipeline.md index 0e3bf6356..08cc36684 100644 --- a/DEPRECATED_DOCS/DEPRECATED_Pattern_Detection/Patterns/Pipeline.md +++ b/DEPRECATED_DOCS/DEPRECATED_Pattern_Detection/Patterns/Pipeline.md @@ -94,9 +94,7 @@ Stages: Node: 1:13 Start line: 1:4 End line: 1:4 - pragma: "#pragma omp task" - first private: ['i'] - private: [] + shared: ['d', 'in'] reduction: [] InDeps: [] diff --git a/docs/data/Parallel_patterns.md b/docs/data/Parallel_patterns.md deleted file mode 100644 index 13127f47e..000000000 --- a/docs/data/Parallel_patterns.md +++ /dev/null @@ -1,8 +0,0 @@ ---- -layout: default -title: Parallel patterns -parent: Data -nav_order: 2 ---- - -TODO \ No newline at end of file diff --git a/docs/data/Parallel_patterns/Patterns.md b/docs/data/Parallel_patterns/Patterns.md new file mode 100644 index 000000000..9fb3c8a18 --- /dev/null +++ b/docs/data/Parallel_patterns/Patterns.md @@ -0,0 +1,9 @@ +--- +layout: default +title: Parallel patterns +parent: Data +has_children: true +permalink: /Data/Patterns +--- + +# Parallel Patterns diff --git a/docs/data/Parallel_patterns/doall.md b/docs/data/Parallel_patterns/doall.md new file mode 100644 index 000000000..84340ab83 --- /dev/null +++ b/docs/data/Parallel_patterns/doall.md @@ -0,0 +1,78 @@ +--- +layout: default +title: Do-All +has_children: true +parent: Parallel Patterns +grand_parent: Data +nav_order: 1 +--- + + +# Do-All Loop + +## Reporting +Do-All Loops are reported in the following format: +``` +Do-all at: 1:2 +Start line: 1:7 +End line: 1:9 +pragma: "#pragma omp parallel for" +private: [] +shared: [] +first private: [] +reduction: [] +last private: [] +``` + +## Interpretation +The reported values shall be interpreted as follows: +* `Do-all at: :`, where the respective parent file can be looked up in the `FileMapping.txt` using `file_id` and `cu_id` can be used for a look up in `Data.xml` +* `Start line: :`, where `line_num` refers to the source code line of the parallelizable loop. +* `End line: :`, where `line_num` refers to the last line of the parallelizable loop. + +* `pragma:`shows which type of OpenMP pragma shall be inserted before the target loop in order to parallelize it. +* `private: []` lists a set of variables which have been identified as thread-`private` +* The same interpretation applies to the following values aswell: + * `shared` + * `first_private` + * `last_private` +* `reduction: [:]` specifies a set of identified reduction operations and variables. For `Do-All` suggestions, this list is always empty. + +## Implementation +In order to implement a suggestion, first open the source code file corresponding to `file_id` and navigate to line `Start line -> `. +Insert `pragma` before the loop begins. +In order to ensure a valid parallelization, you need to add the following clauses to the OpenMP pragma, if the respective lists are not empty: +* `private` -> clause: `private()` +* `shared` -> clause: `shared()` +* `first_private` -> clause: `firstprivate()` +* `last_private` -> clause: `lastprivate()` +* `reduction`-> clause: `reduction(:)` + +### Example +As an example, we will analyze the following code snippet for parallelization potential. All location and meta data will be ignored for the sake of simplicity. + + for (int i = 0; i < 10; ++i) { + local_array[i] += 1; + } + +Analyzing this code snippet results in the following parallelization suggestion: + + pragma: "#pragma omp parallel for" + private: ["i"] + shared: ["local_array"] + first private: [] + reduction: [] + last private: [] + + +After interpreting and implementing the suggestion, the resulting, now parallel, source code could look as follows: + + #pragma omp parallel for private(i) shared(local_array) + for (int i = 0; i < 10; ++i) { + local_array[i] += 1; + } diff --git a/docs/data/Parallel_patterns/geometric_decomposition.md b/docs/data/Parallel_patterns/geometric_decomposition.md new file mode 100644 index 000000000..c52ca7ee8 --- /dev/null +++ b/docs/data/Parallel_patterns/geometric_decomposition.md @@ -0,0 +1,118 @@ +--- +layout: default +title: Geometric Decomposition +has_children: true +parent: Parallel patterns +grand_parent: Data +nav_order: 2 +--- + + +# Geometric Decomposition + +## Reporting +Possible geometric decompositions are reported in the following format: +``` +Geometric decomposition at: 1:9 +Start line: 1:26 +End line: 1:36 +Do-All loops: ['1:11'] +Reduction loops: [] + Number of tasks: 24 + Chunk limits: 1000 + pragma: for (i = 0; i < num-tasks; i++) #pragma omp task] + private: [] + shared: [] + first private: ['i'] + reduction: [] + last private: [] +``` + +## Interpretation +The reported values shall be interpreted as follows: +* `Geometric decomposition at: :`, where the respective parent file can be looked up in the `FileMapping.txt` using `file_id` and `cu_id` can be used for a look up in `Data.xml` +* `Start line: :`, where `line_num` refers to the first source code line of the potential geometrically decomposable code. +* `End line: :`, where `line_num` refers to the last line of the suggested pattern. +* `Do-All loops: [:]` specifies which [Do-all loops](Do-All.md) can be part of the geometric decomposition. +* `Reduction loops: [:]` specifies which [Reduction loops](Reduction.md) can be part of the geometric decomposition. +* `Number of tasks: ` specifies the number of tasks which should or can be spawned in order to process the geometric decomposition. +* `Chunk limits: ` determine the size of a workload package (amount of iterations) for each individual spawned task. +* `private, shared, first_private` and `last_private` indicate variables which should be mentioned within the respective OpenMP data sharing clauses. +* `reduction: [:]` specifies a set of identified reduction operations and variables. + + +## Implementation +In order to implement a geometric decomposition, first open the source code file corresponding to `file_id` and navigate to line `Start line -> `. +Insert `pragma` before each of the loops mentioned in `Do-all loops` and `Reduction loops`. Make sure to replace `num-tasks` with the specified `Number of tasks`, or insert a respective variable into the source code. +Modify the loop conditions of the original source code in order to allow a geometric decomposition. Each task should be responsible for processing a chunk of the size `Chunk limits`. +In order to ensure a valid parallelization, you need to add the following clauses to the OpenMP pragma, if the respective lists are not empty: +* `private` -> clause: `private()` +* `shared` -> clause: `shared()` +* `first_private` -> clause: `firstprivate()` +* `last_private` -> clause: `lastprivate()` +* `reduction`-> clause: `reduction(:)` + +### Example +As an example, we will analyze the following code snippet for parallelization potential. Some location and meta data will be ignored for the sake of simplicity. + + int main( void) + { + int i; + int d=20,a=22, b=44,c=90; + for (i=0; i<100; i++) { + a = foo(i, d); + b = bar(a, d); + c = delta(b, d); + } + a = b; + return 0; + } + +Analyzing this code snippet results in the following geometric decomposition suggestion: +``` +Geometric decomposition at: 1:1 +Start line: 1:2 +End line: 1:12 +Type: Geometric Decomposition Pattern +Do-All loops: ['1:3'] // line 5 +Reduction loops: [] + Number of tasks: 10 + Chunk limits: 10 + pragma: for (i = 0; i < num-tasks; i++) #pragma omp task] + private: [] + shared: [] + first private: ['i'] + reduction: [] + last private: [] +``` + +After interpreting and implementing the suggestion, the resulting, now parallel, source code could look as follows. +Since `i` has been used in the original source code already, the inserted `pragma` uses `x` instead. +As a last modification, the loop conditions in the original source code need to be modified slightly in order to allow the decomposition. +For a simpler interpretation of the example we have added the `chunk_size` and `tid` variables. +Note: Since the geometric decomposition relies on the identification of the thread number, the outermost `for` loop should be located inside a `parallel region`. However, depending on the specific analyzed source code, a surrounding `parallel region` might already exist or a different location for the surrounding `parallel region` may be more beneficial. + + int main( void) + { + int i; + int d=20,a=22, b=44,c=90; + + #pragma omp parallel + #pragma omp single + for (int x = 0; x < 10; x++ ) { + #pragma omp task + { + int tid = omp_get_thread_num(); + int chunk_size = 10; // value of Chunk limits + + for (i = tid*chunk_size; i < tid*chunk_size + chunk_size; i++) { + a = foo(i, d); + b = bar(a, d); + c = delta(b, d); + } + } + } + + a = b; + return 0; + } diff --git a/docs/data/Parallel_patterns/pipeline.md b/docs/data/Parallel_patterns/pipeline.md new file mode 100644 index 000000000..492500c3e --- /dev/null +++ b/docs/data/Parallel_patterns/pipeline.md @@ -0,0 +1,143 @@ +--- +layout: default +title: Pipeline +has_children: true +parent: Parallel patterns +grand_parent: Data +nav_order: 3 +--- + + +# Pipeline + +## Reporting + +### Pipelines +Pipelines are reported in the following format: +``` +Pipeline at: 1:11 +Start line: 1:30 +End line: 1:34 +Stages: + + + + + ... +``` +The reported values shall be interpreted as follows: +* `Pipeline at: :`, where the respective parent file can be looked up in the `FileMapping.txt` using `file_id` and `cu_id` can be used for a look up in `Data.xml` +* `Start line: :`, where `line_num` refers to the first source code line of the identified pipeline. +* `End line: :`, where `line_num` refers to the last line of the pipeline loop. +* `Stages` defines a list of stages contained in the identified pipeline. The specific format of the stages is described in the following. + +### Pipeline Stages +Individual stages of a pipeline are reported in the following format: +``` +Node: 1:13 +Start line: 1:31 +End line: 1:31 +pragma: "#pragma omp task" +first private: ['i'] +private: [] +shared: ['d', 'in'] +reduction: [] +InDeps: [] +OutDeps: ['a'] +InOutDeps: [] +``` + +The reported values shall be interpreted as follows: +* `Node: :`, where the respective parent file can be looked up in the `FileMapping.txt` using `file_id` and `cu_id` can be used for a look up in `Data.xml` +* `Start line: :`, where `line_num` refers to the first source code line of the identified pipeline stage. +* `End line: :`, where `line_num` refers to the last line of the stage. +* `pragma:`shows which type of OpenMP pragma shall be inserted before the `start line`. +* `private: []` lists a set of variables which have been identified as thread-`private` +* The same interpretation applies to the following values aswell: + * `shared` + * `first_private` +* `reduction: [:]` specifies a set of identified reduction operations and variables. +* `InDeps: []` specifies `in`-dependencies according to the [OpenMP depend clause](https://www.openmp.org/spec-html/5.0/openmpsu99.html). +* `OutDeps: []` specifies `out`-dependencies according to the [OpenMP depend clause](https://www.openmp.org/spec-html/5.0/openmpsu99.html). +* `InOutDeps: []` specifies `inout`-dependencies according to the [OpenMP depend clause](https://www.openmp.org/spec-html/5.0/openmpsu99.html). + + +## Implementation +In order to implement a suggested pipeline, first navigate to the source code location specified by `Pipeline at:`. +For each individual stage the following OpenMP pragmas and closes need to be added to the source code, if the respective lists are not empty: +* Insert `pragma` prior to the `start line` mentioned by the stage. +* If `private` is not empty, add the clause `private()`, where vars are separated by commas to the pragma. +* Do the same for: + * `shared` -> clause: `shared()` + * `first_private` -> clause: `firstprivate()` + * `reduction`-> clause: `reduction(:)` + * `InDeps` -> clause: `depend(in:)` + * `OutDeps` -> clause: `depend(out:)` + * `InOutDeps` -> clause: `depend(inout:)` + + +### Example +As an example, we will analyze the following code snippet for parallelization potential. Some location and meta data will be ignored for the sake of simplicity. + + int i; + int d=20,a=22, b=44,c=90; + for (i=0; i<100; i++) { + a = foo(i, d); + b = bar(a, d); + c = delta(b, d); + } + a = b; + +Analyzing this code snippet results in the following parallelization suggestion: +``` +Pipeline at: +Start line: 1:3 +End line: 1:7 +Stages: +Node: 1:13 + Start line: 1:4 + End line: 1:4 + + shared: ['d', 'in'] + reduction: [] + InDeps: [] + OutDeps: ['a'] + InOutDeps: [] + + Start line: 1:5 + End line: 1:5 + pragma: "#pragma omp task" + first private: [] + private: [] + shared: ['d', 'in'] + reduction: [] + InDeps: ['a'] + OutDeps: ['b'] + InOutDeps: [] + + Start line: 1:6 + End line: 1:7 + pragma: "#pragma omp task" + first private: [] + private: ['c'] + shared: ['d', 'in'] + reduction: [] + InDeps: ['b'] + OutDeps: [] + InOutDeps: [] +``` + +After interpreting and implementing the suggestion, the resulting, now parallel, source code could look as follows: + + int i; + int d=20,a=22, b=44,c=90; + for (i=0; i<100; i++) { + #pragma omp task firsprivate(i) shared(d, in) depend(out:a) + a = foo(i, d); + #pragma omp task shared(d, in) depend(in:a) depend(out:b) + b = bar(a, d); + #pragma omp task private(c) shared(d, in) depend(in: b) + c = delta(b, d); + } + a = b; + diff --git a/docs/data/Parallel_patterns/reduction.md b/docs/data/Parallel_patterns/reduction.md new file mode 100644 index 000000000..cda9dc003 --- /dev/null +++ b/docs/data/Parallel_patterns/reduction.md @@ -0,0 +1,71 @@ +--- +layout: default +title: Reduction +has_children: true +parent: Parallel patterns +grand_parent: Data +nav_order: 4 +--- + +# Reduction Loop + +## Reporting +Reduction Loops are reported in the following format: +``` +Reduction at: 1:2 +Start line: 1:7 +End line: 1:9 +pragma: "#pragma omp parallel for" +private: [] +shared: [] +first private: [] +reduction: [] +last private: [] +``` + +## Interpretation +The reported values shall be interpreted as follows: +* `Reduction at: :`, where the respective parent file can be looked up in the `FileMapping.txt` using `file_id` and `cu_id` can be used for a look up in `Data.xml` +* `Start line: :`, where `line_num` refers to the source code line of the parallelizable loop. +* `End line: :`, where `line_num` refers to the last line of the parallelizable loop. +* `pragma:`shows which type of OpenMP pragma shall be inserted before the target loop in order to parallelize it. +* `private: []` lists a set of variables which have been identified as thread-`private` +* The same interpretation applies to the following values aswell: + * `shared` + * `first_private` + * `last_private` +* `reduction: [:]` specifies a set of identified reduction operations and variables. + +## Implementation +In order to implement a suggestion, first open the source code file corresponding to `file_id` and navigate to line `Start line -> `. +Insert `pragma` before the loop begins. +In order to ensure a valid parallelization, you need to add the following clauses to the OpenMP pragma, if the respective lists are not empty: +* `private` -> clause: `private()` +* `shared` -> clause: `shared()` +* `first_private` -> clause: `firstprivate()` +* `last_private` -> clause: `lastprivate()` +* `reduction`-> clause: `reduction(:)` + +### Example +As an example, we will analyze the following code snippet for parallelization potential. All location and meta data will be ignored for the sake of simplicity. + + for (int i = 0; i < N; i++) { + local_var *= global_array[i]; + } + +Analyzing this code snippet results in the following parallelization suggestion: +``` +pragma: "#pragma omp parallel for" +private: ["i"] +shared: [] +first private: ["global_array"] +reduction: ["*:local_var"] +last private: [] +``` + +After interpreting and implementing the suggestion, the resulting, now parallel, source code could look as follows: + + #pragma omp parallel for private(i) firstprivate(global_array) reduction(*:local_var) + for (int i = 0; i < N; i++) { + local_var *= global_array[i]; + } diff --git a/docs/data/Parallel_patterns/tasks.md b/docs/data/Parallel_patterns/tasks.md new file mode 100644 index 000000000..58f32f435 --- /dev/null +++ b/docs/data/Parallel_patterns/tasks.md @@ -0,0 +1,11 @@ +--- +layout: default +title: Tasks +has_children: true +parent: Parallel patterns +grand_parent: Data +nav_order: 5 +--- + +# Task Parallelism +Currently, the task parallelism detection is not supported, but it will be re-included in the near future.