java-gpu

Support for offloading parallel-for loops in Java to NVIDIA CUDA compatible cards.

By introducing a @Parallel annotation, loops can be annotated as parallel. An extra compilation can then offload these to NVIDIA CUDA compatible graphics cards. Data transfers are handled automatically, and no CUDA C code has to be written explicitly.

A full dissertation about this is available: * Peter Calvert: Parallelisation of Java for Graphics Processors. Part II Dissertation, Computer Science Tripos, University of Cambridge June 2010.

AMD's APARAPI project attempts to achieve very similar goals to this project, and is still being maintained (so is more likely to work across a range of systems). The key difference is that they introduce kernel classes rather than operating on for loops, and have slightly different restrictions on the code that can be placed within GPU sections. It is possible that some of the techniques used here could be incorporated into APARAPI.

Compiling

Once you've downloaded a source package (or checked out code from this repo)

You can then perform a build using Maven by performing mvn clean package from the top level directory.

This places everything you should need in the target folder.

In order for the Java-GPU compiler to find your CUDA and JDK installations you should also have the CUDA_HOME and JDK_HOME environment variables set. For example export CUDA_HOME=/usr/local/cuda/ export JDK_HOME=/usr/lib/jvm/java-6-sun-1.6.0.20/

For example

export CUDA_HOME=/usr/local/cuda/
export JDK_HOME=/usr/lib/jvm/....

Sample Usage

To compile the Mandelbrot set sample, you can simply perform:

java -jar java-gpu-1.0.0.jar org.java.gpu.samples.Mandelbrot

from within the target directory. This will produce two files (on Linux):

org/java/gpu/samples/Mandelbrot.class libparallel.so

You should then by able to run the sample using:

java -cp .:java-gpu-1.0.0.jar org.java.gpu.samples.Mandelbrot 2000 2000 out.png

If you have problems that are related to the compilation for your GPU, then the options passed to NVCC are specified in src/main/java/org/java/gpu/cuda/CUDAExporter.java.

Example Code

Just as an example, the Java source code for the above Mandelbrot example is as follows.

Note the @Parallel annotation that indicates the region for CUDA processing:

@Parallel(loops = {"y", "x"}) 
public void compute() { 
  for(int y = 0; y < height; y++) { 
    for(int x = 0; x < width; x++) { 
       float Zr = 0.0f; 
       float Zi = 0.0f; 
       float Cr = (x * spacing - 1.5f); 
       float Ci = (y * spacing - 1.0f);

       float ZrN = 0;
       float ZiN = 0;
       int i;

       for(i = 0; (i < iterations) && (ZiN + ZrN <= LIMIT); i++) {
         Zi = 2.0f * Zr * Zi + Ci;
         Zr = ZrN - ZiN + Cr;
         ZiN = Zi * Zi;
         ZrN = Zr * Zr;
       }

       data[y][x] = (short)((i * 255) / iterations);
    }
  }
}

Manual porting on github from https://code.google.com/archive/p/java-gpu/

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
include		include
src/main/java/org/java/gpu		src/main/java/org/java/gpu
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

java-gpu

Compiling

Sample Usage

Example Code

About

Releases

Packages

Languages

License

Fabryprog/java-gpu

Folders and files

Latest commit

History

Repository files navigation

java-gpu

Compiling

Sample Usage

Example Code

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages