Euler1D Benchmark

A comparison of various programming languages solving a 1D hydrodynamics problem.

The programs implement a simple finite-difference solver (the Lax-Friedrichs method) for the 1D Euler equations (inviscid, compressible hydrodynamics) in various languages to compare execution speed and syntax.

The benchmark is to solve the standard Sod Shock Tube, with NX = 5000 points and a CFL parameter of 0.9. Disk output is only used initially to verify implementation correctness, but is suppressed for the benchmarks. Ten runs are carried out with each implementation to get an idea of the execution time variability (which is generally found to be small).

Current implementations are:

C/C++
Fortran 90
Java
Python, using native nested lists and for loops
Python, using Numpy arrays and vectorized operations
Julia, native
Julia, leveraging the LoopVectorization.jl library
Rust

The tests were executed on a personal computer with an Intel Core i7-9700F processor running Gentoo Linux with kernel 6.1.31. Compiler/interpreter versions were 12.3.1 for gcc (C/C++ and Fortran 90), OpenJDK 17.0.6 for Java, CPython 3.8.17 and 3.11.4 for Python (with Numpy 1.24.4), 1.8.5 for Julia and 1.69.1 for Rust.

Execution time is measured internally by each program by comparing the platform's wall clock or CPU clock at the start and end of the main program, and printed to the terminal, in seconds, as the sole program output. For Julia+LoopVec the code was run manually within the Julia REPL after executing using LoopVectorization and include("Euler1D_opt.jl"), so as to compile/optimize as much as possible before the tests are run, but the program still measures its own run time internally; the results of doing this are in line with those obtained with the @benchmark macro from BenchmarkTools.jl.

Optimization flags were used where available (e.g. O3 for gcc and opt-level=3 for Rust); see the run_benchmarks.py script for details. The LoopVectorization.jl library was used for the Julia+LoopVec benchmark to vectorize/optimize the main loops (by simply prepending the @turbo macro); thanks to Luis Arcos (LAlbertoA) for the tip. The two Python benchmarks were executed with both Python 3.8 and 3.11; it turns out that 3.11 is substantially faster!

It was my first time writing Julia and Rust code so those implementations might be a bit rough. If you know how to further optimize any of them to make the comparison more fair, please let me know!

Results

The results of the benchmark are presented in the following plots. The bar heights are the average execution time averaged over 10 runs for each case, normalized to the execution time of the fasest benchmark implementation (so far, Fortran). In the first plot the native Python implementations are shown on a separate scale.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
outputs		outputs
.gitignore		.gitignore
Euler1D.cpp		Euler1D.cpp
Euler1D.f90		Euler1D.f90
Euler1D.java		Euler1D.java
Euler1D.jl		Euler1D.jl
Euler1D.py		Euler1D.py
Euler1D.rs		Euler1D.rs
Euler1D_LVec.jl		Euler1D_LVec.jl
Euler1D_alt.py		Euler1D_alt.py
Euler1D_numpy.py		Euler1D_numpy.py
LICENSE		LICENSE
README.md		README.md
benchmark_lin.png		benchmark_lin.png
benchmark_log.png		benchmark_log.png
benchmarks.csv		benchmarks.csv
plot_Euler1D.py		plot_Euler1D.py
plot_benchmarks.py		plot_benchmarks.py
run_benchmarks.py		run_benchmarks.py
test.rs		test.rs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Euler1D Benchmark

Results

Linear Y scale

Logarithmic Y scale

About

Releases

Packages

Languages

License

meithan/Euler1D_Benchmark

Folders and files

Latest commit

History

Repository files navigation

Euler1D Benchmark

Results

Linear Y scale

Logarithmic Y scale

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages