Skip to content

v3.0

Compare
Choose a tag to compare
@r-devulap r-devulap released this 06 Oct 20:49
· 396 commits to main since this release
b9f9340

Version 3.0 release contains a new supported method avx512_argselect to compute arg nth_element (also known as argpartition in NumPy). It returns an array of indices that would partition the data array. Highlights of this release include:

  1. v3.0 x86-simd-sort is merged into NumPy main branch. It provides AVX-512 vectorized versions of np.partition and np.argpartition . It speeds up np.partition up by up to 25x for 16-bit, 17x for 32-bit dtypes and about 8x speed up for 64-bit dtypes. Speeds up for np.argpartition are up-to 6.5x.
  2. A slightly modified version of x86-simd-sort has now been merged into OpenJDK . It speeds up sorting 32-bit and 64-bit data by up to 15x and 7x respectively.

What's Changed

Full Changelog: v2.0...v3.0