Using AVX2 to optimize Google Chrome base64 encoding

The idea started from this article, where it is mentioned on how standard Google Chrome base64 uses 3 lookup tables to optimize the encoding process (see). I wanted to try and optimize base64 encoding that is dependent on lookup tables with AVX2 instructions. This allows custom tables to be used for the base64 encoding process. The main idea is that only 2 bytes per every dword are unpacked with AVX2 instructions, where multiple dwords are stored in one AVX2 256bit register (8 of them). This AVX2 byte unpacking process is inspired from the article and is 'illustrated' on page 12 with figure 3.

Nanobench was used to get the benchmarking results

Results

g++ base64-enc.cpp -O1 -march=native -o base64-enc

relative	ns/op	op/s	err%	total	benchmark
100.0%	15,773.88	63,395.94	0.6%	1.90	`google chrome base64 encode`
162.5%	9,706.89	103,019.64	0.5%	1.16	`avx2 base64 encode`

clang++ base64-enc.cpp -O3 -march=native -o base64-enc

relative	ns/op	op/s	err%	total	benchmark
100.0%	11,428.46	87,500.86	0.5%	1.38	`google chrome base64 encode`
119.4%	9,567.64	104,518.97	0.2%	1.14	`avx2 base64 encode`

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
base64-enc.cpp		base64-enc.cpp
nanobench.cpp		nanobench.cpp
nanobench.h		nanobench.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using AVX2 to optimize Google Chrome base64 encoding

Results

About

Releases

Packages

Languages

INDA22PlusPlus/imou-simd

Folders and files

Latest commit

History

Repository files navigation

Using AVX2 to optimize Google Chrome base64 encoding

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages