perf-ninja-rs/labs/memory_bound/swmem_prefetch_1 at master · grahamking/perf-ninja-rs

README.md

To prefetch data in Rust use (nightly only):

core::intrinsics::prefetch_read_data
core::intrinsics::prefetch_write_data

Those two and their *_instruction equivalents become the LLVM intrinsic llvm.prefetch, which I suspect becomes one of the PREFETCHh instructions on x86-64, so you could also call that directly:

core::arch::x86_64::_mm_prefetch
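As a rough illustration of calling `_mm_prefetch` directly on stable Rust, here is a minimal sketch. It is not the lab's code: `sum_indirect`, `prefetch` and `DISTANCE` are hypothetical names, and the prefetch distance of 16 is an assumed starting point that should be tuned with the benchmark.

```rust
/// How many iterations ahead to prefetch (assumed value, tune by benchmarking).
const DISTANCE: usize = 16;

/// Hint that the cache line holding `*p` will be needed soon.
#[cfg(target_arch = "x86_64")]
#[inline(always)]
fn prefetch<T>(p: *const T) {
    use core::arch::x86_64::{_mm_prefetch, _MM_HINT_T0};
    // _MM_HINT_T0 asks for the line to be brought into all cache levels.
    unsafe { _mm_prefetch::<_MM_HINT_T0>(p as *const i8) };
}

/// No-op fallback so the sketch still builds on other targets.
#[cfg(not(target_arch = "x86_64"))]
#[inline(always)]
fn prefetch<T>(_p: *const T) {}

/// Sum the `table` entries selected by `indices`. While processing entry `i`,
/// prefetch the entry that iteration `i + DISTANCE` will read.
pub fn sum_indirect(table: &[u64], indices: &[usize]) -> u64 {
    let mut sum = 0u64;
    for (i, &idx) in indices.iter().enumerate() {
        // Look ahead in the index stream; skip the hint near the end
        // or if the future index is out of range.
        if let Some(slot) = indices.get(i + DISTANCE).and_then(|&f| table.get(f)) {
            prefetch(slot as *const u64);
        }
        sum = sum.wrapping_add(table[idx]);
    }
    sum
}
```

The prefetch is only a hint, so the result is the same with or without it; whether it helps depends on the access pattern and on how far ahead the hint is issued.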

Under Clang 14 this lab is not memory bound; the bottleneck appears to be branch prediction. The Rust version behaves the same way as the C++ version, so it's possible this won't teach you what the lab intends. Add prefetching anyway, and see if the benchmark shows a performance improvement.
