Release v0.101 · LLNL/lbann

============================== Release Notes: v0.101 ==============================

Support for new network structures:

Support for new layers:

Python front-end:

Performance optimizations:

Optimize CUDA kernel for tensor reordering in GRU layer
Enabled TensorCore optimization for GRU layer
GCN and Graph layers also have a faster Dense variant which only utilizes Matrix Multiplication

Model portability & usability:

Added Users Quickstart section to documentation including PyTorch
to LBANN mini-tutorial
Added section on callbacks with detailed instructions on summarize
images callback

Internal features:

I/O & data readers:

Added support for ImageNet data reader to use sample lists
Refactored sample list code to be more flexible and generalize
beyond JAG data reader
Added support for slab-based I/O in HDF5 data reader required by
DistConv implementations of CosmoFlow 3D volumes
Extended slab-based HDF5 data reader to support labels and
reconstruction modes for use with U-Net architecture

Datasets:

Build system and Dependent Libraries:

Bug fixes:

Properly reset data coordinator after each LTFB round
Fixed bug in weights proxy when weights buffer is reallocated
Bugfix for smiles data reader bound checking and simple LTFB data
distribution
Eliminated a race condition observed in VAE ATOM model with SMILES
data reader. Added a barrier after each data store mini-batch
exchange -- avoid race between non-blocking sends and receives and
later GPU kernel communication.

Provide feedback