TO DO ...
- Split video scripts based window_size and stride.
- Input a image_list array, get dynamic image of these image.
- Transform datasets to corresponding folder.
- Modify network achitecture based ResNet18, change the number of input node to 8.
- Train scripts.
- Inference scripts.
- Distributed inference scripts.
Hey we go, last step !!!!!